Rogue Wave banner
Previous fileTop of DocumentContentsIndex pageNext file
Internationalization Module User's Guide
Rogue Wave web site:  Home Page  |  Main Documentation Page

5.1 Introduction

A piece of text can sometimes be represented by more than one sequence of Unicode characters. This is because the Unicode standard recognizes two types of character equivalence, in which different Unicode code points or sequences of code points are considered equivalent forms of the same information. The two types of character equivalence give rise to four normalization forms. Each normalization form produces a unique representation for a given string.

Normalization is the process of converting Unicode text to a unique representation. Normalization facilitates sorting, searching, conversion, and data exchange. The W3C recommends that all data be normalized as early as possible.

In the Internationalization Module, class RWUNormalizer normalizes Unicode text. This chapter describes how to use RWUNormalizer to:



Previous fileTop of DocumentContentsIndex pageNext file

Copyright © Rogue Wave Software, Inc. All Rights Reserved.

The Rogue Wave name and logo, and SourcePro, are registered trademarks of Rogue Wave Software. All other trademarks are the property of their respective owners.
Contact Rogue Wave about documentation or support issues.