public class OraNormalizer extends Object
OraNormalizer
class is a class used for normalizing strings following the Unicode Standard. Unicode characters may have different canonical formats. Before you can accurately compare characters, you must call the methods in this class to make sure that they are in the same canonical format. For more information regarding Unicode Normalization and the Unicode Standard, please refer to Technical Report 15 at www.unicode.org.Modifier and Type | Field and Description |
---|---|
static int |
CANONICAL_DECOMP
Canonical decompose the string and then canonical sort the string.
|
static int |
COMPATIBILITY_DECOMP
Compatible decompose the string and then canonical sort the string.
|
static int |
NFC
Canonical decomposition then composition.
|
static int |
NFD
Canonical decomposition.
|
static int |
NFKC
Compatibility decomposition then composition canonical sort the string.
|
static int |
NFKD
Compatibility decomposition.
|
static int |
NO_DECOMP
Canonical sort the string.
|
Modifier and Type | Method and Description |
---|---|
void |
canonicalSort(char[] sequence)
This method takes a char[] and performs canonical sort on the sequence.
|
String |
compose(String sequence)
This is the normalization method.
|
String |
decompose(String sequence, int mode)
This is the normalization method.
|
static OraNormalizer |
getInstance()
Returns a shared
OraNormalizer instance to the user. |
String |
normalize(String sequence, int mode)
This is the normalization method.
|
public static final int NO_DECOMP
public static final int CANONICAL_DECOMP
public static final int COMPATIBILITY_DECOMP
public static final int NFD
public static final int NFC
public static final int NFKD
public static final int NFKC
public static OraNormalizer getInstance()
OraNormalizer
instance to the user.OraNormalizer
instancepublic void canonicalSort(char[] sequence)
sequence
- a string to sortpublic String compose(String sequence)
sequence
- a string to composepublic String decompose(String sequence, int mode)
sequence
- a string to composemode
- decomposed mode; options are NO_DECOMPOSITION
, CANONICAL_DECOMPOSITION
, and COMPATIBLE_DECOMPOSITION
public String normalize(String sequence, int mode)
sequence
- a string to composemode
- decomposed mode; options are NFD, NFC, NFKD, NFKC