The Sun Match Engine supports several phonetic encoders, which are defined in the PhoneticEncodersConfig section of the Match Field file. Any encoders specified in the phonetic encoding structures (see Phonetic Encoding Structures) must also be defined in the PhoneticEncodersConfig section. The classes for the encoders are listed in Table 7.
Soundex - This algorithm is an industry standard for phonetically encoding first names.
French Soundex - This algorithm is based on the Soundex algorithm, but is customized for French characters and names.
Refined Soundex - This algorithm is similar to the Soundex algorithm, but is optimized for spell checking.
NYSIIS - This algorithm is an industry standard for phonetically encoding last names.
Metaphone - This algorithm is similar to the Soundex algorithm, but is better at identifying words that sound similar. This encoder is limited to encoding a single word in ASCII format containing only characters in the A - Z range. No punctuation or numbers can be in the input string.
Double Metaphone - This algorithm is an improvement on the Metaphone algorithm, at times returning two encodings for a word that could have multiple pronunciations.
Encoder |
Java Class |
---|---|
Soundex |
com.stc.eindex.phonetic.impl.Soundex |
NYSIIS |
com.stc.eindex.phonetic.impl.NYSIIS |
Metaphone |
com.stc.eindex.phonetic.impl.Metaphone |
Double Metaphone |
com.stc.eindex.phonetic.impl.DoubleMetaphone |
Refined Soundex |
com.stc.eindex.phonetic.impl.RefinedSoundex |
French Soundex |
com.stc.eindex.phonetic.impl.SoundexFR |