Individual Full Name Metaphone Pairs Cluster (dnClusterFullNameMeta)
The Full Name Metaphone Pairs cluster uses the normalized full name for the individual to generate a cluster key for every pair of names within the full name.
- Split the normalized full name into several name tokens, using space as a
delimiter.
Note:
Many other punctuation and noise characters are normalized to spaces before generating the cluster. For further information see Name Normalization. - Sort the name tokens alphabetically.
- Apply the Metaphone transformation (the standard double-metaphone algorithm) to each name token, outputting a key with a length of up to three characters.
- Concatenate the Metaphone values, generating a final key value for each distinct pair of tokens.
- Deduplicate the list of keys.
Table 5-5 Full Name Metaphone Pairs Cluster
dnFullName | Name Tokens and Metaphone values | Distinct Cluster Keys | dnClusterFullNameMeta |
---|---|---|---|
XIAO JIAN ZHONG | JIAN | JN
XIAO | S ZHONG| JNK |
JNS JNJNK SJNK | JNS| JNJNK |SJNK |
ZHONG XIAOJIAN | XIAOJIAN | SJN
ZHONG | JNK |
SJNJNK | SJNJNK |
MOHAMMED SANI ABACHE | ABACHE | ABX
MOHAMMED | MHM T SANI | SN |
APXMHM APXSN MHMSN | APXMHM| APXSN| MHMSN |
JOSEPH TSANGA ABANDA | ABANDA | APNT
JOSEPH | JSF TSANGA | TSNK |
APNJSF APNTSN JSFTSN | APNJSF| APNTSN| JSFTSN |
ABD AL WAHAB ABD AL HAFIZ | ABD | APT
ABD | APT AL | AL AL | AL HAFIZ | HFS WAHAB | AHP |
APTAPT APTAL APTHFS APTAHP ALAL ALHFS ALAHP HFSAHP | APTAPT| APTAL| APTHFS | APTAHP| ALAL| ALHFS | ALAHP| HFSAHP |
SULIMAN HAMD SULEIMAN AL BUTHE | AL | AL
BUTHE | P0 HAMAD | HMT SULEIMAN | SLMN SULIMAN | SLMN |
ALP0 ALHMT ALSLM P0HMT P0SLM HMTSLM SLMSLM | ALP0| ALHMT| ALSLM| P0HMT| P0SLM| HMTSLM | SLMSLM |
AL BUTHE SOLEIMAN HAMAD | AL | AL
BUTHE | P0 HAMAD | HMT SOLEIMAN | SLMN |
ALP0 ALHMT ALSLM P0HMT P0SLM HMTSLM | ALP0| ALHMT| ALSLM| P0HMT| P0SLM| HMTSLM |
REGINALD B GOODRIDGE | B | P
GOODRIDGE | KTRJ REGINALD | RJNLT |
KTRRJN
NOTE: Initials are ignored by default when generating cluster key |
KTRRJN |
REGINALD B SR GOODRICH | B | P
GOODRIDGE | KTRJ REGINALD | RJNLT SR | SR |
KTRRJN KTRSR RJNSR
NOTE: Initials are ignored by default when generating cluster keys |
KTRRJN| KTRSR| RJNSR |
STEPHEN JEQE NKOMO | JEQE | JK
NKOMO | NKM STEPHEN | STFN |
JKNKM JKSTF NKMSTF | JKNKM| JKSTF| NKMSTF |
S J NKOMO | J | J
NKOMO | NKM S | S |
NKM
Initials are ignored by default when generating cluster keys |
NKM |
STEPHEN JEKE N KOMO | JEKE | JK
KOMO | KM N |N STEPHEN | STFN |
JKKM JKSTF KMSTF | JKKM| JKSTF| KMSTF |