Individual Given Names Cluster (dnClusterGivenNames)
The Given Names cluster provides a further backup to the remaining clusters, especially to deal with cases where names are not necessarily well-structured into family and given names.
Note:
Depending on the quality and culture of the name information, this cluster will often not be required. You can test the number of additional alerts identified by the cluster by running matching with this cluster disabled, and then running with it enabled. Comparing the new relationships against the old will highlight the relationships identified by using this cluster.The default logic of the cluster builder is as follows:
- Split the normalized full name into several name tokens, using space as a
delimiter.
Note:
Many other punctuation and noise characters are normalized to spaces before generating the cluster. For more information see Name Normalization. - Standardize the normalized given names before clustering. This ensures, for example, that names such as 'William' and 'Bill' will be clustered together, although their raw Metaphone values are not the same. A space delimiter is used to split the name before standardizing.
- Apply the Metaphone transformation to the whole of the given names value after token standardization, outputting a key with a length of up to 4 characters.
The following table provides some examples.
Table 5-6 Given Names Cluster
dnGivenNames | Metaphone Values | dnClusterGivenNames |
---|---|---|
XIAO JIAN | SJN | SJN |
ZHONG | JNK | JNK |
MOHAMMED SANI | MHMT | MHMT |
JOSEPH TSANGA | JSFT | JSFT |
ABD AL WAHAB | APTL | APTL |
SULIMAN HAMD SULEIMAN | SLMN | SLMN |
AL BUTHE | ALP0 | ALP0 |
REGINALD B | RJNL | RJNL |
STEPHEN JEQE | STFN | STFN |
S J | SJ | SJ |
STEPHEN JEKE | STFN | STFN |