Individual Initials (dnClusterInitials)
The First Initial Last Name cluster provides a clustering method to group together names that share the same first name initial and last name, and allows some variation for transposed names.
The default logic of the cluster builder is as follows:
- Split the normalized given names into several name tokens, using a space character as the delimiter.
- Split the normalized family name into several name tokens, using a space character as the delimiter.
- Generate the cluster key value as follows:
- If there are two or more characters in the last token of the family name, then concatenate the first character of the given name with the last token of the family name.
- If the last token of the family name is a single initial, then concatenate that character with the first token of the given name
- Trim the cluster key to a maximum of 12 characters.
The following table provides some examples.
Table 5-9 First and Last Name Cluster
dnGivenNames | dnFamilyName | dnClusterFirstLast |
---|---|---|
MARTIN | JONES | MJONES |
MARTIN PETER | JONES | MJONES |
MARTIN | MORGAN JONES | MJONES |
JONES | M | MJONES |