Individual Name Matching Rules
The following are the individual name matching rules:
Table 5-15 Individual Name Matching Rules
Group Code | Matching Rule | Logic Summary | Example Matching Data |
---|---|---|---|
I001 | Exact name | Full name matches after name standardization using the Full Name Map. | - |
I002 | Exact standardized Full name | Given names and family name match exactly. | JOSEPH TSANGA / JOSEPH T’SANGA |
I003 | Original script name exact | The original script Name fields match exactly. | АЛЕКСАНДР ОСОКИН / АЛЕКСАНДР ОСОКИН |
I004 | Standardized given name | Given names match after name standardization using the Given Name Map. Family name matches exactly. | BILL JONES / WILLIAM JONES |
I005 | Full name | The full name matches exactly, after standardization of all name tokens using the Given Name Map. | JOHN MIKE SMITH / JOHN MICHAEL SMITH |
I006 | Full name without titles | The full name matches exactly, after standardization of all name tokens using the Given Name Map and removal of titles. | DR DOUGLAS BAKER / DOUGLAS BAKER |
I007 | Abbreviated standardized given name | Given names match using a Starts With comparison, after name standardization using the Given Name Map. Family name matches exactly. | JOSEPH ABANDA TSANGA / JOSEPH T’SANGA |
I008 | Given name similar and sounds like | Given name matches with an Edit Distance of 1 or 2 after name standardization. At least one of the given names, excluding initials, must match by a 4-character Metaphone key. Family name matches exactly. | JOSEPH ABANDA / JOESPH ABANDA |
I009 | First name similar and sounds like | The first given name matches with an Edit Distance of 1 or 2 and with a Character Match Percentage of 66% or more, after given name standardization. At least one of the given names, excluding initials, must match by a 4-character Metaphone key. Family name matches exactly. | AMER MOHAMMAD RASHEED AL UBAIDI / AMIR RASHID MOHAMMED AL UBAIDI |
I010 | Additional given names | All name tokens from the given names field with the fewest tokens must be present in the other given names field. Family name matches exactly. | MOHAMMED HANIF DIN MOHAMED HANIF |
I011 | Additional names | All name tokens from the full name with the fewest tokens must be present in the other full name. At least 2 name tokens must match with the same matching logic; that is, if a name only has one token it is not considered a match. At least 2 name tokens must exist in the Full Name. Note: Word Match Count may return >1 if a single name matches twice in a longer name string. For example, ‘ABDUL’ matches ‘ABDUL ABDUL’ with a Word Match Count of 2. Matching is order sensitive. | LOTFI RIHANI / LOTFI BEN ABDUL HAMID BEN ALI RIHANI |
I012 | Original script name in any order | All names in the original script name fields match, regardless of order. | Καρλος Μολινα / Μολινα Καρλος |
I013 | Original script name with typos | Original script name fields match with an 80%+ Character Match Percentage score. | Καρλος Μολινα / Καρλος Μολιννα |
I014 | All names in any order | All names in the full name match (using a Word Edit Distance of 0) after name token standardization, in any order. A single typo (1 character edit) is allowed in each name token. | ABDUL JABBER OMARI / OMARI ABDUL JABBER |
I015 | Abbreviated given name | Given names match using a Starts With comparison. Family name is a close Metaphone match. | CHRIS HUNT / CHRISTOPHER HUNTER |
I016 | Abbreviated given name and family name typos | Given names match using a Starts With comparison, after name standardization using the Given Name Map. Family name matches with an Edit Distance of 1 or 2. At least one of the family name tokens, excluding initials, must match by a 4-character Metaphone key. | IBRAHIM ABDUL SALAM MOHAMED BOYASSEER / IBRAHIM BOYASEER |
I017 | Abbreviated given name without titles and family name with typos | The first given name matches using a Starts With comparison, after name token standardization and stripping of titles. Family name matches with an Edit Distance of 1 or 2. At least one of the family name tokens, excluding initials, must match by a 4-character Metaphone key. | SAHIR BARHAN / DR SAHIR MUSA BERHIN |
I018 | Original script name in any order with typos | All names in the original script name fields match, regardless of order, with each name requiring an 80%+ Character Match Percentage score. | Хасан Ченгић / Ченгић Хасcан |
I019 | First name and full name similar and sounds like | The full name matches with a Character Match Percentage of 80% or above, after name token standardization. At least one of the family name tokens, excluding initials, must match by a 4-character Metaphone key. | MOHAMMAD HUSAYN MASTASAEED / MOHAMMAD HASSAN MASTASAEED |
I020 | Given name similar and family names and sounds like | The given name matches with an Edit Distance of 1 or 2 and by 4-character Metaphone key, after name standardization. The family name matches with an Edit Distance of 1 or 2 and by 4-character Metaphone key. | AMER MOHAMMAD RASHEED AL UBAIDI / AMIR RASHID MOHAMMED AL UBEIDI |
I021 | Abbreviated given name and family name similar | The first given name matches using a Starts With comparison, after name token standardization. The family name matches with an Edit Distance of 1 or 2. The family name matches by 4-character Metaphone key. | VIKTOR ANATOLYEVICH BOUT / VICTOR BOOT |
I022 | Full Name no whitespace | Combination of given name and family name, without spaces. | CHRIS HUNT / CHRISTOPHER HUNTER |
I023 | Original script name additional names | All names in one original script name field must be fully contained within the other field, provided there are at least two names in each field. | Миленко Врачар / Миленко Иванович Врачар |
I024 | Additional names typo tolerant | All name tokens from the full name with the fewest tokens must be present in the other full name. A character error tolerance of 20% is allowed (that is, one character edit every 5 characters). At least 2 name tokens must match with the same matching logic. If a name contains only one token it is not considered a match according to this rule. Note: Word Match Count may return >1 if a single name matches twice in a longer name string. For example, ‘ABDUL’ matches ‘ABDUL ABDUL’ with a Word Match Count of 2. Matching is order sensitive. | ABDUL WAHED SHAFIQ / ABDUL WAHAD |
I025 | Full name contained and multiple names in common | The full name matches with a Contains match, after standardization of all name tokens using the Given Name Map. At least 2 name tokens must match in the full name. | ABU BAKAR / ABU BAKAR BA’ASYI |
I026 | Full name characters longer | The full name matches with a Longest Common Substring Sum Percentage of 90%+, relating to the longer string, and considering substrings of 5 characters or more in length, after name standardization. | MOHAMMED AL GHABRA ALGHABRA MUHAMAD RAMATULLAH WAHIDYAR FAQIR MOHAMMAD WAHIDYAR RAMA TULLAH |
I027 | Original script name additional names with typos | All names in one original script name field must be contained within the other field, with each name matching at an 80%+ Character Match Percentage, provided there are at least two names in each field. | ЮРИ НЕЁЛОВ / Юрий Васильевич Неёлов |
I028 | Abbreviated first name | The first given name matches using a Starts With comparison, after name token standardization. Family name matches exactly. | KHADAF ABUBAKAR JANJALANI / KHADAFFI JANJALANI |
I029 | Additional names in any order | All name tokens from the full name with the fewest tokens must be present in the other full name. At least 2 name tokens must match with the same matching logic. If a name contains only one token it is not considered a match according to this rule. Note: Word Match Count may return >1 if a single name matches twice in a longer name string. For example, ‘ABDUL’ matches ‘ABDUL ABDUL’ with a Word Match Count of 2. Matching is order insensitive. | HA THI NGUYEN / THI HA |
I030 | Additional names in any order typo tolerant | All name tokens from the full name with the fewest tokens must be present in the other full name. A character error tolerance of 20% is allowed (that is, one character edit every 5 characters). At least 2 name tokens must match with the same matching logic. If a name contains only one token it is not considered a match according to this rule. Note: Word Match Count may return >1 if a single name matches twice in a longer name string. For example, ‘ABDUL’ matches ‘ABDUL ABDUL’ with a Word Match Count of 2. Matching is order insensitive. | STEPHENS MARTIN / MARRTIN JOHN STEPHENS |
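
The order-insensitive "additional names" rules (I029 and I030) can be illustrated with a minimal Python sketch. This is not the product's implementation. The function names and the `chars_per_edit` parameter are hypothetical, plain Levenshtein distance stands in for the product's edit distance, and the 20% tolerance is assumed to be measured against the longer of the two tokens being compared. Name standardization and the Word Match Count comparison are not modelled.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum number of single-character
    insertions, deletions, and substitutions turning a into b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or keep
        prev = curr
    return prev[-1]


def tokens_match(a: str, b: str, chars_per_edit: int = 0) -> bool:
    """Exact match when chars_per_edit is 0. Otherwise allow one edit per
    chars_per_edit characters of the longer token (assumption)."""
    longest = max(len(a), len(b))
    allowed = longest // chars_per_edit if chars_per_edit else 0
    return edit_distance(a, b) <= allowed


def additional_names_any_order(name_a: str, name_b: str,
                               chars_per_edit: int = 0) -> bool:
    """Sketch of I029 (chars_per_edit=0) and I030 (chars_per_edit=5)."""
    tokens_a = name_a.upper().split()
    tokens_b = name_b.upper().split()
    shorter, longer = sorted((tokens_a, tokens_b), key=len)
    # A name with a single token is never a match under these rules.
    if len(shorter) < 2:
        return False
    # Every token of the shorter name must appear somewhere in the longer one.
    return all(
        any(tokens_match(tok, cand, chars_per_edit) for cand in longer)
        for tok in shorter
    )


print(additional_names_any_order("HA THI NGUYEN", "THI HA"))            # True
print(additional_names_any_order("STEPHENS MARTIN",
                                 "MARRTIN JOHN STEPHENS",
                                 chars_per_edit=5))                     # True
```

With `chars_per_edit=5`, MARTIN and MARRTIN differ by one edit across seven characters, which is within tolerance. With the default of 0 the same pair fails, which is the difference between I029 and I030 in this sketch. The order-sensitive variants (I011 and I024) are left out because the table does not spell out how token order is checked.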