6.3.2 Individual Name Matching Rules
Table 6-17 Individual Name Matching Rules
| Group Code | Matching Rule | Logic Summary | Example Matching Data | Example Matching Data |
|---|---|---|---|---|
| I001 | Exact name | Full name match after name standardization using full name map | - | - |
| I002 | Exact standardized Full name | Given names and family name match exactly. | - | - |
| JOSEPH JOSEPH | TSANGA T’SANGA | |||
| I003 | Original script name exact | The original script Name fields match exactly. | - | - |
| АЛЕКСАНДР ОСОКИН | АЛЕКСАНДР ОСОКИН | |||
| I004 | Standardized given name | Given names match aftername standardization using Given name map. Family name matches exactly. | - | - |
| BILL | JONES | |||
| WILLIAM | JONES | |||
| I005 | Full name | The full name matches exactly, after standardization of all name tokens using the Given Name Map. | - | |
| JOHN MIKE SMITH | ||||
| JOHN MICHAEL SMITH | ||||
| I006 | Full name without titles | The full name matches exactly, after standardization of all name tokens using the Given Name Map and removal of titles. | - | |
| DR DOUGLAS BAKER | ||||
| DOUGLAS BAKER | ||||
| I007 | Abbreviated standardized given name |
Given names match using a Starts With comparison, after name standardization using the Given Name Map. Family name matches exactly. |
- | - |
| JOSEPH ABANDA | TSANGA | |||
| JOSEPH | T’SANGA | |||
| I008 | Given name similar and sounds like | Given name matches with an Edit Distance of 1 or 2 after name standardization. At least one of the given names, excluding initials, must match by a 4-character Metaphone key. Family name matches exactly | - | - |
| JOSEPH | ABANDA | |||
| JOESPH | ABANDA | |||
| I009 | First name similar and soundslike | The first given name matches with an Edit Distance of 1 or 2 and with a Character Match Percentage of 66% or more, after given name standardization. At least one of the given names, excluding initials, must match by a 4-character Metaphone key. Family name matches exactly. | - | - |
| AMER MOHAMMAD RASHEED | AL UBAIDI | |||
| AMIR RASHID MOHAMMED | AL UBAIDI | |||
| I010 | Additional given names |
All name tokens from the given names field with fewest tokens must be present in the other given names field. Family name matches exactly . |
- | - |
| MOHAMMED | HANIF | |||
| DIN MOHAMED | HANIF | |||
| I011 | Additional names |
All name tokens from the full name with fewest tokens must be present in the other full name. At least 2 name tokens must match with the same matching logic; that is, if a name only has one token it is not considered a match. At least 2 name tokens must exist in the Full Name. Word Match Count may return >1 if a single name matches twice in a longer name string. For example, ‘ABDUL’ matches ‘ABDUL ABDUL’ with a Word Match Count of 2. Matching is order sensitive. |
- | |
| LOTFI RIHANI | ||||
| LOTFI BEN ABDUL HAMID BEN ALI RIHANI | ||||
| I012 | Original script name in any order | All names in the original script name fields match, regardless of order. | - | - |
| Καρλος Μολινα | Μολινα Καρλος | |||
| I013 | Original script name with typos | Original script name fields match with an 80%+ Character Match Percentage score. | - | - |
| Καρλος Μολινα | Καρλος Μολιννα | |||
| I014 | All names in any order |
All names in the full name match (using a Word Edit Distance of 0) after name token standardization, in any order. A single typo (1 character edit) is allowed in each name token. |
- | |
| ABDUL JABBER OMARI | ||||
| OMARI ABDUL JABBER | ||||
| I015 | Abbreviated given name | Given names match using a Starts With comparison. Family name is a close metaphone match. | - | - |
| CHRIS | HUNT | |||
| CHRISTOPHER | HUNTER | |||
| I016 | Abbreviated given name and family name typos |
Given names match using a Starts With comparison, after name standardization using Given Name Map. Family name matches with an edit difference of 1-2. At least one of the family name tokens, excluding initials must match by a 4-character Metaphone key. |
- | - |
| IBRAHIM ABDUL SALAM | MOHAMED BOYASSEER | |||
| IBRAHIM | BOYASEER | |||
| I017 | Abbreviated given name without titles and family name with typos | The first given name matches with a Starts With match, after name token standardization and stripping titles. Family name matches with an edit difference of 1-2. At least one of the family name tokens, excluding initials, must match by a 4- character Metaphone key. | - | - |
| SAHIR | BARHAN | |||
| DR SAHIR MUSA | BERHIN | |||
| I018 | Original script name in any order with typos | All names in the original script name fields match, regardless of order, with each name requiring an 80%+ Character Match Percentage score. | - | - |
| Хасан Ченгић | Ченгић Хасcан | |||
| I019 | First name and full name similar and sounds like | The full name matches with a Character Match Percentage of 80% or above, after name token standardization. At least one of the family name tokens, excluding initials, must match by a 4- character Metaphone key. | - | - |
| MOHAMMAD HUSAYN | MASTASAEED | |||
| MOHAMMAD HASSAN | MASTASAEED | |||
| I020 | Given name similar and family names and sounds like | The given name matches with an Edit Distance of 1 or 2, after name standardization. The given name matches by 4- character Metaphone key, after name standardization. The family name matches with an Edit Distance of 1-2. The family name matches by 4-character Metaphone key. | - | - |
| AMER MOHAMMAD RASHEED | AL UBAIDI | |||
| AMIR RASHID MOHAMMED | AL UBEIDI | |||
| I021 | Abbreviated given name and family name similar | The first given name matches with a Starts With match, after name token standardization. The family name matches with an Edit Distance of 1 or 2. The family name matches by 4-character Metaphone key. | - | - |
| VIKTOR ANATOLYEVICH | BOUT | |||
| VICTOR | BOOT | |||
| I022 | Full Name no whitespace | Combination of Given name an Family name without spaces | CHRIS CHRISTOPHER | HUNT HUNTER |
| I023 | Original script name additional names | All names in one original script name field must be fully contained within the other field, provided there are at least two names in each field. | - | - |
| Миленко Врачар | Миленко Иванович Врачар | |||
| I024 | Additional names typo tolerant |
All name tokens from the full name with fewest tokens must be present in the other full name.A character error tolerance of 20% is allowed (that is, one character edit every 5 characters). At least 2 name tokens must match with the same matching logic. If a name contains only one token it is not considered a match according to this rule. NOTE: Word Match Countmay return >1 if a single name matches twice in a longer name string. For example, ‘ABDUL’ matches ‘ABDUL ABDUL’ with a Word Match Count of 2. Matching is order sensitive. |
- | - |
| ABDUL WAHED SHAFIQ | - | |||
| ABDUL WAHAD | - | |||
| I025 | Full name contained and multiple names in common | The full name matches with a Contains match, after standardization of all name tokens using the Given Name Map. At least 2 name tokens must match in the full name. | - | - - |
| ABU BAKAR | - | |||
| ABU BAKAR BA’ASYI | - | |||
| I026 | Full name | The full name | - | - |
| - | characters | matches with a | ||
| MOHAMMED AL GHABRA | - | |||
| - | longer | Longest | ||
| - | - | Common | ALGHABRA MUHAMAD | - |
| - | - | Substring Sum Percentage of 90%+, relating to | ||
| RAMATULLAH WAHIDYAR FAQIR MOHAMMAD | - | |||
| - | - | the longer string, and considering | ||
| WAHIDYAR RAMA TULLAH | - | |||
| - | - | substrings of 5 | - | - |
| - | - | characters or | - | - |
| - | - | more in length, | - | - |
| - | - | after name | - | - |
| - | - | standardization. | - | - |
| I027 | Original script | All names in one | - | - |
| - | name | original script | ||
| - | additional names with typos | name field must be fully contained within the other field, provided | ||
| ЮРИ НЕЁЛОВ | Юрий Васильевич Неёлов | |||
| - | - | there are at least | - | - |
| - | - | two names (all of | - | - |
| - | - | which have an | - | - |
| - | - | 80%+ Character | - | - |
| - | - | Match | - | - |
| - | - | Percentage) in | - | - |
| - | - | each field. | - | - |
| I028 | Abbreviated | The first given | - | - |
| - | first name | name matches with a Starts With match, after | ||
| KHADAF ABUBAKAR | JANJALANI | |||
| - | - | - | - | |
|
name token standardization. |
KHADAFFI | JANJALANI | ||
| - | - | Family name | - | - |
| - | - | matches exactly. | - | - |
| I029 | Additional names in any order |
All name tokens from the full name withfewest tokens must be present in the other full name. At least 2 name tokens must match with the same matching logic. If a name contains only one token it is not considered a match according to this rule. NOTE: Word Match Countmay return >1 if a single name matches twice in a longer name string. For example, ‘ABDUL’ matches ‘ABDUL ABDUL’ with a Word Match Count of 2. Matching isorder insensitive. |
- | - |
| HA THI NGUYEN | - | |||
| THI HA | - | |||
| I030 | Additional names in any order typo tolerant |
All name tokens from the full name with fewest tokens must be present in the other full name.A character error tolerance of 20% is allowed (that is, one character edit every 5 characters). At least 2 name tokens must match with the same matching logic. If a name contains only one token it is not considered a match according to this rule. NOTE: Word Match Countmay return >1 if a single name matches twice in a longer name string. For example, ‘ABDUL’ matches ‘ABDUL ABDUL’ with a Word Match Count of 2. Matching isorder insensitive. |
- | - |
| STEPHENS MARTIN | - | |||
| MARRTIN JOHN STEPHENS | - | |||