Individual Name Matching Rules

The following are the individual name matching rules:

Table 5-15 Individual Name Matching Rules

Group Code Matching Rule Logic Summary Example Matching Data
I001 Exact name Full name match after name standardization using full name map. -
I002 Exact standardized Full name Given names and family name match exactly. JOSEPH TSANGA

JOSEPH T’SANGA

I003 Original script name exact The original script Name fields match exactly. АЛЕКСАНДР ОСОКИН

АЛЕКСАНДР ОСОКИН

I004 Standardized given name Given names match after name standardization using the Given Name Map. Family name matches exactly. BILL JONES

WILLIAM JONES
I005 Full name The full name matches exactly, after standardization of all name tokens using the Given Name Map. JOHN MIKE SMITH

JOHN MICHAEL SMITH

I006 Full name without titles The full name matches exactly, after standardization of all name tokens using the Given Name Map and removal of titles. DR DOUGLAS BAKER

DOUGLAS BAKER

I007 Abbreviated standardized given name Given names match using a Starts With comparison, after name standardization using the Given Name Map. Family name matches exactly. JOSEPH ABANDA TSANGA

JOSEPH T’SANGA

I008 Given name similar and sounds like Given name matches with an Edit Distance of 1 or 2 after name standardization. At least one of the given names, excluding initials, must match by a 4-character Metaphone key. Family name matches exactly. JOSEPH ABANDA

JOESPH ABANDA

I009 First name similar and sounds like The first given name matches with an Edit Distance of 1 or 2 and with a Character Match Percentage of 66% or more, after given name standardization. At least one of the given names, excluding initials, must match by a 4-character Metaphone key. Family name matches exactly. AMER MOHAMMAD RASHEED AL UBAIDI

AMIR RASHID MOHAMMED AL UBAIDI

I010 Additional given names All name tokens from the given names field with fewest tokens must be present in the other given names field. Family name matches exactly. MOHAMMED HANIF

DIN MOHAMED HANIF

I011 Additional names All name tokens from the full name with fewest tokens must be present in the other full name. At least 2 name tokens must match with the same matching logic; that is, if a name only has one token it is not considered a match. At least 2 name tokens must exist in the Full Name.

Note: Word Match Count may return >1 if a single name matches twice in a longer name string. For example, ‘ABDUL’ matches ‘ABDUL ABDUL’ with a Word Match Count of 2. Matching is order sensitive.

LOTFI RIHANI

LOTFI BEN ABDUL HAMID BEN ALI RIHANI

I012 Original script name in any order All names in the original script name fields match, regardless of order. Καρλος Μολινα

Μολινα Καρλος

I013 Original script name with typos Original script name fields match with an 80%+ Character Match Percentage score. Καρλος Μολινα

Καρλος Μολιννα

I014 All names in any order All names in the full name match (using a Word Edit Distance of 0) after name token standardization, in any order. A single typo (1 character edit) is allowed in each name token. ABDUL JABBER OMARI

OMARI ABDUL JABBER

I015 Abbreviated given name Given names match using a Starts With comparison. Family name is a close Metaphone match. CHRIS HUNT

CHRISTOPHER HUNTER

I016 Abbreviated given name and family name typos Given names match using a Starts With comparison, after name standardization using the Given Name Map. Family name matches with an edit difference of 1-2. At least one of the family name tokens, excluding initials, must match by a 4-character Metaphone key. IBRAHIM ABDUL SALAM MOHAMED BOYASSEER

IBRAHIM BOYASEER

I017 Abbreviated given name without titles and family name with typos The first given name matches with a Starts With match, after name token standardization and stripping titles. Family name matches with an edit difference of 1-2. At least one of the family name tokens, excluding initials, must match by a 4-character Metaphone key. SAHIR BARHAN

DR SAHIR MUSA BERHIN

I018 Original script name in any order with typos All names in the original script name fields match, regardless of order, with each name requiring an 80%+ Character Match Percentage score. Хасан Ченгић

Ченгић Хасcан

I019 First name and full name similar and sounds like The full name matches with a Character Match Percentage of 80% or above, after name token standardization. At least one of the family name tokens, excluding initials, must match by a 4-character Metaphone key. MOHAMMAD HUSAYN MASTASAEED

MOHAMMAD HASSAN MASTASAEED

I020 Given name similar and family names and sounds like The given name matches with an Edit Distance of 1 or 2, after name standardization. The given name matches by 4-character Metaphone key, after name standardization. The family name matches with an Edit Distance of 1 or 2. The family name matches by 4-character Metaphone key. AMER MOHAMMAD RASHEED AL UBAIDI

AMIR RASHID MOHAMMED AL UBEIDI

I021 Abbreviated given name and family name similar The first given name matches with a Starts With match, after name token standardization. The family name matches with an Edit Distance of 1 or 2. The family name matches by 4-character Metaphone key. VIKTOR ANATOLYEVICH BOUT

VICTOR BOOT

I022 Full Name no whitespace Combination of given name and family name without spaces. CHRIS HUNT

CHRISTOPHER HUNTER

I023 Original script name additional names All names in one original script name field must be fully contained within the other field, provided there are at least two names in each field. Миленко Врачар

Миленко Иванович Врачар

I024 Additional names typo tolerant All name tokens from the full name with fewest tokens must be present in the other full name. A character error tolerance of 20% is allowed (that is, one character edit every 5 characters). At least 2 name tokens must match with the same matching logic. If a name contains only one token it is not considered a match according to this rule.

Note: Word Match Count may return >1 if a single name matches twice in a longer name string. For example, ‘ABDUL’ matches ‘ABDUL ABDUL’ with a Word Match Count of 2. Matching is order sensitive.

ABDUL WAHED SHAFIQ

ABDUL WAHAD

I025 Full name contained and multiple names in common The full name matches with a Contains match, after standardization of all name tokens using the Given Name Map. At least 2 name tokens must match in the full name. ABU BAKAR

ABU BAKAR BA’ASYI

I026 Full name characters longer The full name matches with a Longest Common Substring Sum Percentage of 90%+, relating to the longer string, and considering substrings of 5 characters or more in length, after name standardization. MOHAMMED AL GHABRA

ALGHABRA MUHAMAD

RAMATULLAH WAHIDYAR FAQIR MOHAMMAD

WAHIDYAR RAMA TULLAH

I027 Original script name additional names with typos All names in one original script name field must be fully contained within the other field, provided there are at least two names (all of which have an 80%+ Character Match Percentage) in each field. ЮРИ НЕЁЛОВ

Юрий Васильевич Неёлов

I028 Abbreviated first name The first given name matches with a Starts With match, after name token standardization. Family name matches exactly. KHADAF ABUBAKAR JANJALANI

KHADAFFI JANJALANI

I029 Additional names in any order All name tokens from the full name with fewest tokens must be present in the other full name. At least 2 name tokens must match with the same matching logic. If a name contains only one token it is not considered a match according to this rule.

Note: Word Match Count may return >1 if a single name matches twice in a longer name string. For example, ‘ABDUL’ matches ‘ABDUL ABDUL’ with a Word Match Count of 2. Matching is order insensitive.

HA THI NGUYEN

THI HA

I030 Additional names in any order typo tolerant All name tokens from the full name with fewest tokens must be present in the other full name. A character error tolerance of 20% is allowed (that is, one character edit every 5 characters). At least 2 name tokens must match with the same matching logic. If a name contains only one token it is not considered a match according to this rule.

Note: Word Match Count may return >1 if a single name matches twice in a longer name string. For example, ‘ABDUL’ matches ‘ABDUL ABDUL’ with a Word Match Count of 2. Matching is order insensitive.

STEPHENS MARTIN

MARRTIN JOHN STEPHENS
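
The "additional names" rules (I011, I024, I029, and I030) differ only in whether token order matters and whether typos are tolerated. The following Python sketch illustrates that distinction using the example data from the table. It is not the product's implementation: the function names are hypothetical, name standardization is reduced to upper-casing, the 20% tolerance is assumed to mean one edit per five characters of the longer token, and a plain Levenshtein distance stands in for whatever edit-distance comparison the product actually uses.

```python
def edit_distance(a: str, b: str) -> int:
    """Plain Levenshtein distance (insert, delete, substitute)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def tokens_match(a: str, b: str, tolerant: bool) -> bool:
    """Compare two name tokens, optionally allowing typos."""
    if a == b:
        return True
    if not tolerant:
        return False
    # Assumption: "20% tolerance" = one edit per 5 characters of the longer token.
    allowed = max(len(a), len(b)) // 5
    return edit_distance(a, b) <= allowed


def additional_names_match(name_a: str, name_b: str, *,
                           ordered: bool, tolerant: bool = False) -> bool:
    """Hypothetical sketch of rules I011/I024 (ordered) and I029/I030 (any order)."""
    a, b = name_a.upper().split(), name_b.upper().split()
    shorter, longer = (a, b) if len(a) <= len(b) else (b, a)
    if len(shorter) < 2:
        # A single-token name is never a match under these rules.
        return False
    remaining = list(longer)
    for token in shorter:
        idx = next((i for i, cand in enumerate(remaining)
                    if tokens_match(token, cand, tolerant)), None)
        if idx is None:
            return False
        if ordered:
            # Later tokens may only match to the right of this one.
            remaining = remaining[idx + 1:]
        else:
            # Each token in the longer name can be used once.
            remaining = remaining[:idx] + remaining[idx + 1:]
    return True


print(additional_names_match("HA THI NGUYEN", "THI HA", ordered=False))  # True
print(additional_names_match("HA THI NGUYEN", "THI HA", ordered=True))   # False
```

With `ordered=False, tolerant=True`, the I030 example pair (STEPHENS MARTIN and MARRTIN JOHN STEPHENS) matches, because MARRTIN is one edit away from MARTIN. The greedy first-match strategy is a simplification and may miss valid pairings when several tokens are near-duplicates of each other.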