3.3.1 Entity Matching Rules
Note:
Wherever the term 'word' is used below, this means that there is a space-delimited token in the prepared names.Table 3-9 Entity Matching Rules
Group Code | Name Matching Rule | Summary of Rule Logic | Example Matching Data |
---|---|---|---|
V010 | Vessel part standardized name exact | The part-standardized entity name matches the name of a listed vessel exactly. | DYNASTY |
- | - | - | DYNASTY |
V020 | Vessel name exact | The entity name matches the name of a listed vessel after number cardinal and ordinal standardization. | 4THOCEAN |
- | - | - | FOURTHOCEAN |
V030 | Vessel part standardized name with typos | The part-standardized entity name matches the name of a listed vessel with a Character Match Percentage of 80-99%. | RAHIM |
- | - | - | RAHIM3 |
V040 | Vessel name with typos | The entity names match with a Character Match Percentage of 80-99% after number cardinal and ordinal standardization. | RAHUM3 |
- | - | - | TRAHIMTHREE |
E010 | Part-standardized name exact | The part-standardized entity name matches a listed entity name exactly. |
HUMANAPPEAL INTERNATIONAL |
- | - | - | HUMANAPPEAL INTERNATIONAL |
E020 | Name exact | The entity names match exactly after number cardinal and ordinal standardization. | NOVEMBER17 |
- | - | - | NOVEMBER SEVENTEEN |
E030 | Original script name exact | The original script names match exactly. | НИАЭПОАО |
- | - | - | НИАЭПОАО |
E040 | Name without suffixes exact | The entity names match exactly after number cardinal and ordinal standardization, and after common company prefixes and suffixes are removed. | CAPITAL DIRECT LTD |
- | - | - | CAPITAL DIRECT AG |
E050 | Name without business words similar and sounds like | The entity names match with a Word Match Percentage of 80% after number cardinal and ordinal standardization, and after common company prefixes, suffixes and other words are removed. The first word of each name has the same 4-character Metaphone key. | PARAGON UK |
- | - | - |
PARAGON INVESTMENT CORPORATION |
E060 | Name without business words exact | The entity names match exactly after number cardinal and ordinal standardization, and after common company prefixes, suffixes and other words are removed. |
LIFEHEALTHCARE GROUPHOLDINGS LTD |
- | - | - | LIFE HEALTH CARE INC |
E070 | Name without business words has all words out- of order | All remaining words in each entity name match exactly, but in any order, after number cardinal and ordinal standardization, and after common company prefixes, suffixes and other words are removed. | EDUCATION FOR HEALTH |
- | - | - | HEALTH EDUCATION SERVICES |
E080 | Name without suffixes 'Starts With' and multiple names in common | The entity names are a 'Starts With' match after number cardinal and ordinal standardization, and after common company prefixes and suffixes are removed. There are at least two significant words (not common business words) in common between the two names. The listed name is not an acronym alias of a longer primary entity name. | BAE SYSTEMS (LANCASTER HOUSE) LIMITED |
- | - | - | BAESYSTEMS PLC |
E090 | Name without business words has all words with typos | All remaining words in each entity name match with a Character Match Percentage of 80 or more, after number cardinal and or dinalstandardization, and after common company prefixes, suffixes and other words are removed. | GERBERA ASSOCIATES LTD |
- | - | - | BERBERA |
E100 | Original script name in any order | All words in the Original Script Names match exactly, in any order. | ОАОНИАЭП |
- | - | - | НИАЭПОАО |
E110 | Original script name with typos | The Original Script Names match with a Character Match Percentage of 80% or more. | Επαναστατική Αριστερά |
- | - | - | Επανασταική Αριστερά |
E120 | Name without business words with typos, and sounds like | The entity names match with a Character Match Percentage of 80 ore more after number cardinal and ordinal standardization, and after common company prefixes, suffixes and other words are removed. The first word of each name has the same 4-character Metaphone key and the first three letters of each name are the same. | GOLDSTREAM PROPERTIESLTD |
- | - | - | GOLDSTEIN PROPERTIESINC |
E130 | Name without suffixes contains,similar and multiple names in common | The entity names are a 'Contains' match and the Word Edit Distance is no more than one between the names (where each word matches with a Character Match Percentage of 80 or more), after number cardinal and ordinal standardization, and after common company prefixes and suffixes are removed. There are at least two significant words (not common business words) in common between the two names. | HAMPSHIRE HERITAGE DEVELOPMENTS LTD |
- | - | - | HERITAGE DEVELOPMENT CORPORATION |
E140 | Name has additional words, sounds like and multiple names in common | All words in the shorter entity name exist in the longer entity name (in order) after number cardinal and ordinal standardization, and after common company prefixes and suffixes are removed. There are at least two significant words (not common business words) in common between the two names. The list name is not an acronym alias of a longer primary entity name. | MOSCOWCITY CENTER PLC |
- | - | - | MOSCOWCENTER |
E150 | Name without business words contains, sounds like and multiple names in common | The entity name is a 'Contains' match with a listed entity name, after number cardinal and ordinal standardization, and after common company prefixes, suffixes and other words are removed. There are at least two significant words (not common business words) in common between the two names. The first word of each name has the same 4-character Metaphone key. | HI-TECH RECRUITMENTLTD |
- | - | - | HITECH GROUP |
E160 | Original script name in any order with typos | All words in the original script name match with a Character Match Percentage of 80 or more, in any order. | ΜαύροςΣεπτέµβρης |
- | - | - | Σεπτέµβρης Μαύροςς |
E170 | Name without business words has most words out-oforder Name Matching Rule | The entity names match (in any order) with a Word Match Percentage of between 75 and 99, after number cardinaland ordinal standardization, and after common company prefixes, suffixes and other words are removed. The list name is not an acronym alias of a longer primary entity name. Summary of Rule Logic | BACKTO HEALTH CLINICSLIMITED |
- | - | - | BACKTO HEALTH CHIROPRACTIC |
- | - | - | Example Matching Data |
E180 | Name without business words, similar, sounds like, with multiple names and a residual token in common | All words in the shorter entity name exist in the longer entity name (in order) after number cardinal and ordinal standardization, and after common company prefixes, suffixes and other words are removed. There are at least two significant words (not common business words) in common between the two names, and at least one of these is not a word in the English dictionary or a very common word in Watchlist name data. The list name is not an acronym alias of a longer primary entity name. | CHARLESASH UK LTD |
- | - | - | CHARLES F ASH CONSTRUCTIONCO INC |
E190 | Name without business words, similar with typos, sounds like, with multiple names and residual token in common. |
All words in the shorter entity name match with a Character Match Percentage of 80 or more in the longer entity name (in order) after number cardinal and ordinal standardization, and after common company prefixes, suffixes and other words are removed. There are at least two significant words (not common business words) that match with a Character Match Percentage of 80 or more, and at least one of these is not a word in the English dictionary or a very common word in Watchlist name data. The list name is not an acronym alias of a longer primary entity name. The group name differs from the rule name. |
CLARKSHOME BAKERY LTD |
- | - | - | CLARKHOMES INC |
E200 | Name without business words, similar, sounds like,and residual token in common | All words in the shorter entity name match in the longer entity name (in order) after number cardinal and ordinal standardization, and after common company prefixes, suffixes and other words are removed. The names match with a Word Match Percentage of 50 or more when common business words are not stripped. There are at least two significant words (not common business words) that match. The first word of each name has the same 4- character Metaphone key. The list name is not an acronym alias of a longer primary entity name. | AMERICAN MILITARY SUPPLY |
- | - | - | AMERICANSUPPLY CO |
E210 | Name has additional words tolerant, sounds like and multiple names in common |
All words in the shorter entity name match in the longer entity name (in order) with a Character Match Percentage of 80 or more after number cardinal and ordinal standardization. There are at least two significant words (not common business words) in common between the two names. The list name is not an acronym alias of a longer primary entity name. |
GENERALATOMICS |
- | - | - | GENERAL BUREAU OF ATOMIC ENERGY GBAE |
E220 | Name without suffixes contains, similar and residual token in common | The entity names are a 'Contains' match and the Word Edit Distance is no more than one between the names (where each word matches with a Character Match Percentage of 80 or more), after number cardinal and ordinal standardization, and after common company prefixes and suffixes are removed. There is at least one significant word in common (not a common business word, a word in the English dictionary or a very common word in Watchlist name data). | ACCLAIMACM LTD |
- | - | - | ACM |
E230 | Name without suffixes 'Starts With' and residual token in common | The entity names are a 'Starts With' match after number cardinal and ordinal standardization, and after common company prefixes and suffixes are removed. There is at least one significant word in common (not a common business word, a word in the English dictionary or a very common word in Watchlist name data). The listed name is not an acronym alias of a longer primary entity name. |
ENRONMETALS BROKERSLTD |
- | - | - | ENRONCORP |
E240 | Name without suffixes 'Starts With' and substring in common | The entity names are a 'Starts With' match, and there is a common substring at least 8 characters in length, after number cardinal and ordinal standardization, and after common company prefixes and suffixes are removed. The listed name is not an acronym alias of a longer primary entity name. | ACCURATE SECTION BENDERSLTD |
- | - | - | ACCURATE |
E250 | Name without suffixes contains, residual token in common and significant overlap | The entity names are a 'Contains' match and the Word Match Percentage is 50 or more, after number cardinal and ordinal standardization, and after common company prefixes and suffixes are removed. There is at least one significant word in common (not a common business word, a word in the English dictionary or a very common word in Watchlist name data). | NONEMERGENCY TRANSPORTINC |
- | - | - | ACTION NON EMERGENCY TRANSPORTATION |
E260 | Name without common tokens exact, and multiple residual tokens in common | The entity names match exactly, with at least two words matching, after number cardinal and ordinal standardization, and after common company prefixes, suffixes, and other words, and all English dictionary and common Watchlist name words are removed. | LIFECARE CENTER PUNTA GORDA |
- | - | - | PORTOF PUNTA GORDA |
E270 | Original script name has additional names | All words in the shorter original script name match in the longer original script name (in order), and there are at least two matching words. | Въоръжена ислямскагрупа |
- | - | - | Въоръжена група |
E280 | Name without suffixes contains, multiple names in common and significant overlap | The entity names are a 'Contains' match and the Word Match Percentage is 50 or more, after number cardinal and ordinal standardization, and after common company prefixes and suffixes are removed. There is at least two significant words (not common business words) that match with a Character Match Percentage of 80 or more. | CITYTRANS LTD |
- | - | - | CAPITAL CITY TRANS SERVINC |
E290 | Name without business words similar and full name sounds like | The entity names match with a Character Match Percentage of between 80 and 99 after number cardinal and ordinal standardization, and after common company prefixes, suffixes and other words are removed. The names share the same metaphone key after number cardinal and ordinal standardization. | IBERIAAIRLINES |
- | - | - | IBERAIRLINES |
E300 | Name without business words similar with typos, sounds like and significant overlap | All words in the shorter entity name match with a Character Match Percentage of 80 or more in the longer entity name (in order) after number cardinal and ordinal standardization, and after common company prefixes, suffixes and other words are removed. The names match with a Word Match Percentage of 50 or more when common business words are not stripped. There are at least two significant words (not common business words) that match with a Character Match Percentage of 80 or more. The first word of each name has the same 4character Metaphone key. The list name is not an acronym alias of a longer primary entity name. | MEDCLINIC LTD |
- | - | - | MED AMERICA CLINICSINC |
E310 | Name has additional words, sounds like and residual token in common | All words in the shorter entity name exist in the
longer entity name (in order) after number cardinal and ordinal
standardization.
There is at least one significant word (not a common business word, an English dictionary word or a word or a common Watchlist name word) in common between the two names. The list name is not an acronym alias of a longer primary entity name. |
DJCASE AND ASSOCIATES INC |
- | - | - | DJAND ASSOCIATES INC |
E320 |
Name has additional words with typos, sounds like and residual token in common |
All words in the shorter entity name match with a Character Match Percentage of 80 or more in the longer entity name (in order) after number cardinal and ordinal standardization. There is at least one significant word (not a common business word, an English dictionary word or a word or a common Watchlist name word) that matches with a Character Match Percentage of 80 or more. The list name is not an acronym alias of a longer primary entity name. |
GARLOCK |
- | - | - | GARLICK HELICOPTERS INC |
E330 | Name has additional words, sounds like and substring in common |
All words in the shorter entity name exist in the longer entity name (in order) after number cardinal and ordinal standardization. There is a common substring of at least 8 characters in length between the two names after number cardinal and ordinal standardization, and after common company prefixes, suffixes and other words are removed. The list name is not an acronym alias of a longer primary entity name. |
NATIONWIDE SECRETARIAL SERVICESLTD |
- | - | - | NATIONWIDE SERVICES |
E340 | Name without business words, similar, sounds like and multiple names in common | All words in the shorter entity name match in the longer entity name (in order) after number cardinal and ordinal standardization, and after common company prefixes, suffixes and other words are removed. There are at least two significant words (not common business words) that match. The first word of each name has the same 4- character Metaphone key. The list name is not an acronym alias of a longer primary entity name. | CENTRAL OKLAHOMA FAMILY MEDICALCENTER |
- | - | - | CENTRALMEDICAL INC |
E350 |
Name without business words, similar with typos, sounds like and multiple names in common |
All words in the shorter entity name match with a Character Match Percentage of 80 or more in the longer entity name (in order) after number cardinal and ordinal standardization, and after common company prefixes, suffixes and other words are removed. There are at least two significant words (not common business words) that match with a Character Match Percentage of 80 or more. The first word of each name has the same 4- character Metaphone key. The list name is not an acronym alias of a longer primary entity name. | BLACKCHAIR LTD |
- | - | - | BLACK WORLD COLLEGEOF HAIR DESIGN |
E360 | Name without business words has typos and sounds like | The entity names match with a Character Match Percentage of between 80 and 99 after number cardinaland ordinal standardization, and after common company prefixes, suffixes and other words are removed. The first word of each name has the same 4-character Metaphone key. | BOURNE CHIROPRACTIC LTD |
- | - | - | BARNO CHIROPRACTIC |
E370 | Name without suffixes contains with typos and multiple names in common | The entity names are a ''Contains'' match where each word matches with a Character Match Percentage of 80 or more after number cardinal and ordinal standardization, and after common company prefixes and suffixes are removed. There are at least two significant words (not common business words) that match. | NEWORLEANS |
- | - | - | MEDICABOF METRO NEW ORLEANS |
E380 | Name without suffixes contains, similar,and multiple words in common | The entity names are a 'Contains' match and the Word Edit Distance is no more than one between the names (where each word matches with a Character Match Percentage of 80 or more), after number cardinal and ordinal standardization, and after common company prefixes and suffixes are removed. There are at least two significant words (not common business words) that match with a Character Match Percentage of 80 or more. | GROSVENOR NURSINGSERVICES |
- | - | - | NURSINGSERVICES INC |
E390 | Original script name has additional names with typos | All words in the shorter original script name match in the longer original script name (in order) with a Character Match Percentage of 80 or more, and there are at least two matching words. | Арабски революционни бригади |
- | - | - | Арабски революциони |
E400 | Name has additional words and sounds like | All words in the shorter entity name exist in the longer entity name (in order) after number cardinal and ordinal standardization. | ATRIUM INCORPORATORS WORLDWIDELTD |
- | - | - | ATRIUM |
E410 | Name has additional words with typos and sounds like |
All words in the shorter entity name match in the longer entity name (in order) with a Character Match Percentage of 80 or more after number cardinal and ordinal standardization. The first word of each name has the same 4-character Metaphone key. |
BRILLIANT GENERAL BUILDING CONTRACTOR LTD |
- | - | - | BRILLIANCE |
E420 | Name without business words loose match and full name sounds like | The entity names match with a Character Match Percentage of between 60 and 79 after number cardinaland ordinal standardization, and after common company prefixes, suffixes and other words are removed. The names have the same Metaphone key. | BRC |
- | - | - | PRC |