E.5.1 Fuzzy Name Matcher Utility

During BD Datamap processing, the Fuzzy Name Matcher utility is used to match names of individuals and corporations (candidates) against a list of names (targets).

The utility calculates a score that indicates how strongly the candidate name matches the target name. All matches are case-insensitive.

The Fuzzy Name Matcher engine supports matching on ASCII, extended ASCII, and the first 128 characters of the UTF-8 character set, which are encoded identically to ASCII. UTF-8 characters beyond this range (such as Chinese or Arabic characters) are ignored: they do not cause the engine to crash, but names that contain them do not match. Any encoding other than ASCII, extended ASCII, or UTF-8 is not supported; it causes unpredictable behavior and is likely to crash the engine.
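Because names with characters outside the supported range are not matched, it can help to review the input lists before a run. The following is a minimal sketch, not part of the utility: the function name list_non_ascii_names is hypothetical, and the sketch assumes that each input file holds one name per line. It flags every line that contains a byte outside the 7-bit ASCII range. Extended ASCII names are flagged as well, even though the engine supports them, so treat the output as a list to review rather than a list of failures.

```shell
#!/bin/sh
# list_non_ascii_names: hypothetical helper, not shipped with the utility.
# Assumes one name per line in the file given as $1.
# Prints "<line number>:<name>" for each line that contains a byte
# outside the 7-bit ASCII range (0x00-0x7F).
list_non_ascii_names() {
    line_no=0
    # The "|| [ -n ... ]" part also handles a last line without a newline.
    while IFS= read -r name || [ -n "$name" ]; do
        line_no=$((line_no + 1))
        # Delete every 7-bit ASCII byte; anything left over is non-ASCII.
        rest=$(printf '%s' "$name" | LC_ALL=C tr -d '\000-\177')
        if [ -n "$rest" ]; then
            printf '%s:%s\n' "$line_no" "$name"
        fi
    done < "$1"
}
```

For example, list_non_ascii_names candidates.txt (a hypothetical file name) prints the flagged lines to standard output, and prints nothing when every name is plain ASCII.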

Using the Fuzzy Name Matcher Utility

The utility typically runs as part of automated processing that a job scheduling tool such as Maestro or Unicenter AutoSys manages. You can also execute the utility through a UNIX shell script, which the next section describes.

Executing the Fuzzy Name Matcher Utility

To execute the Fuzzy Name Matcher Utility manually, type the following at the UNIX command line:
fuzzy_match.sh -t <target_name_list> -c <candidate_name_list> -r <result_file>
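When the command is run by hand or from a custom script, a thin wrapper can confirm that both input lists are readable before the utility starts. The sketch below is illustrative only: the function run_fuzzy_match and the FUZZY_MATCH_CMD override variable are hypothetical names, and the only options passed are the -t, -c, and -r options shown above.

```shell
#!/bin/sh
# run_fuzzy_match: hypothetical wrapper, not shipped with the utility.
# FUZZY_MATCH_CMD is an assumed override for the command to run;
# it defaults to the documented script name, fuzzy_match.sh.
run_fuzzy_match() {
    if [ "$#" -ne 3 ]; then
        echo "usage: run_fuzzy_match <target_name_list> <candidate_name_list> <result_file>" >&2
        return 2
    fi
    target_list=$1
    candidate_list=$2
    result_file=$3

    # Stop early if either input list is missing or unreadable.
    for input in "$target_list" "$candidate_list"; do
        if [ ! -r "$input" ]; then
            echo "cannot read input file: $input" >&2
            return 1
        fi
    done

    "${FUZZY_MATCH_CMD:-fuzzy_match.sh}" -t "$target_list" -c "$candidate_list" -r "$result_file"
}
```

The wrapper returns the exit status of the underlying command, so a scheduler or calling script can still detect a failed run.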