The first name category file defines standardized versions of first names and assigns a gender classification for each name. This file is used to standardize first names when comparing person names. The gender classification helps to further clarify the match. The Sun Match Engine uses this file when a first name field is defined for normalization or standardization in the Match Field file.
The syntax of this file is:
original-value standardized-form gender-class
You can modify or add entries in this table as needed. Table 10 describes the columns in the personFirstName*.dat file.
Table 10 First Name Category File
Following is an excerpt from the personFirstNameUS.dat file. Certain rows contain a zero (0) for the standardized form, indicating that the name is already standard (for example, Stephen, Sterling, and Summer).
STEPHEN 0 M STEPHENIE STEPHANIE F STEPHIE STEPHANIE F STEPHINE STEPHANIE F STEPHNIE STEPHANIE F STERLING 0 M STEVE STEPHEN M STEVEN STEPHEN M STEVIE STEPHEN N STEW STUART M STEWART STUART M STU STUART M STUART 0 M SU SUSAN F SUE SUSAN F SUHANTO 0 M SULLIVAN 0 F SULLY SULLIVAN F SUMMER 0 F |