Understanding the Sun Match Engine

The Person Constants File (personConstants*.cfg)

The person constants file defines certain information about the standardization files used for processing person data, primarily the number of lines contained in each file. The number of lines specified here must be equal to or greater than the number of lines actually contained in each file. The constants file for United States data is in the Standardization node of the project and is named personConstants.cfg; the person constants file for the other domains is located under the domain name node.

Table 9 lists and describes each parameter in the constants file. The files referenced by these parameters are described on the following pages.

Table 9 Person Constants File Parameters

Parameter 

Description 

words

The maximum number of words in a given free-form text field containing a person name. This parameter is not currently used. 

conjmax

The maximum number of lines in the person conjunction reference file (personConjon*.dat).

jrsrmax

The maximum number of lines in the generational suffix category file (personGenSuffix*.dat).

nickmax

The maximum number of lines in the first name category file (personFirstName*.dat).

lastmax

The maximum number of lines in the last name category file (personLastName*.dat).

premax

The maximum number of lines in the last name prefix category file (personLastNamePrefix*.dat).

titlmax

The maximum number of lines in the title category file (personTitle*.dat).

sufmax

The maximum number of lines in the occupational suffix category file (personOccupSuffix*.dat).

skpmax

The maximum number of lines in the business name reference file (businessOrRelated*.dat).

ptrnmax1

The maximum number of lines in the person patterns file (personNamePatt.dat).

twomax

The maximum number of lines in the two-character reference file for occupational suffixes (personTwo*.dat).

thremax

The maximum number of lines in the three-character reference file for occupational suffixes (personThree*.dat).

blnkmax

The maximum number of lines in the special characters reference file (personRemoveSpecChars.dat).

dashSize

The maximum number of lines in the hyphenated name category file (personFirstNameDash.dat).