Understanding the Sun Match Engine

Sun Match Engine Match and Standardization Types

The Match Field file uses indicators, called match types and standardization types, to specify the kind of matching and standardization to perform on each field. You must specify one of these indicators for each field you define for standardization or matching. The match types correspond to the match types listed in the first column of the match configuration file (matchConfigFile.cfg). The standardization types are defined internally in the match engine. The Sun Match Engine uses these types to determine how to process each field.
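As a rough illustration of how the match types assigned to fields tie back to the first column of matchConfigFile.cfg, the following Python sketch collects the first token of each rule line from a configuration-like text and reports any field whose match type is not defined there. Only the idea that the first column holds the match type comes from the text above. The sample text, the whitespace-delimited layout, the "#" comment handling, the field names, and the function names are assumptions made for illustration; they are not the actual file format or part of the Sun Match Engine.

```python
# Hypothetical excerpt standing in for matchConfigFile.cfg. The remaining
# columns are shown as placeholders because their layout is not described here.
SAMPLE_CONFIG = """\
# match-type  (remaining columns are placeholders)
FirstName     ...
LastName      ...
DateDays      ...
SSN           ...
"""


def match_types_in_config(config_text):
    """Return the set of first-column tokens from non-empty, non-comment lines."""
    types = set()
    for line in config_text.splitlines():
        stripped = line.strip()
        if not stripped or stripped.startswith("#"):
            continue
        types.add(stripped.split()[0])
    return types


def undefined_match_types(field_match_types, config_text):
    """Return {field: match_type} for fields whose type is missing from the config."""
    defined = match_types_in_config(config_text)
    return {
        field: mtype
        for field, mtype in field_match_types.items()
        if mtype not in defined
    }


# Hypothetical field-to-match-type assignments, as they might be read from
# the Match Field file.
fields = {
    "Person.FirstName": "FirstName",
    "Person.LastName": "LastName",
    "Person.DOB": "DateDays",
    "Person.Gender": "Char",
}

print(undefined_match_types(fields, SAMPLE_CONFIG))
```

In this sketch, Person.Gender is reported because its match type, Char, has no line in the sample configuration text.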

Table 4 lists the default standardization types; Table 5 lists the default match types. You can modify the match type names but not the standardization type names. For more information about match and standardization types, see Master Index Match Types and Field Names (Repository) in Understanding Sun Master Index Processing (Repository). Note that the match types you can specify in the Match Field file (listed in Table 5) are not the same as the values you select from the Match Type drop-down list in the wizard when you create the master index application.

Table 4 Standardization Types

This indicator ...      processes this data type ...

Address                 Free-form street address fields.
PersonName              Pre-parsed name fields (including any first, middle, last, or alias names).
BusinessName            Free-form business names.

The standardization types listed above correspond to the first three categories of match types listed below (business name, address, and person name). You can also specify date and miscellaneous match types, which do not correspond to any standardization types.
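The correspondence between the standardization types in Table 4 and the match type categories in Table 5 can be sketched as a simple lookup. The type names are taken from the two tables; the dictionary and the function name standardization_type_for are hypothetical and only restate the tables, they are not part of the Sun Match Engine.

```python
# Standardization types (Table 4) mapped to the default match types of the
# corresponding category (Table 5).
STANDARDIZATION_TO_MATCH_TYPES = {
    "BusinessName": [
        "PrimaryName", "OrgTypeKeyword", "AssocTypeKeyword", "AliasList",
        "IndustrySectorList", "IndustryTypeKeyword", "Url",
    ],
    "Address": ["StreetName", "HouseNumber", "StreetDir", "StreetType"],
    "PersonName": ["FirstName", "LastName"],
}


def standardization_type_for(match_type):
    """Return the standardization type whose category lists match_type, or None."""
    for std_type, match_types in STANDARDIZATION_TO_MATCH_TYPES.items():
        if match_type in match_types:
            return std_type
    return None


print(standardization_type_for("StreetDir"))  # Address
print(standardization_type_for("SSN"))        # None
```

Match types outside these three categories, such as SSN, return None here because no standardization type corresponds to them.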

Table 5 Match Types

This indicator ...      processes this data type ...

Business Name Match Types
PrimaryName             The parsed name field of a business name.
OrgTypeKeyword          The parsed organization type field of a business name.
AssocTypeKeyword        The parsed association type field of a business name.
AliasList               The parsed alias type field of a business name.
IndustrySectorList      The parsed industry sector field of a business name.
IndustryTypeKeyword     The parsed industry type field of a business name.
Url                     The parsed URL field of a business name.

Address Match Types
StreetName              The parsed street name field of a street address.
HouseNumber             The parsed house number field of a street address.
StreetDir               The parsed street direction field of a street address.
StreetType              The parsed street type field of a street address.

Person Name Match Types
FirstName               A first name field, including middle name, alias first name, and alias middle name fields.
LastName                A last name field, including alias last name fields.

Date Match Types
DateDays                The day, month, and year of a date field.
DateMonths              The month and year of a date field.
DateHours               The hour, day, month, and year of a date field.
DateMinutes             The minute, hour, day, month, and year of a date field.
DateSeconds             The seconds, minute, hour, day, month, and year of a date field.

Miscellaneous Match Types
String                  A generic string field.
Numeric                 A numeric field.
Integer                 A field containing integers.
Real                    A field containing real numbers.
SSN                     A field containing a social security number.
Char                    A field containing a single character.
pro                     Any field on which you want the Sun Match Engine to use prorated weights.
Exac                    Any field you want the Sun Match Engine to match character for character.
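The Date match types in Table 5 differ only in how much of a date and time value they cover. The following Python sketch shows which components each type takes into account, using two sample timestamps. The component lists follow the table descriptions; the function name covered_components and the strict equality test are assumptions made for illustration. How the Sun Match Engine actually compares and weights date values is not described here, so this should not be read as the engine's comparison logic.

```python
from datetime import datetime

# Components covered by each Date match type, as described in Table 5.
DATE_MATCH_COMPONENTS = {
    "DateMonths": ("year", "month"),
    "DateDays": ("year", "month", "day"),
    "DateHours": ("year", "month", "day", "hour"),
    "DateMinutes": ("year", "month", "day", "hour", "minute"),
    "DateSeconds": ("year", "month", "day", "hour", "minute", "second"),
}


def covered_components(value, match_type):
    """Return the parts of a datetime that the given Date match type covers."""
    return tuple(getattr(value, name) for name in DATE_MATCH_COMPONENTS[match_type])


# Two sample timestamps on the same day but at different times.
a = datetime(1985, 3, 14, 9, 30, 15)
b = datetime(1985, 3, 14, 17, 5, 0)

for match_type in DATE_MATCH_COMPONENTS:
    same = covered_components(a, match_type) == covered_components(b, match_type)
    print(match_type, same)
```

For these two values, the covered components are identical under DateMonths and DateDays and differ under DateHours, DateMinutes, and DateSeconds, which is the practical difference between choosing a coarser or finer date match type.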