Understanding the Sun Match Engine

Sun Match Engine Field Identifiers

The Sun Match Engine breaks down fields into various components. For example, it breaks addresses into floor number, street number, street name, street direction, and so on. Some of these components are similar and are typically stored in the same field in the database. In the default configuration, for example, when the standardization engine finds a house number, rural route number, or PO box number, the value is stored in the HouseNumber database field. You can customize this as needed, as long as any field you specify to store a component is also included in the object structure defined for the master index application.
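The way several parsed components share one database field by default can be sketched as a simple lookup. The following Python sketch is illustrative only and is not part of the match engine. The field IDs and target suffixes (HouseNo, StName, StDir, StType) come from Table 3, while the source field name AddressLine1 and both helper functions are hypothetical names introduced for this example.

```python
# Illustrative sketch only, not the match engine's API.
# Default target-field suffixes per field ID, as listed in Table 3.
DEFAULT_TARGET_SUFFIX = {
    "HouseNumber": "HouseNo",
    "RuralRouteIdentif": "HouseNo",
    "BoxIdentif": "HouseNo",
    "MatchStreetName": "StName",
    "RuralRouteDescript": "StName",
    "BoxDescript": "StName",
    "PropDesPrefDirection": "StDir",
    "PropDesSufDirection": "StDir",
    "StreetNamePrefDirection": "StDir",
    "StreetNameSufDirection": "StDir",
    "StreetNameSufType": "StType",
    "StreetNamePrefType": "StType",
    "PropDesSufType": "StType",
    "PropDesPrefType": "StType",
}


def default_target_field(source_field, field_id):
    """Build the default field_name_Suffix target for a field ID (hypothetical helper)."""
    return f"{source_field}_{DEFAULT_TARGET_SUFFIX[field_id]}"


def components_by_target(source_field):
    """Group field IDs by the database field that stores them by default."""
    grouped = {}
    for field_id in DEFAULT_TARGET_SUFFIX:
        target = default_target_field(source_field, field_id)
        grouped.setdefault(target, []).append(field_id)
    return grouped


if __name__ == "__main__":
    # "AddressLine1" is an assumed source field name used only for illustration.
    for target, field_ids in components_by_target("AddressLine1").items():
        print(target, "<-", ", ".join(field_ids))
```

Under these assumptions, the house number, rural route identifier, and PO box number all resolve to the same AddressLine1_HouseNo field, which mirrors the default behavior described above.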

The Sun Match Engine uses field identifiers to determine how to process fields that are defined for normalization or parsing. The IDs are defined internally in the match engine and are referenced in the Match Field file. The field IDs you specify for each field in the Match Field file determine how that field is processed by the standardization engine. The field IDs for person names determine how each name is normalized. The field IDs for business names specify which business type key file to use for standardization. The field IDs for addresses determine which database fields store each field component and how each component is standardized.

Table 3 lists each field component generated by the Sun Match Engine along with its corresponding field ID. You can specify only the predefined field IDs listed in this table.

Table 3 Standardization Field Identifiers

Field ID 

Description 

Person Name Standardization Field Identifiers

FirstName

Specifies a first name field for normalization. 

LastName

Specifies a last name field for normalization. 

Address Standardization Field Identifiers

HouseNumber

Specifies the parsed house number from a standardized address field. By default, this is stored in the field_name_HouseNo field (or the HouseNumber field for Sun Master Patient Index).

RuralRouteIdentif

Specifies the parsed rural route identifier from a standardized address field. By default, this is stored in the field_name_HouseNo field (or the HouseNumber field for Sun Master Patient Index).

BoxIdentif

Specifies the parsed PO box number from a standardized address field. By default, this is stored in the field_name_HouseNo field (or the HouseNumber field for Sun Master Patient Index).

MatchStreetName

Specifies the parsed and standardized street name from a standardized address field and is used internally by the match engine. If you want to store the standardized street name in the database (recommended), map this field to the street name field in the database. By default, this is stored in the field_name_StName field (or the StreetName field for Sun Master Patient Index).

OrigStreetName

Specifies the parsed street name from an address field. If you want to store the original street name in the database, map this field to the street name field in the database. This address component is not included in the default standardization structure, but you can add it if needed. 

RuralRouteDescript

Specifies the parsed rural route description from a standardized address field. By default, this is stored in the field_name_StName field (or the StreetName field for Sun Master Patient Index).

BoxDescript

Specifies the PO box type from a standardized address field. By default, this is stored in the field_name_StName field (or the StreetName field for Sun Master Patient Index).

PropDesPrefDirection

Specifies the parsed property direction from a standardized address field. This field ID handles cases where the direction is a prefix to the property description. By default, this is stored in the field_name_StDir field (or the StreetDir field for Sun Master Patient Index).

PropDesSufDirection

Specifies the parsed property direction from a standardized address field. This field ID handles cases where the direction is a suffix to the property description. By default, this is stored in the field_name_StDir field (or the StreetDir field for Sun Master Patient Index).

StreetNamePrefDirection

Specifies the parsed street direction from a standardized address field. This field ID handles cases where the direction is a prefix to the street name. By default, this is stored in the field_name_StDir field (or the StreetDir field for Sun Master Patient Index).

StreetNameSufDirection

Specifies the parsed street direction from a standardized address field. This field ID handles cases where the direction is a suffix to the street name. By default, this is stored in the field_name_StDir field (or the StreetDir field for Sun Master Patient Index).

StreetNameSufType

Specifies the parsed street type from a standardized address field. This field ID handles cases where the street type is a suffix to the street name. By default, this is stored in the field_name_StType field (or the StreetType field for Sun Master Patient Index).

StreetNamePrefType

Specifies the parsed street type from a standardized address field. This field ID handles cases where the street type is a prefix to the street name. By default, this is stored in the field_name_StType field (or the StreetType field for Sun Master Patient Index).

PropDesSufType

Specifies the parsed property type from a standardized address field. This field ID handles cases where the property type is a suffix to the property description. By default, this is stored in the field_name_StType field (or the StreetType field for Sun Master Patient Index).

PropDesPrefType

Specifies the parsed property type from a standardized address field. This field ID handles cases where the property type is a prefix to the property description. By default, this is stored in the field_name_StType field (or the StreetType field for Sun Master Patient Index).

HouseNumPrefix

Specifies the parsed house number prefix from a standardized address field (such as the “A” in “A 1587 4th Street”). This address component is not included in the default standardization structure, but you can add it if needed. 

SecondHouseNumberPrefix

Specifies the parsed second house number prefix from a standardized address field (such as “25” in “25 319 10th Ave.”). This address component is not included in the default standardization structure, but you can add it if needed.

SecondHouseNumber

Specifies the parsed second house number from a standardized address field. This address component is not included in the default standardization structure, but you can add it if needed.

HouseNumSuffix

Specifies the parsed house number suffix from a standardized address field. This address component is not included in the default standardization structure, but you can add it if needed. 

OrigSecondStreetName

Specifies the parsed second street name from a standardized address field (for example, an address might include a cross-street or a thoroughfare and dependent thoroughfare). This address component is not included in the default standardization structure, but you can add it if needed.

SecondStreetNameSufDirection

Specifies the parsed second street direction from a standardized address field. This address component is not included in the default standardization structure, but you can add it if needed.

SecondStreetNameSufType

Specifies the parsed second street type from a standardized address field. This address component is not included in the default standardization structure, but you can add it if needed.

StreetNameExtensionIndex

Specifies the parsed street name extension from a standardized address field. This address component is not included in the default standardization structure, but you can add it if needed. 

WithinStructDescript

Specifies the parsed internal descriptor (such as “Floor”) from a standardized address field. This address component is not included in the default standardization structure, but you can add it if needed. 

WithinStructIdentif

Specifies the parsed internal identifier (such as a floor number) from a standardized address field. This address component is not included in the default standardization structure, but you can add it if needed. 

OrigPropertyName

Specifies the parsed original property name (such as the name of a complex or business park) from a standardized address field. This address component is not included in the default standardization structure, but you can add it if needed. 

MatchPropertyName

Specifies the parsed match property name from a standardized address field and is used internally by the match engine for blocking and phonetic encoding. This address component is not included in the default standardization structure, but you can add it if needed. 

CenterDescript

Specifies the parsed structure description from a standardized address field. This address component is not included in the default standardization structure, but you can add it if needed. 

CenterIdentif

Specifies the parsed structure identifier from a standardized address field. This address component is not included in the default standardization structure, but you can add it if needed. 

ExtraInfo

Specifies any extra information that was not included in any of the other parsed components. This address component is not included in the default standardization structure, but you can add it if needed. 

Business Name Standardization Field Identifiers

PrimaryName

Specifies the field containing the parsed name in a free-form text business name field. 

OrgTypeKeyword

Specifies the field containing the parsed organization type in a free-form text business name field. 

AssocTypeKeyword

Specifies the field containing the parsed association type in a free-form text business name field. 

IndustrySectorList

Specifies the field containing the parsed industry sector in a free-form text business name field. 

IndustryTypeKeyword

Specifies the field containing the parsed industry type in a free-form text business name field (industry type is a subset of the sector). 

AliasList

Specifies the field containing the parsed alias in a free-form text business name field. 

Url

Specifies the field containing the parsed URL in a free-form text business name field.
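Because only the predefined field IDs in Table 3 can be specified, it can help to check a planned configuration against the table before editing the Match Field file. The following Python sketch shows one way to do that. It is an illustration under stated assumptions, not a tool supplied with the match engine: the group labels PersonName, Address, and BusinessName and the function invalid_field_ids are names chosen for this example, while the field IDs themselves are copied from Table 3.

```python
# Illustrative sketch only: a standalone check, not part of the match engine.
# Field IDs are copied from Table 3; the group labels are assumed names.
PREDEFINED_FIELD_IDS = {
    "PersonName": {"FirstName", "LastName"},
    "Address": {
        "HouseNumber", "RuralRouteIdentif", "BoxIdentif",
        "MatchStreetName", "OrigStreetName",
        "RuralRouteDescript", "BoxDescript",
        "PropDesPrefDirection", "PropDesSufDirection",
        "StreetNamePrefDirection", "StreetNameSufDirection",
        "StreetNameSufType", "StreetNamePrefType",
        "PropDesSufType", "PropDesPrefType",
        "HouseNumPrefix", "SecondHouseNumberPrefix",
        "SecondHouseNumber", "HouseNumSuffix",
        "OrigSecondStreetName", "SecondStreetNameSufDirection",
        "SecondStreetNameSufType", "StreetNameExtensionIndex",
        "WithinStructDescript", "WithinStructIdentif",
        "OrigPropertyName", "MatchPropertyName",
        "CenterDescript", "CenterIdentif", "ExtraInfo",
    },
    "BusinessName": {
        "PrimaryName", "OrgTypeKeyword", "AssocTypeKeyword",
        "IndustrySectorList", "IndustryTypeKeyword", "AliasList", "Url",
    },
}


def invalid_field_ids(group, field_ids):
    """Return the field IDs (in input order) that Table 3 does not list for the group."""
    allowed = PREDEFINED_FIELD_IDS[group]
    return [field_id for field_id in field_ids if field_id not in allowed]


if __name__ == "__main__":
    # "StreetName" is a database field name, not a field ID, so it is reported.
    print(invalid_field_ids("Address", ["HouseNumber", "StreetName"]))
```

A check like this catches a common mix-up: using a database field name (such as StreetName) where a field ID (such as MatchStreetName) is expected.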