Understanding the Sun Match Engine

The Address Master Clues File (addressMasterClues*.dat)

The address master clues file lists common terms in street addresses as defined by the United States Postal Service (USPS), the United Kingdom’s Royal Mail, the Australian Postal Corporation, or France’s La Poste (depending on the domain in use). For each common term, this file specifies a normalized value, defines postal information, and categorizes the terms into street address component types. A term can be categorized into multiple component types.

The syntax of this file is:

ID-number common-term normalized-term short-abbrev postal-abbrev CFCCS type-token usage-flag postal-flag

You can modify or add entries in this table as needed. Table 17 describes the columns in the addressMasterClues*.dat file.

Table 17 Address Master Clue File Columns

Column 

Description 

ID-number

A unique identification number for the address common term. This number corresponds to an ID number for the same term in the address clues file. 

common-term 

A common address term, such as Park, Village, North, Route, Centre, and so on. 

normalized-term 

The normalized version of the common term. 

short-abbrev 

A short abbreviation of the common term. 

postal-abbrev 

The standard postal abbreviation of the common term. 

CFCCS 

The census feature class code of the term (as defined in the Census Tiger® database). The following values are used:

  • A – Road

  • B – Railroad

  • C – Miscellaneous

  • D – Landmark

  • E – Physical feature

  • F – Nonvisible feature

  • H – Hydrography

  • X – Unclassified

type-token 

The type of address component represented by the common term. Types are specified by an address token (for more information, see Address Type Tokens).

usage-flag 

A flag indicating how the term is used (for more information, see Pattern Classes)

postal-flag 

The standard postal code for the term. 

Following is an excerpt from the addressMasterCluesUS.dat file.


11Alley                    Alley            Al         Aly A        TY R U
12Alternate Route          Alt Rte          Alt        Alt A        TY R
15Arcade                   Arcade           Arc        Arc A        TY R U
16Arroyo                   Arroyo           Arryo      ArryHA       TY R
17Autopista                Atpta            Apta       AptaA        TY R
18Avenida                  Avenida          Ava        Ava A        TY R
19Avenue                   Avenue           Ave        Ave A        TY R U
26Boulevard                Blvd             Blvd       BlvdA        TY R U
32Bulevar                  Blvr             Blv        Blv A        TY R
33Business Route           Bus Rte          BusRt      BsRtA        TY R
34Bypass                   Bypass           Byp        Byp A        TY R U
36Calle                    Calle            Calle      ClleA        TY R
37Calleja                  Calleja          Cja        Cja A        TY R
38Callejon                 Callej           Cjon       CjonA        TY R
39Camino                   Camino           Cam        Cam A        TY R
47Carretera                Carrt            Carr       CarrA        TY R
48Causeway                 Cswy             Cswy       CswyAH       TY R U
51Center                   Center           Ctr        Ctr DA       TY R U