Understanding the Master Index Standardization Engine

Output Symbol Definitions

The output symbol definitions name each output symbol that can be produced by the defined states. This section can also define additional processing for output symbols using the rules described in Standardization Processing Rules Reference. Each output symbol referenced in the state model definitions must match the name of an output symbol defined here. Below is an excerpt from the output symbol definitions for PersonName processing.


<outputSymbols>
   <outputSymbol name="salutation"/>
   <outputSymbol name="firstName"/>
   <outputSymbol name="middleName"/>
   <outputSymbol name="nickname"/>
   <outputSymbol name="lastName"/>
   <outputSymbol name="generation"/>
   <outputSymbol name="title"/>
   <outputSymbol name="conjunction"/>
</outputSymbols>

The following table lists and describes the XML elements and attributes for the output symbol definitions.

| Element | Attribute | Description |
|---|---|---|
| outputSymbols | | A list of output symbols for each processing state. |
| outputSymbol | | A definition for one output symbol. |
| | name | The name of the output symbol. |
| occurrenceConcatenator | | An optional class to specify the character that separates contiguous occurrences of the same output symbol. For example, this is used in the PhoneNumber data type to concatenate phone number components that are separated by dashes. Components are concatenated using blanks. |
| | class | The name of the occurrence concatenator class. One concatenator class is predefined. |
| property | | A parameter for the occurrence concatenator class. For the default class, the parameter specifies a separator character. |
| | name | The name of the parameter. For the default class, the name is “separator”. |
| | value | The parameter value. |
| tokenConcatenator | | An optional class to specify the character that separates non-contiguous occurrences of the same output symbol. For example, this is used in the PhoneNumber data type to concatenate phone number components. |
| | class | The name of the token concatenator class. One concatenator class is predefined. |
| property | | A parameter for the token concatenator class. For the default class, the parameter specifies a separator character. |
| | name | The name of the parameter. For the default class, the name is “separator”. |
| | value | The value of the parameter. |
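
The PersonName excerpt above does not use the concatenator elements, so the following is a minimal sketch of how they might be combined. It assumes that occurrenceConcatenator and tokenConcatenator are nested inside an outputSymbol element and that each property is nested inside its concatenator, which is how the element descriptions read. The symbol name "lineNumber" and the class name com.example.SeparatorConcatenator are placeholders invented for illustration; substitute the predefined concatenator class supplied with your installation.

```xml
<outputSymbols>
   <!-- "lineNumber" is an illustrative symbol name, not taken from the product files -->
   <outputSymbol name="lineNumber">
      <!-- Placeholder class name; replace it with the predefined concatenator class -->
      <!-- Contiguous occurrences of this symbol would be joined with a dash -->
      <occurrenceConcatenator class="com.example.SeparatorConcatenator">
         <property name="separator" value="-"/>
      </occurrenceConcatenator>
      <!-- Non-contiguous occurrences of this symbol would be joined with a blank -->
      <tokenConcatenator class="com.example.SeparatorConcatenator">
         <property name="separator" value=" "/>
      </tokenConcatenator>
   </outputSymbol>
</outputSymbols>
```

Both concatenators are optional, so an output symbol that needs no special joining can stay in the short self-closing form used in the PersonName excerpt.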