Understanding the Sun Match Engine
Related Topics
About the Sun Match Engine
Sun Match Engine Overview
About the Sun Match Engine Matching Algorithm
Sun Match Engine Standardization and Matching Process
Sun Match Engine Data Types
How the Sun Match Engine Works
Sun Match Engine Matching Weight Formulation
Matching and Unmatching Probabilities
Agreement and Disagreement Weight Ranges
Sun Match Engine Standardization Configuration
Sun Match Engine Standardization File Types
Sun Match Engine Internationalization
Sun Match Engine Matching Configuration
The Sun Match Engine Match Configuration File
Sun Match Engine Match Configuration File Format
Match Configuration File Sample
Probability Type
Matching Rules
Sun Match Engine Matching Comparison Functions
The Match Constants File
Sun Match Engine and the Sun Match Engine
Master Index Components and the Sun Match Engine
Searching and Matching in Sun Match Engine Applications (Repository)
Standardization and Matching Process in Master Index Applications (Repository)
The Master Index Match String (Repository)
Sun Match Engine Field Identifiers
Sun Match Engine Match and Standardization Types
Sun Match Engine Configuration File Modifications
Configuring the Master Index Matching Service (Repository)
Master Index Standardization Configuration (Repository)
Normalization Structures
Standardization Structures (Parsing and Normalization)
Phonetic Encoding Structures
Master Index Match String Configuration (Repository)
Match and Standardization Engine Configuration
Master Index Phonetic Encoder Configuration (Repository)
Sun Match Engine Person Data Type Configuration
Sun Match Engine Person Matching Overview
Sun Match Engine Person Data Processing Fields
Person Data Match String Fields
Person Data Standardized Fields
Person Data Object Structure
Sun Match Engine Match Configuration for Person Data
Sun Match Engine Person Data Standardization Files
Sun Match Engine Common Standardization Files for Person Data
The Hyphenated Name Category File (personFirstNameDash.dat)
The Person Name Patterns File (personNamePatt.dat)
The Special Characters Reference File (personRemoveSpecChars.dat)
Sun Match Engine Domain-Specific Standardization Files for Person Data
The Conjunction Reference File (personConjon*.dat)
The Person Constants File (personConstants*.cfg)
The First Name Category File (personFirstName*.dat)
The Generational Suffix Category File (personGenSuffix*.dat)
Last Name Prefix Category File (personLastNamePrefix*.dat)
The Last Name Category File (personLastName*.dat)
The Occupational Suffix Category File (personOccupSuffix*.dat)
The Three-Character Suffix File (personThree*.dat)
The Title Category File (personTitle*.dat)
The Two-Character Suffix File (personTwo*.dat)
The Business-Related Category File (businessOrRelated*.dat)
Configuring the Sun Match Engine Standardization Files for Person Data
Configuring the Master Index Matching Service for Person Data (Repository)
Configuring the Standardization Structure for Person Data (Repository)
Person Data Normalization Structures
Person Data Phonetic Encoding
Configuring the Match String for Person Data (Repository)
Sun Match Engine Address Data Type Configuration
Sun Match Engine Address Matching Overview
Sun Match Engine Address Data Processing Fields
Address Data Match String Fields
Address Data Standardized Fields
Address Data Object Structure
Match Configuration for Address Data (Repository)
Sun Match Engine Standardization Configuration for Address Data
The Address Constants File (addressConstants*.cfg)
The Address Clues File (addressClueAbbrev*.dat)
The Address Internal Constants File (addressInternalConstants*.cfg)
The Address Master Clues File (addressMasterClues*.dat)
The Address Patterns File (addressPatterns*.dat)
The Address Output Patterns File (addressOutPatterns*.dat)
Address Pattern File Components
Address Type Tokens
Pattern Classes
Pattern Modifiers
Priority Indicators
Modifying Sun Match Engine Address Data Configuration Files
Configuring the Matching Service for Address Data (Repository)
Configuring the Standardization Structure for Address Data (Repository)
Address Standardization Structures
Address Phonetic Encoding
Configuring the Match String for Address Data (Repository)
Sun Match Engine Business Names Data Type Configuration
Sun Match Engine Business Name Matching Overview
Sun Match Engine Business Name Processing Fields
Business Name Match String Fields
Business Name Standardized Fields
Business Name Object Structure
Sun Match Engine Match Configuration for Business Names
Sun Match Engine Standardization Configuration for Business Names
The Business Constants File (bizConstants.cfg)
The Adjectives Key Type File (bizAdjectivesTypeKeys.dat)
The Alias Key Type File (bizAliasTypeKeys.dat)
The Association Key Type File (bizAssociationTypeKeys.dat)
The General Terms Reference File (bizBusinessGeneralTerms.dat)
The City or State Key Type File (bizCityorStateTypeKeys.dat)
The Business Former Name Reference File (bizCompanyFormerNames.dat)
The Merged Business Name Category File (bizCompanyMergerNames.dat)
The Primary Business Name Reference File (bizCompanyPrimaryNames.dat)
The Connector Tokens Reference File (bizConnectorTokens.dat)
The Country Key Type File (bizCountryTypeKeys.dat)
The Industry Sector Reference File (bizIndustryCategoryCode.dat)
The Industry Key Type File (bizIndustryTypeKeys.dat)
The Organization Key Type File (bizOrganizationTypeKeys.dat)
The Business Patterns File (bizPatterns.dat)
Business Name Tokens
The Special Characters Reference File (bizRemoveSpecChars.dat)
Modifying Sun Match Engine Business Name Configuration Files
Configuring the Matching Service for Business Names (Repository)
Configuring the Standardization Structure for Business Names (Repository)
Business Name Standardization Structures
Business Name Phonetic Encoding
Configuring the Match String for Business Names (Repository)
Fine-Tuning Weights and Thresholds for Sun Match Engine (Repository)
Data Analysis Overview
Customizing the Match Configuration and Thresholds
Determining the Match Fields
Customizing the Match Configuration
Probabilities or Agreement Weights
Defining Relative Value
Determining the Weight Range
Weight Ranges Using Agreement Weights
Weight Ranges Using Probabilities
Comparison Functions
Determining the Weight Thresholds
Specifying the Weight Thresholds
Fine-tuning the Thresholds
Match Configuration Comparison Functions for Sun Match Engine (Repository)
Sun Match Engine Comparison Functions
Bigram Comparators
Bigram String Comparator (b1)
Advanced Bigram String Comparator (b2)
Uncertainty String Comparators
Generic String Comparator (u)
Advanced Generic String Comparator (ua)
Simplified String Comparator (us)
Simplified String Comparator - FirstName (uf)
Simplified String Comparator - LastName (ul)
Simplified String Comparator - House Numbers (un)
Language-specific String Comparator (usu)
Exact char-by-char Comparator (c)
Numeric Comparators
Generic Number Comparator (n)
Integer Comparator (nI)
Real Number Comparator (nR)
Alphanumeric Comparator (nS)
Date Comparators
Date Comparator - Year only (dY)
Date Comparator - Month-Year (dM)
Date Comparator - Day-Month-Year (dD)
Date Comparator - Hour-Day-Month-Year (dH)
Date Comparator - Min-Hour-Day-Month-Year (dm)
Date Comparator - Sec-Min-Hour-Day-Month-Year (ds)
Prorated Comparator (p)
Sun Match Engine Comparison Function Options
© 2010, Oracle Corporation and/or its affiliates