Documentation Home
> Understanding the Master Index Match Engine
Understanding the Master Index Match Engine
Book Information
Understanding the Master Index Match Engine
About the Master Index Match Engine
Related Topics
Master Index Match Engine Overview
Data Matching Concepts
Deterministic and Probabilistic Data Matching
Weighting Thresholds
Probabilities and Direct Weights
Matching and Unmatching Probabilities
Agreement and Disagreement Weight Ranges
How the Master Index Match Engine Works
Master Index Match Engine Structure
Master Index Match Engine Configuration Files
Master Index Match Engine Matching Weight Formulation
Master Index Match Engine Data Types
The Master Index Match Engine and the Master Index Standardization Engine
Sun Master Index Standardization and Matching Process
Master Index Match Engine Matching Configuration
The Master Index Match Engine Match Configuration File
Master Index Match Engine Match Configuration File Format
Match Configuration File Sample
Probability Type Section
Matching Rules Section
Master Index Match Engine Matching Comparison Functions At a Glance
Master Index Match Engine Comparator Definition List
Master Index Match Engine Comparison Functions
Bigram Comparators
Bigram Comparator (b1)
Advanced Bigram Comparator (b2)
Uncertainty String Comparators
Advanced Jaro String Comparator (u)
Winkler-Jaro String Comparator (ua)
Condensed String Comparator (us)
Advanced Jaro Adjusted for First Names (uf)
Advanced Jaro Adjusted for Last Names (ul)
Advanced Jaro Adjusted for House Numbers (un)
Advanced Jaro AlphaNumeric Comparator (ujs)
Unicode String Comparator (usu)
Unicode AlphaNumeric Comparator (usus)
Exact Character-to-Character Comparator (c)
Numeric Comparators
Integer Comparator (nI)
Real Number Comparator (nR)
Condensed AlphaNumeric SSN Comparator (nS)
Date Comparators
Date Comparator With Years as Units (dY)
Date Comparator With Months as Units (dM)
Date Comparator With Days as Units (dD)
Date Comparator With Hours as Units (dH)
Date Comparator With Minutes as Units (dm)
Date Comparator With Seconds as Units (ds)
Prorated Comparator (p)
Creating Custom Comparators for the Master Index Match Engine
Custom Comparator Overview
About the Comparator Package
Defining Custom Comparators
Before You Begin
Step 1: Create the Custom Comparator Java Class
initialize
Description
Syntax
Parameters
Return Value
Throws
compareFields
Description
Syntax
Parameters
Return Value
Throws
setRTParameters
Description
Syntax
Parameters
Return Value
Throws
stop
Description
Syntax
Parameters
Return Value
Throws
Step 2: Register the Comparator in the Comparators List
To Register the Comparators
Step 3: Define Parameter Validations (Optional)
To Define Parameter Validations
validateComparatorsParameters
Description
Syntax
Parameters
Return Value
Throws
Step 4: Define Data Source Handling (Optional)
To Define Data Source Handling
handleComparatorsDataSources
Description
Syntax
Parameters
Return Value
Throws
DataSourcesProperties Class
getDataSourcesList
Description
Syntax
Parameters
Return Value
Throws
isDataSourceLoaded
Description
Syntax
Parameters
Return Value
Throws
setDataSourceLoaded
Description
Syntax
Parameters
Return Value
Throws
getDataSourceObject
Description
Syntax
Parameters
Return Value
Throws
Step 5: Define Curve Adjustment or Linear Fitting (Optional)
To Define Curve Adjustment or Linear Fitting
processCurveAdjustment
Description
Syntax
Parameters
Return Value
Throws
Step 6: Compile and Package the Comparator
Step 7: Import the Comparator Package Into Sun Master Index
To Import a Comparison Function
Step 8: Configure the Comparator in the Match Configuration File
Master Index Match Engine Configuration for Common Data Types
The Master Index Match String
Master Index Match Engine Match String Fields
Person Data Match String Fields
Address Data Match String Fields
Business Name Match String Fields
Master Index Match Engine Match Types
Configuring the Match String for a Master Index Application
Configuring the Match String for Person Data
Configuring the Match String for Address Data
Configuring the Match String for Business Names
Fine-Tuning Weights and Thresholds for Sun Master Index
Data Analysis Overview
Customizing the Match Configuration and Thresholds
Determining the Match Fields
Customizing the Match Configuration
Probabilities or Agreement Weights
Defining Relative Value
Determining the Weight Range
Weight Ranges Using Agreement Weights
Weight Ranges Using Probabilities
Comparison Functions
Determining the Weight Thresholds
Specifying the Weight Thresholds
Weight Distribution Method
Percentage Method
Fine-tuning the Thresholds
© 2010, Oracle Corporation and/or its affiliates