Configuring Sun Master Indexes

Specifying the Master Index Duplicate Threshold

The duplicate threshold is the lowest matching probability weight at which two records are considered potential duplicates of one another. Record pairs with a lower weight are not considered possible matches. Record pairs whose weight falls between the duplicate and match thresholds are flagged as potential duplicates and must be resolved manually. You can configure the duplicate threshold by using the Configuration Editor or by modifying master.xml directly.
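
The relationship between the two thresholds can be sketched in a few lines of Python. This is an illustration only, not the master index's own decision logic: the function name classify, the Outcome labels, and the match threshold of 29.0 used in the usage lines are hypothetical, and treating both thresholds as inclusive lower bounds is an assumption.

```python
from enum import Enum


class Outcome(Enum):
    NON_MATCH = "non-match"
    POTENTIAL_DUPLICATE = "potential duplicate"
    MATCH = "match"


def classify(weight: float, duplicate_threshold: float, match_threshold: float) -> Outcome:
    """Classify a matching probability weight against the two thresholds.

    Assumption: a weight equal to a threshold falls into the higher category.
    """
    if duplicate_threshold >= match_threshold:
        raise ValueError("the duplicate threshold must be lower than the match threshold")
    if weight >= match_threshold:
        return Outcome.MATCH
    if weight >= duplicate_threshold:
        # Between the thresholds: flagged for manual resolution.
        return Outcome.POTENTIAL_DUPLICATE
    return Outcome.NON_MATCH


# Usage with a duplicate threshold of 7.9 and a hypothetical match threshold of 29.0.
for w in (5.0, 7.9, 30.0):
    print(w, classify(w, 7.9, 29.0).value)
```

Lowering the duplicate threshold widens the band of weights that lands in the potential-duplicate category, which means more record pairs to resolve manually.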

To Specify the Duplicate Threshold (Configuration Editor)

  1. In the Projects window, right-click the Configuration node in the project you want to modify, and then click Edit.

    The Configuration Editor appears.

  2. Click the Matching tab.

  3. In the Duplicate Threshold field, enter the lowest weight at which two records should be considered potential duplicates.


    Note –

    This value can be any float value lower than the match threshold but higher than the lowest possible matching probability weight.


  4. On the Configuration Editor toolbar, click Save.
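
Before entering a value, it can help to check it against the constraint in the note above. The following Python sketch is one way to do that outside the Configuration Editor. It is not part of the product: the function name, the match threshold of 29.0, and the lowest possible weight of -30.0 are hypothetical values chosen for illustration.

```python
def validate_duplicate_threshold(value: str, match_threshold: float, lowest_weight: float) -> float:
    """Parse value as a float and check that it lies strictly between
    the lowest possible matching weight and the match threshold."""
    try:
        threshold = float(value)
    except ValueError:
        raise ValueError(f"not a float value: {value!r}") from None
    # The chained comparison also rejects NaN and infinite values.
    if not lowest_weight < threshold < match_threshold:
        raise ValueError(
            f"{threshold} must be higher than {lowest_weight} "
            f"and lower than {match_threshold}"
        )
    return threshold


# Usage with hypothetical bounds.
print(validate_duplicate_threshold("7.9", match_threshold=29.0, lowest_weight=-30.0))
```

The lowest possible weight depends on how the match engine is configured, so the bound passed in here has to come from your own matching configuration.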

To Specify the Duplicate Threshold (XML Editor)

  1. In the Projects window, expand the Configuration node in the project you want to modify, and then double-click master.xml.

  2. Scroll to the parameter element named DuplicateThreshold in the DecisionMakerConfig element.

  3. Change the value of the parameter-value element. For example:


    <parameter>
       <parameter-name>DuplicateThreshold</parameter-name>
       <parameter-type>java.lang.Float</parameter-type>
       <parameter-value>7.9</parameter-value>
    </parameter>

    Note –

    This value can be any float value lower than the match threshold but higher than the lowest possible matching probability weight.


  4. Save and close the file.
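
The same edit can be scripted. The sketch below uses Python's standard xml.etree.ElementTree module to change the parameter-value of the DuplicateThreshold parameter. It works on an inline fragment modeled on the example above. The parameters wrapper element in the fragment, the absence of XML namespaces, and the helper name set_duplicate_threshold are assumptions, so compare them with your own master.xml before relying on it.

```python
import xml.etree.ElementTree as ET

# Inline fragment modeled on the example above; the <parameters> wrapper is an assumption.
SAMPLE = """<DecisionMakerConfig>
  <parameters>
    <parameter>
      <parameter-name>DuplicateThreshold</parameter-name>
      <parameter-type>java.lang.Float</parameter-type>
      <parameter-value>7.9</parameter-value>
    </parameter>
  </parameters>
</DecisionMakerConfig>"""


def set_duplicate_threshold(root: ET.Element, value: float) -> None:
    """Set parameter-value on the parameter named DuplicateThreshold."""
    # iter() searches at any depth, so the exact nesting does not matter.
    for param in root.iter("parameter"):
        if param.findtext("parameter-name") == "DuplicateThreshold":
            value_element = param.find("parameter-value")
            if value_element is None:
                raise ValueError("DuplicateThreshold has no parameter-value element")
            value_element.text = str(value)
            return
    raise KeyError("DuplicateThreshold parameter not found")


root = ET.fromstring(SAMPLE)
set_duplicate_threshold(root, 8.5)
print(ET.tostring(root, encoding="unicode"))
```

To apply this to a real file, load it with ET.parse, pass the result of getroot() to the helper, and save with the tree's write method. Note that ElementTree's default parser discards XML comments, so work on a copy if the comments in master.xml matter to you.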