Configuring Sun Master Indexes (Repository)

Specifying the Master Index Duplicate Threshold (Repository)

The duplicate threshold is the lowest matching probability weight at which two records are considered potential duplicates of one another. Any records with lower probability weights are not considered to be possible matches. Any records between the duplicate and match thresholds are flagged as potential duplicates, and must be resolved manually. You can configure the duplicate threshold by using the Configuration Editor or by modifying the XML file directly.

ProcedureTo Specify the Duplicate Threshold (Configuration Editor)

  1. In the Projects window, right-click the master index application you want to modify, and then click Open.

  2. If the Configuration Editor dialog box appears, click Edit to check out the listed files.

    The Configuration Editor appears.

  3. Click the Matching tab.

  4. In the Duplicate Threshold field, enter the lowest weight at which two records should be considered a potential match.


    Note –

    This value can be any float value lower than the match threshold but higher than the lowest possible matching probability weight.


  5. On the Configuration Editor toolbar, click Save.

ProcedureTo Specify the Duplicate Threshold (XML Editor)

  1. In the Projects window, expand the Configuration node in the project you want to modify, and then double-click the Threshold file.

  2. Scroll to the DuplicateThreshold element in the DecisionMakerConfig element.

  3. Change the value of the parameter-value element. For example:


    <parameter>
       <parameter-name>DuplicateThreshold</parameter-name>
       <parameter-type>java.lang.Float</parameter-type>
       <parameter-value>7.9</parameter-value>
    </parameter>

    Note –

    This value can be any float value lower than the match threshold but higher than the lowest possible matching probability weight.


  4. Save and close the file.