Siebel Data Quality Administration Guide > Data Quality Concepts >

Calculation of Match Scores


After data quality identifies candidate records, they are sent to the third-party software. The software calculates a match score from 0 to 100 to indicate the degree of similarity between the candidate records and the current record.

The match score is calculated using a large number of rules that compensate for how frequently a given name or word appears in a language. The rules then weigh the similarity of each field on the record according to the real-world frequency of the name or word. For example, Smith is a common last name, so a match on a last name of Smith would carry less weight than a match on a last name that is rare.

The algorithms used to calculate match scores are complex. These algorithms are the intellectual property of third-party software vendors; Oracle Corporation cannot provide details about how these algorithms work.

Calculation of Match Scores Using the Universal Connector with Third-Party Data Quality Vendors

The third-party software examines the candidate records, computes a match score for each record that is identified as a duplicate, and returns the duplicate records to data quality. The match score is a number that represents the similarity of a record to the current active record. It is calculated taking into account a large number of rules along with a number of other factors and weightings.

Siebel Data Quality Administration Guide Copyright © 2018, Oracle and/or its affiliates. All rights reserved. Legal Notices.