|Bookshelf Home | Contents | Index | PDF|
After data quality identifies candidate records, they are sent to the third-party software. The software calculates a match score from 0 to 100 to indicate the degree of similarity between the candidate records and the current record.
The match score is calculated using a large number of rules that compensate for how frequently a given name or word appears in a language. The rules then weigh the similarity of each field on the record according to the real-world frequency of the name or word. For example, Smith is a common last name, so a match on a last name of Smith would carry less weight than a match on a last name that is rare.
The algorithms used to calculate match scores are complex. These algorithms are the intellectual property of third-party software vendors; Oracle Corporation cannot provide details about how these algorithms work.
The third-party software examines the candidate records, computes a match score for each record that is identified as a duplicate, and returns the duplicate records to data quality. The match score is a number that represents the similarity of a record to the current active record. It is calculated taking into account a large number of rules along with a number of other factors and weightings.
|Siebel Data Quality Administration Guide||Copyright © 2013, Oracle and/or its affiliates. All rights reserved. Legal Notices.|