Data Matching (Deduplication) Process for Universal Connector


The data matching (deduplication) functionality of the Siebel Data Quality Universal Connector relies on validated third-party vendor software to supply the matching rules and algorithms and to maintain any match keys or match logic.

The methodologies and matching capabilities of external applications vary by vendor. Matching rules and weightings are typically configurable within the external application. After you run batch deduplication, the Siebel Data Quality Universal Connector reports the possible matches in the Duplicate Resolution views in the Data Administration screen (View > Site Map > Data Administration > Data Quality). The data administrator can then manually merge the records in the Data Quality administration views. For more information about merging duplicate records, see Merging Duplicate Records.
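To make the idea of configurable rules and weightings concrete, the following minimal Python sketch scores two records by comparing fields and weighting each comparison. It is an illustration only, not the vendor's algorithm or a Siebel API: the field names, the weight values, and the use of difflib for string similarity are all assumptions.

```python
from difflib import SequenceMatcher

# Hypothetical field weightings; a real vendor product defines its own.
WEIGHTS = {"last_name": 0.5, "first_name": 0.3, "city": 0.2}


def similarity(a, b):
    """Return a case-insensitive string similarity between 0.0 and 1.0."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def match_score(rec_a, rec_b, weights=WEIGHTS):
    """Return a weighted match score between 0 and 100 for two records."""
    total = sum(weights.values())
    score = sum(w * similarity(rec_a[f], rec_b[f]) for f, w in weights.items())
    return 100 * score / total
```

In a sketch like this, raising the weight of a field makes differences in that field count more heavily against a match, which is the kind of tuning that is typically done in the external application.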

During the batch deduplication process, all records in the database are passed to the third-party software. The software uses an optimized algorithm to separate records into groups, which reduces the number of record comparisons. One key difference between the Siebel Data Quality Universal Connector and the Siebel Data Quality Matching Server is that, in the Universal Connector, key generation and deduplication are combined into one process. After batch deduplication using the Siebel Data Quality Universal Connector, the third-party vendor software saves the key values for records as files on your hard disk. During real-time deduplication, the third-party vendor software uses the key values stored in the files to find possible duplicates.
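The following Python sketch illustrates the general pattern described above: records are grouped by a match key so that only records in the same group are compared, the keys are generated in the same pass as the deduplication, and the key values are written to a file that a later real-time lookup can read. Everything here is hypothetical, including the key definition (first three letters of the last name plus postal code), the JSON file format, and the function names. The actual grouping algorithm and file layout are determined by the third-party vendor software.

```python
import json
from collections import defaultdict
from itertools import combinations


def match_key(record):
    """Hypothetical match key: first 3 letters of last name plus postal code."""
    return (record["last_name"][:3] + record["postal_code"]).upper()


def batch_deduplicate(records, key_file):
    """Generate keys and collect candidate duplicate pairs in one pass."""
    groups = defaultdict(list)
    for record in records:
        groups[match_key(record)].append(record["id"])

    # Only records that share a key are paired for comparison.
    candidates = []
    for ids in groups.values():
        candidates.extend(combinations(ids, 2))

    # Persist the key values so a real-time lookup can reuse them.
    with open(key_file, "w", encoding="utf-8") as handle:
        json.dump(groups, handle)
    return candidates


def realtime_candidates(new_record, key_file):
    """Return IDs of stored records whose key matches the new record's key."""
    with open(key_file, encoding="utf-8") as handle:
        groups = json.load(handle)
    return groups.get(match_key(new_record), [])
```

With n records, comparing every pair requires n(n-1)/2 comparisons. Grouping by key limits comparisons to pairs within each group, at the cost of missing duplicates whose keys differ.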

TIP:  Run batch deduplication against a business component before running real-time deduplication, because real-time deduplication uses the key values that are saved after batch deduplication. For more information about batch deduplication, see Batch Mode. For more information about real-time deduplication, see Real-Time Mode.


 Siebel Data Quality Administration Guide 
 Published: 15 May 2003