Siebel Data Quality Administration Guide > Data Quality Performance Considerations >

Performance Considerations for Siebel Data Quality Matching Server


The three key tables S_PER_DEDUP_KEY, S_PRSP_DEDUP_KEY, and S_ORG_DEDUP_KEY may include six times more records than their corresponding base tables, depending on the key type used in the key generation stage. For limited key type, they may include two to four times more records. For standard key type, they may include at least six times more records. Work with a database administrator and follow recommendations for the Siebel Data Quality Matching Server to optimize the parameters for your database:

To execute concurrent Data Quality Manager server tasks to deduplicate data

  1. Query the base tables to find the search specifications so that each query result set contains between 50,000 and 75,000 records.
  2. NOTE:  For users who have over 100,000 records in their base tables, it can be time consuming to reprocess all the records. For that reason, the recommended record set for subsequent matching and key generation jobs consists only of outdated and new records. Outdated records are records that are modified after they are tagged. In SQL terms, outdated records are those in which [Updated] > (DQ tag).

    For example, the following table provides search specification solutions using the Object WHERE Clause to run key generation or deduplication.

    To Query for...
    Key Generation Example
    Deduplication Example
    Updated records
    ([DeDup Key Modification Date]<[Updated])
    ([DeDup Last Match Date]<[Updated])
    New
    records
    ([DeDup Key Modification Date] IS NULL)
    ([DeDup Last Match Date] IS NULL)
    Updated and new records
    ([DeDup Key Modification Date]<[Updated]) OR ([DeDup Key
    Modification Date]IS NULL)
    (([DeDup Last Match Date]<[Updated]) OR ([DeDup Last Match Date] IS NULL))

  3. Start concurrent server tasks using the search specifications provided in Step 1 until the entire table is completely processed.

 Siebel Data Quality Administration Guide 
 Published: 15 May 2003