
Match Key Generation


When data matching is performed in real time or in batch mode, data quality searches the database for records that potentially match the current record (the record entered by a real-time user or the active record in the batch job). These records are called candidate records. When comparing the current record with existing records in the database, data quality does not use raw data, but instead uses match key values.

Match keys are calculated by applying an algorithm to specified fields in customer records. Typically, keys are generated from a combination of name, address, and other identifier fields. Examples are a person's name (first name, middle name, last name) for prospects and contacts, or the account name for accounts.
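The idea of deriving a key from record fields can be sketched as follows. This is an illustration only: the actual key algorithm is not described in this topic, and the function names (normalize, contact_match_key, account_match_key), the choice of prefix lengths, and the normalization rules are all assumptions made for the example.

```python
import unicodedata


def normalize(value):
    """Uppercase a field, strip accents, and drop non-alphanumeric characters."""
    decomposed = unicodedata.normalize("NFKD", value or "")
    ascii_only = decomposed.encode("ascii", "ignore").decode("ascii")
    return "".join(ch for ch in ascii_only.upper() if ch.isalnum())


def contact_match_key(first_name, middle_name, last_name):
    """Hypothetical key: 4 letters of last name + first and middle initials."""
    return (
        normalize(last_name)[:4]
        + normalize(first_name)[:1]
        + normalize(middle_name)[:1]
    )


def account_match_key(account_name):
    """Hypothetical key: first 8 normalized characters of the account name."""
    return normalize(account_name)[:8]


# Differently formatted input for the same person yields the same key.
print(contact_match_key("John", "Q.", "Smith"))   # SMITJQ
print(contact_match_key(" john ", "q", "SMITH"))  # SMITJQ
```

Because the key is computed from normalized values rather than raw data, records that differ only in case, spacing, or punctuation end up with the same key and can be found as candidates for each other.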

You generate match keys for records in the database by using batch jobs, as described in Generating or Refreshing Keys Using Batch Jobs.

Typically, an administrator generates and refreshes keys on a periodic basis by running batch jobs. In such batch jobs, keys can be generated for all accounts, all contacts, all prospects, or subsets of these records as defined by search specifications that include a WHERE clause.

Because key data can become out of sync with the base tables, you must refresh the key data periodically. Key generation regenerates the keys for all the records covered by the search specification. Key refresh, however, regenerates the keys only for records that are covered by the search specification and that are new or have been modified since your last key generation. Key refresh is therefore much faster than key generation.

For example, if there are records as follows:

  • Record 1. The record has a key and has not been updated.
  • Record 2. The record has been updated, so its key is out of sync with the record.
  • Record 3. The record is new and no key has been generated for it yet.

If you generate match keys with a search specification that covers records 1, 2, and 3, new keys are generated for records 1, 2, and 3. However, if you refresh match keys with a search specification that covers records 1, 2, and 3, new keys are generated for records 2 and 3 only.
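The difference between the two operations in this example can be modeled with a short sketch. It is not the product's implementation: the Record class, the logical timestamps, the toy_key function, and the covers predicate (standing in for a search specification) are hypothetical names introduced for illustration, and staleness is assumed to be detectable by comparing a per-record modification time with the time the key was generated.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Record:
    record_id: int
    name: str
    modified_at: int                        # hypothetical logical timestamp
    key: Optional[str] = None
    key_generated_at: Optional[int] = None


def toy_key(record):
    """Stand-in for the real key algorithm, which is not shown here."""
    return record.name.upper().replace(" ", "")[:6]


def generate_keys(records, covers, now):
    """Regenerate keys for every record covered by the search specification."""
    touched = []
    for r in records:
        if covers(r):
            r.key, r.key_generated_at = toy_key(r), now
            touched.append(r.record_id)
    return touched


def refresh_keys(records, covers, now):
    """Regenerate keys only for covered records that are new or modified."""
    touched = []
    for r in records:
        stale = (
            r.key is None
            or r.key_generated_at is None
            or r.key_generated_at < r.modified_at
        )
        if covers(r) and stale:
            r.key, r.key_generated_at = toy_key(r), now
            touched.append(r.record_id)
    return touched


def sample_records():
    return [
        Record(1, "Acme", modified_at=5, key="ACME", key_generated_at=10),
        Record(2, "Acme Intl", modified_at=12, key="ACME", key_generated_at=10),
        Record(3, "Zenith", modified_at=15),  # new record, no key yet
    ]


covers_all = lambda r: r.record_id in (1, 2, 3)
print(generate_keys(sample_records(), covers_all, now=20))  # [1, 2, 3]
print(refresh_keys(sample_records(), covers_all, now=20))   # [2, 3]
```

Refresh does less work because it skips record 1, whose key is already in sync with the record.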

The batch capability is useful in the following circumstances:

  • If you deploy data quality in a Siebel application implementation that already contains data
  • If you receive new data using an input method that does not involve the object manager, such as EIM, or batch methods such as the List Import Service Manager
  • If you want to review data periodically to ensure the correctness of previous matching efforts

For instructions about using batch jobs to generate or refresh keys, see Generating or Refreshing Keys Using Batch Jobs.

Additionally, if real-time data matching is enabled for users, keys are automatically generated (or refreshed) for a record whenever the user saves a new Account, Contact, or List Mgmt Prospective Contact record or modifies and commits an existing record to the database.

If no keys are generated for a certain record, that record is ignored as a potential candidate record when matching takes place.
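The effect of a missing key on candidate selection can be illustrated with a minimal sketch. It assumes, for the sake of the example, that candidates are selected by exact equality of key values; the find_candidates function and the sample keys are hypothetical and do not represent the product's actual lookup.

```python
def find_candidates(current_key, stored_keys):
    """Return IDs of stored records whose key equals the current record's key.

    stored_keys maps record ID to key value, or to None when no key has
    been generated. Records without a key can never be candidates.
    """
    return [
        record_id
        for record_id, key in stored_keys.items()
        if key is not None and key == current_key
    ]


stored = {
    1: "SMITJQ",
    2: None,       # no key generated yet: ignored during matching
    3: "SMITJQ",
    4: "JONEAB",
}
print(find_candidates("SMITJQ", stored))  # [1, 3]
```

Record 2 is skipped even if its raw data would match the current record, which is why keys must be generated before matching can consider a record.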

Siebel Data Quality Administration Guide Copyright © 2013, Oracle and/or its affiliates. All rights reserved.