How do you automatically link duplicate records?

You can create a duplicate identification batch and define subset rules to retrieve a subset of the records to identify duplicates within the batch or in the database.

The auto linking task involves the following two steps:
  1. Create a duplicate identification batch and select Create link request for all records as the automatic processing option.
  2. Perform the task Run Request Dispatch Job to disposition the duplicate resolution sets.

Create a duplicate identification batch and a subset rule

We first proceed to creating a duplicate identification batch. To do this, you can specify the criteria for retrieving a subset of records in the duplicate identification batch. This is known as defining Subset rules, which are also known as Batch Selection Criteria rules. The data quality engine identifies potential duplicates from this subset of records based on one of these rules:
  • Match all keywords: Select this option to perform an AND operation.
  • Match any keyword: Select this option to perform an OR operation.

Now that you have an overview of the task, let's first create a duplicate identification batch to identify duplicate organizations in the registry, and then create a rule to retrieve a subset of records where the organization was created after 6 June 2024. Note that you can use predefined or custom attributes.

  1. Click Navigator > Customer Data Management > Duplicate Identification.
  2. Click the Create button.

    The Create Duplicate Identification Batch page appears.

  3. Enter a batch name and description.
    Note:

    Another way is to copy an existing duplicate identification batch and quickly create a new batch from it. You can modify the details for this batch before submitting it.

  4. Specify the Batch Match Mode such as Against the Registry or Within the Batch.

    In the Within the Batch Match mode, the duplicate identification is limited to the records in a batch that meet the subset rule conditions. In the Against the Registry Batch Match mode, the process aggregates the records that meet the subset rule conditions in a batch, and these records are matched against one another as well as against other records in the database.

  5. Select the Party Type as Organization.
  6. Select the Automatic Processing Option as Create link request for all records.
  7. Provide the Batch Options. The batch options available depends upon the selected Automatic Processing Option. The following options are available when Create link request for all records is selected as the Automatic Processing Option:
    • Select an appropriate value for Cluster Key Level such as Typical.
    • Enter a value between 1 and 101, such as 70 for Match Threshold.
    • Enter a value between 1 and 101, such as 75 for Autolink Threshold.
      Note: You need to keep in mind that the Autolink Threshold values that you provide in the Batch Options area override the values set in the Manage Customer Hub Profile Options page.
  8. Click Add menu option or button under Duplicate Identification Batch: Selection Criteria.
  9. Specify the Apply Rules option as Match any keyword.
  10. Enter the following sample information in the Duplicate Identification Batch: Selection Criteria table:

    Duplicate Identification Batch: Selection Criteria

    Object Attribute Operator Value
    Organization Creation date After 6 June 2024
  11. Click Save and Close or Schedule per your requirement.

Run Request Dispatch Job

We will now proceed to running the request to disposition the duplicate resolution sets.

The dispatch job processes any resolution request in Pending or Submitted status. You can run this job in two modes:
  • On demand: Submit request dispatch jobs directly from the Duplicate Resolution work area by selecting the batch and clicking the Dispatch button. This submits the job for immediate processing.
  • Per a specific schedule: Do the following steps to set up a recurring job using the Run Request Dispatch Job. A setup user can access the Run Request Dispatch Job task in the Setup and Maintenance work area.
    1. Click the Advanced button on the Run Request Dispatch Job page.
    2. Click Schedule tab and select the Using a Schedule radio button.
    3. Select the frequency you want and click Submit.

    To see the list of dispatch jobs, and their statuses, navigate to Tools > Scheduled Processes.