Data Quality Services and Data Import

Data Quality Services ensure the quality of data being imported into the registry. You can configure a data import or file import process to activate these services before importing data into the database.

You can use the following data quality services:

  • Batch Deduplication: Allows you to define deduplication within the data being loaded.

  • Registry Deduplication: Allows you to define deduplication of the data being loaded against the records that already exist in the registry.

  • Import to Registry options: Allows you to define the import process mode, data cleansing, and geography validation.

Data Quality Services for File-Based Import

You can define Registry deduplication options in the file-based import process. You can't define the other Data Quality Services, such as Batch Deduplication and Import to Registry options, for a file-based import activity during the file-based import process. However, an import activity can be paused and sent for administrator review after preprocessing and before importing the data. The administrator can review the import process and configure data quality services.

Import activities are paused and sent for administrator review if the HZ_IMP_PAUSE_FILE_IMPORT profile option is set to Yes.

Data Quality Services for Data Import

You can configure data quality services for data import. For both registry and batch deduplication methods, you select a match configuration to identify duplicates and duplicate resolution action. You can specify:

  • Run the import process in preview mode.

  • Cleanse data before import.

  • Perform geography validation of data before import.

You can configure geography validation for data import at the site level by setting the HZ_IMP_DEFAULT_GEO_VALID_ADDRESS profile option to Yes. The addresses are validated against the master reference geography data, according to the geography-based address validation settings for each country. The addresses with validation errors aren't imported.