JavaScript is required to for searching.
Skip Navigation Links
Exit Print View
Loading the Initial Data Set for a Master Index     Java CAPS Documentation
search filter icon
search icon

Document Information

Loading the Initial Data Set for a Master Index

Related Topics

Initial Bulk Match and Load Overview

Initial Bulk Match and Load Process Overview

Data Preparation, Matching, and Loading Procedure Overview

Distributed Processing

About the Bulk Match Process

Block Distribution

Record Matching

EUID Assignment

Master Index Image Creation

Potential Duplicate Creation

About the Bulk Load Process

About the Cluster Synchronizer

Required Format for Flat Data Files

Generating the Initial Bulk Match and Load Tool

To Generate the Initial Bulk Match and Load Tool

Configuring the Environment

To Configure the Environment

Creating the Cluster Synchronizer Database

To Create the Cluster Synchronization Tables

Configuring the Initial Bulk Match and Load Tool

Configuring the Initial Bulk Match and Load Tool Processing

To Configure the IBML Tool

Configuring Initial Bulk Match and Load Tool Logging

To Configure IBML Tool Logging

Initial Bulk Match and Load Tool Configuration Properties

Initial Bulk Match and Load Tool Field Validation Configuration

Initial Bulk Match and Load Tool Blocking Query Configuration

Initial Bulk Match and Load Tool Match String Configuration

Initial Bulk Match and Load Tool Processing Configuration

FTP Server Configuration

Cluster Synchronizer Database Configuration

SQL*Loader Configuration

Data Reader Configuration

Additional Properties

Initial Bulk Match and Load Tool Logging Properties

Performing a Match Analysis

Running the Bulk Matcher in Analysis Mode

To Run the Bulk Matcher in Analysis Mode

Reviewing the Match Analysis Results

Reconfiguring the Matching Logic

To Reconfigure the Matching Logic

Performing the Bulk Match

To Perform the Bulk Match

Running the Bulk Match and Bulk Load in One Step (SQL*Loader Only)

To Run the Bulk Match and Bulk Load in One Step

Loading the Matched Data Into the Master Index Database

Loading Matched Data Using SQL*Loader

To Load Matched Data Using SQL*Loader

Loading Matched Data Using the Command-Line Bulk Loader

To Load Matched Data Using the Command-Line Bulk Loader

Command-Line Bulk Loader Properties

Performing the Bulk Match

After you perform the match analysis and are satisfied with the matching logic configuration, you are ready to match the full set of data to be loaded into the master index database.

If you are using SQL*Loader to load the matched data, you can run the Bulk Matcher and the Bulk Loader all in the same step. For instructions on how to do this, see Running the Bulk Match and Bulk Load in One Step (SQL*Loader Only).

To Perform the Bulk Match

  1. Complete the steps under Performing a Match Analysis.
  2. From the master IBML Tool home directory, run cluster-truncate.sql against the cluster synchronizer database.
  3. For each IBML Tool, open loader-config.xml (located in the IBML Tool home directory in the conf subdirectory).
  4. Set the matchAnalyzerMode property to false.
  5. Verify that the rest of the properties are configured correctly, and then save and close the file.

    For information about the configurable properties, see Initial Bulk Match and Load Tool Configuration Properties.

  6. To configure and run the match process, do one of the following.
    • If the master loader is running on Windows:
      1. Navigate to the master IBML Tool home directory and open run-loader.bat for editing.
      2. Change the value of the JDBC_JAR_PATH variable in the first line to the location and name of the database driver for the master index database platform; for example, set JDBC_JAR_PATH=C:\oracle\jdbc\lib\ojdbc14.jar.
      3. Close and save the file.
      4. Double-click run-loader.bat or type run-loader from a command line.
    • If the master loader is running on UNIX:
      1. Navigate to the master IBML Tool home directory and open run-loader.sh for editing.
      2. Change the value of the JDBC_JAR_PATH variable in the first line to the location and name of the database driver for the master index database platform; for example, export JDBC_JAR_PATH=${oracle_home}/jdbc/lib/ojdbc14.jar.
      3. Close and save the file.
      4. Type sh run-loader.sh at the command line.
  7. Examine the log files in the logs directory of the IBML Tool home directory to verify that no errors occurred during the match process.
  8. Examine the files located in the masterindex folder of your working directory to be sure all tables were populated.

    Figure 4 Master Index Image Files

    image: Figure shows the master index image files created in the working directory.
  9. If you configured the IBML Tool to keep the temporary folders in the working directory, you can examine the blocking, EUID, and match files for additional verification.

    Figure 5 IBML Tool Working Directory

    image:Figure shows the structure of the IBML Tool working directory.
  10. To load the data into the master index database, follow one of the procedures under Loading the Matched Data Into the Master Index Database.