Skip Navigation Links | |
Exit Print View | |
![]() |
Loading the Initial Data Set for a Master Index Java CAPS Documentation |
Loading the Initial Data Set for a Master Index
Initial Bulk Match and Load Overview
Initial Bulk Match and Load Process Overview
Data Preparation, Matching, and Loading Procedure Overview
About the Cluster Synchronizer
Required Format for Flat Data Files
Generating the Initial Bulk Match and Load Tool
To Generate the Initial Bulk Match and Load Tool
Creating the Cluster Synchronizer Database
To Create the Cluster Synchronization Tables
Configuring the Initial Bulk Match and Load Tool
Configuring the Initial Bulk Match and Load Tool Processing
Configuring Initial Bulk Match and Load Tool Logging
To Configure IBML Tool Logging
Initial Bulk Match and Load Tool Configuration Properties
Initial Bulk Match and Load Tool Field Validation Configuration
Initial Bulk Match and Load Tool Blocking Query Configuration
Initial Bulk Match and Load Tool Match String Configuration
Initial Bulk Match and Load Tool Processing Configuration
Cluster Synchronizer Database Configuration
Initial Bulk Match and Load Tool Logging Properties
Running the Bulk Matcher in Analysis Mode
To Run the Bulk Matcher in Analysis Mode
Reviewing the Match Analysis Results
Reconfiguring the Matching Logic
To Reconfigure the Matching Logic
Running the Bulk Match and Bulk Load in One Step (SQL*Loader Only)
The IBML Tool provides two methods to load the master data images generated by the Bulk Matcher. A command line tool is provided to generate and then run the extract, transform, and load (ETL) collaborations that load the data. Alternatively, you can use SQL*Loader to load the data if the master index database is running on Oracle.
Perform one of the following procedures to load the matched data into your master index database:
If the master index database runs on an Oracle platform, you can use either SQL*Loader or the command-line Bulk Loader to load the matched data into the database. SQL*Loader cannot be used for a SQL Server or MySQL database.
A new directory named sqlldr is created in the working directory.
Note - Any records that contained bad data and were not inserted into the master index database are written to the sqlldr/bad directory. Any records that contained bad data and were discarded are written to the sqlldr/discard directory.
You can use the command-line Bulk Loader to load data into an Oracle, MySQL, or SQL Server database. Using the command-line tool does not require the use of NetBeans, but it does require that NetBeans be installed on the master loader machine.
This generates a zip file in the IBML Tool home directory.
This generates an ETL collaboration and creates a new directory, ETLloader, in the IBML Tool home directory.
The ETL collaboration is generated by a file that includes configurable properties you need to define. The file is named genCollab.bat for Windows and genCollab.sh for UNIX. It is located in the directory where you extracted the IBML Tool files on the master processor. The following table lists and describes the default properties for the file.
Tip - If you get a usage error when running the Bulk Loader after configuring the properties below, remove the double-quotes from around the paths and filenames (but not from the delimiters).
Table 8 Command–Line Bulk Loader Properties
|