The Data Integrator Wizard was enhanced in Java CAPS 6 Update 1. The instructions in this topic might differ from what is available in Release 6.
You can use the Data Integrator Wizard to generate the Bulk Loader for a master index application. The Bulk Loader loads data that has already been cleansed, standardized, and matched into a master index database. The source files for the Bulk Loader are those generated by the Bulk Matcher.
Complete the steps under Creating a New Data Integrator Project.
Make sure the master index database is running, and that your NetBeans IDE is connected to the master index database.
In order to specify the source files for the Bulk Loader, you need to run the Bulk Matcher first. For more information see, Loading the Initial Data Set for a Sun Master Index.
On the NetBeans Projects window, expand the new Data Integrator project and right-click Collaborations.
Point to New, and then select ETL.
The New File Wizard appears with the Name and Location window displayed.
Enter name for the collaboration.
Click Next.
On the Select Type of ETL Loader window on the New File Wizard, select Bulk Loader.
Click Next.
The Select or Create Database window appears.
To specify a staging database to use for external data sources (for this project only), do one of the following:
Click Next.
The Select JDBC Target Tables window appears.
To choose the target tables to load the extracted data into, do the following:
Under Available Connections, select the master index database.
Under Schemas, select the schema that contains the tables to load the data into.
Under Schemas, select only the tables that correspond to the data files produced by the Bulk Matcher, and then click Select.
You can use the Shift and Control keys to select multiple tables at once. If you select target tables that do not correspond to the Bulk Matcher files, collaborations without source table are generated and the project fails to build.
Click Next.
The Choose Bulk Loader Data Source window appears.
To specify the source data for the Bulk Loader, do the following:
In the upper portion of the window, browse to the location of the of the output files from the Bulk Matcher.
These files are located in NetBeansProjects_Home/Project_Name/loader-generated/loader/work/masterindex, where work is the location you specified for the working directory in loader-config.xml.
Select all of the data files in the masterindex directory, and then click Add.
Click Next.
The Map Selected Collaboration Tables window appears.
To map source and target data, do the following:
To disable constraints on the target tables, select Disable Target Table Constraints.
Select the SQL statement type to use for the transfer. You can select insert, update, or both.
The wizard automatically maps the source and target tables for you. Review the mapping to verify its accuracy.
Not every table on the left will be mapped. For example, system tables such as SBYN_COMMON_HEADER, SBYN_COMMON_DETAIL, SBYN_APPL, and SBYN_SYSTEMS do not need to be mapped.
Click Finish.
An ETL collaboration is created for each target table. This might take a few minutes to generate.
You can further configure the ETL collaboration in the ETL Collaboration Editor. For more information, see Configuring ETL Collaborations.
To load the data into the master index database, you can either run each collaboration individually, or you can generate a batch file that will run all collaborations for you. For more information, see Loading Matched Data Using the Data Integrator Wizard Bulk Loader in Loading the Initial Data Set for a Sun Master Index.