Configure Data Sourcing

The schema type selected in the previous step requires the definition of database objects to be used for model creation. Data Sourcing step of Workspace provisioning allows to select tables from Oracle, Hive, or File based data sources from which data has to pulled into the Oracle based Workspace Data Schemas. However, unlike the data sourcing from RDBMS data sources, the tables will not get physicalized in the target schema and hence it is expected that the tables with compatible structures are already present in target RDBMS Schema. You can also select DMM operations such as Procedures, Functions, Sequences, and Package while configuring Data Sourcing. Once a workspace has been provisioned using DMM, it is stored in ftpshare path- ftpshare/dmm/DATE.
In case any of the selected tables are not present in the target schema, those tables are included in the failed objects count in workspace provisioning summary.
This window shows the different icons for Oracle, File, and Hive data sources.
Enter the details in this window.

Figure 6-6 Data Sourcing - External Data Source


This image displays the Data Sourcing- External Data Source.

To configure Data Sourcing, follow these steps:
  1. You can select Data Source from Data Source Name drop-down list or create a new Data Source. To create a new Data Source, see the Configure Workspace Schema section.
  2. Select the Target Data Schema. You can select multiple Data Sources for a Target Data Schema.
  3. For example, if there are D1, D2 and D3 Data Sources, then you can select the tables from all these Data Sources, tables from two Data Sources, tables from one Data Source, or as required. Here, multiple combination of tables are possible with Data Source and Target Data Source.
  4. If two Data Sources are having same tables (from different Data Sources), then the columns from the first selected table will be used. For example:
  5. If table A has columns C1, C2, C3 and Table B has columns C1, C2, and C4,then the data from the first table will be used.
  6. During the data population, only columns C1 and C2 will be used and those will be marked in Green color.
  7. Select the type of objects to be displayed in the pane that follows the drop-down list. The Object Type drop-down list will be enabled after selecting the Data Source from Data Source Name drop-down list. The following are the options in the drop-down list:
    • Table
    • View
    • Synonym
    • Function
    • Procedure
    • Package
    • Sequence
  8. Click Next to go to the next step. or Click Skip to skip the step.