66/77
6 Data Loading and Updates
This section discusses options for initial data loading and data updates. It illustrates how you can load files in Studio, or using Data Processing CLI.
- Data loading options
BDD offers several options for data loading. You can load data by running the data loading workflow with DP CLI. Also, in Studio, you can upload a personal file or import data from a JDBC source.
- Data loading and sample size
You can load either a sample or a full data set. If you load a sample, you can go to a full data set later. This topic summarizes how to get from a sample to a full data set.
- Studio-loaded files: data update diagram
The diagram in this topic shows data sets loaded in Studio by uploading a personal file or importing data from a JDBC source. It illustrates how you can reload this data set in Studio. Also, you can update the data set with DP CLI, and increase its size from sample to full.
- DP CLI-loaded files: data update diagram
The diagram in this topic shows data sets loaded by Data Processing component of BDD, from Hive. The diagram illustrates how you can update this data set using DP CLI, and increase its size from sample to full.
- Data update options
Here is a summary of how you can update data loaded into BDD, and when each type of update is useful to use.