Updating Data Sets
This section describes how to run update operations on BDD data sets.
About data set updates
You can update data sets by running Refresh updates and Incremental updates with the DP CLI.
Obtaining the Data Set Logical Name
The Data Set Logical Name specifies the data set to be updated.
Refresh updates
A Refresh update replaces the schema and all the records in a project data set with the schema and records in the source Hive table.
Incremental updates
An Incremental update adds new records to a project data set from a source Hive table.
Creating cron jobs for updates
You can create
cron
jobs to run your Refresh and Incremental updates.