You can update data sets by running Refresh updates and Incremental updates with the DP CLI.
When first created, a BDD data set may be sampled, which means that the BDD data set has fewer records than its source Hive table. In addition, records that are later added to the source Hive table are not added to the data set by default.
Note that the equivalent of a DP CLI Refresh update can be done in Studio via the Load Full Data Set feature. However, Incremental updates can be performed only via the DP CLI, as Studio does not support this feature.
Re-pointing a data set
If you created a data set by uploading source data into Studio and want to run Refresh and Incremental updates, you should change the source data set to point to a new Hive table. (Note that this change is not required if the data set is based on a table created directly in Hive.) For information on this re-pointing operation, see the topic on converting a project to a BDD application in the Studio User's Guide.