This topic describes the syntax of the --refreshData flag.
./data_processing_CLI --refreshData <logicalName>
or
./data_processing_CLI --refreshData <logicalName> --table <tableName>or
./data_processing_CLI --refreshData <logicalName> --table <tableName> --database <dbName>
The logicalName value is available in the Data Set Logical Name property in Studio. For details, see Obtaining the Data Set Logical Name.
Use of the --table and --database flags
When a data set is first created, the names of the source Hive table and the source Hive database are stored in the DSI (DataSet Inventory) metadata for that data set. The --table flag allows you to override the default source Hive table, while the --database flag can override the database set in the data set's metadata.
Note that these two flags are ephemeral. That is, they are used only for the specific run of the operation and do not update the metadata of the data set.
If these flags are not specified, then the Hive table and Hive database that are used are the ones in the data set's metadata.
Use these flags when you want to temporarily replace the data in a data set with that from another Hive table. If the data change is permanent, it is recommended that you create a new data set from desired Hive table. This will also allow you to create a Transformation script that is exactly tailored to the new data set.