This topic describes how to run a Refresh update operation.
To run a Refresh update on a data set:
...
client token: N/A
diagnostics: N/A
ApplicationMaster host: web2014.example.com
ApplicationMaster RPC port: 0
queue: root.fcalvill
start time: 1437157181086
final status: SUCCEEDED
tracking URL: http://web2014.example.com:8088/proxy/application_1436970078353_0020/A
user: fcalvill
Refreshing existing collection: default_edp_171506f0-e2d6-4ed1-8f5e-052a1fad721a_10135
Collection key for new record: refreshed_edp_34cdbff2-2e5f-4c09-9388-2b9f5ae3148e
data_processing_CLI finished with state SUCCESS
EDP: DatasetRefreshConfig{hiveDatabase=, hiveTable=,
collectionToRefresh=edp_cli_edp_479776cd-2d93-4de0-bfc0-196b7f16b2b5_10121,
newCollectionName=refreshed_edp_0f49f22d-7344-4448-b82f-3c70bfad6314, op=REFRESH_DATASET}
You can also check the Dgraph HDFS Agent log for the status of the Dgraph ingest operation.
Note that future Refresh updates on this data set will continue to use the same data set key. You will also use this key if you set up a Refresh update cron job for this data set.