This topic describes how to run a Refresh update operation.
To run a Refresh update on a data set:
...
client token: N/A
diagnostics: N/A
ApplicationMaster host: web2014.example.com
ApplicationMaster RPC port: 0
queue: root.fcalvill
start time: 1437157181086
final status: SUCCEEDED
tracking URL: http://web2014.example.com:8088/proxy/application_1436970078353_0020/A
user: fcalvill
Refreshing existing collection: default_edp_171506f0-e2d6-4ed1-8f5e-052a1fad721a_10135
Collection key for new record: refreshed_edp_34cdbff2-2e5f-4c09-9388-2b9f5ae3148e
data_processing_CLI finished with state SUCCESS
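If you capture the command output in a script, the result of the run can be checked programmatically. The following Python sketch is illustrative only: the function name summarize_refresh_output and the field names in its result are hypothetical, and the patterns are inferred solely from the sample output above, so they may need adjusting for other releases.

```python
import re

# Sample text copied from the output shown above (leading lines omitted).
SAMPLE_OUTPUT = """\
final status: SUCCEEDED
tracking URL: http://web2014.example.com:8088/proxy/application_1436970078353_0020/A
user: fcalvill
Refreshing existing collection: default_edp_171506f0-e2d6-4ed1-8f5e-052a1fad721a_10135
Collection key for new record: refreshed_edp_34cdbff2-2e5f-4c09-9388-2b9f5ae3148e
data_processing_CLI finished with state SUCCESS
"""

# Assumption: these labels appear exactly as in the sample output.
PATTERNS = {
    "final_status": r"final status:\s*(\S+)",
    "refreshed_collection": r"Refreshing existing collection:\s*(\S+)",
    "new_collection_key": r"Collection key for new record:\s*(\S+)",
    "cli_state": r"data_processing_CLI finished with state\s+(\S+)",
}


def summarize_refresh_output(text):
    """Pull the status and collection fields out of captured CLI output."""
    summary = {}
    for name, pattern in PATTERNS.items():
        match = re.search(pattern, text)
        summary[name] = match.group(1) if match else None
    # Treat the run as successful only if both status fields agree.
    summary["succeeded"] = (
        summary["final_status"] == "SUCCEEDED"
        and summary["cli_state"] == "SUCCESS"
    )
    return summary


if __name__ == "__main__":
    for key, value in summarize_refresh_output(SAMPLE_OUTPUT).items():
        print(f"{key}: {value}")
```

Because each pattern captures up to the next whitespace character, the same function also works when the output arrives as a single unbroken line.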
EDP: DatasetRefreshConfig{hiveDatabase=, hiveTable=, collectionToRefresh=edp_cli_edp_479776cd-2d93-4de0-bfc0-196b7f16b2b5_10121, newCollectionName=refreshed_edp_0f49f22d-7344-4448-b82f-3c70bfad6314, op=REFRESH_DATASET}
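To inspect a DatasetRefreshConfig entry like the one above in a script, the key=value pairs can be split into a dictionary. This is a minimal sketch: the function name parse_config_entry is hypothetical, and it assumes that pairs are separated by a comma and a space and that values contain no such separator, which holds for the sample entry but is not confirmed for every log entry.

```python
import re

# Log entry copied from the example above.
SAMPLE_LINE = (
    "EDP: DatasetRefreshConfig{hiveDatabase=, hiveTable=, "
    "collectionToRefresh=edp_cli_edp_479776cd-2d93-4de0-bfc0-196b7f16b2b5_10121, "
    "newCollectionName=refreshed_edp_0f49f22d-7344-4448-b82f-3c70bfad6314, "
    "op=REFRESH_DATASET}"
)


def parse_config_entry(line):
    """Return (config_name, fields) for a 'Name{key=value, ...}' log entry."""
    match = re.search(r"(\w+)\{(.*)\}", line)
    if match is None:
        raise ValueError("no Name{key=value, ...} entry found")
    name, body = match.groups()
    fields = {}
    # Assumption: pairs are separated by ", " and values never contain it.
    for pair in body.split(", "):
        if not pair:
            continue
        key, _, value = pair.partition("=")
        fields[key.strip()] = value.strip()
    return name, fields


if __name__ == "__main__":
    config_name, config_fields = parse_config_entry(SAMPLE_LINE)
    print(config_name)
    for key, value in config_fields.items():
        print(f"  {key} = {value!r}")
```

Empty values, such as hiveDatabase and hiveTable in the sample, are returned as empty strings.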
You can also check the Dgraph HDFS Agent log for the status of the Dgraph ingest operation.
Note that future Refresh updates on this data set will continue to use the same data set key. You will also use this key if you set up a Refresh update cron job for this data set.