This topic shows some workflow examples using the DP CLI.
Excluding specific Data Enrichment modules
The --excludePlugins flag (abbreviated as -ep) specifies a list of Data Enrichment modules to exclude when enrichments are run. This flag should be used only enrichments are being run as part of the workflows (for example, with the --excludePlugins flag).
./data_processing_CLI --excludePlugins <excludeList>
where excludeList is a space-separated string of one or more of these Data Enrichment canonical module names:
address_geo_tagger
(for the Address GeoTagger)ip_geo_extractor
(for the IP Address GeoTagger)reverse_geo_tagger
(for the Reverse GeoTagger)tfidf_term_extractor
(for the TF.IDF Term extractor)doc_level_sentiment_analysis
(for the document-level Sentiment Analysis module)language_detection
(for the Language Detection module)./data_processing_CLI --table masstowns --runEnrichment --excludePlugins reverse_geo_tagger
For details on the Data Enrichment modules, see Data Enrichment Modules.
Cleaning up aborted jobs
./data_processing_CLI --cleanAbortedJobs
... [2015-07-13T10:18:13.683-04:00] [DataProcessing] [INFO] [] [org.apache.spark.Logging$class] [tid:main] [userID:fcalvill] client token: N/A diagnostics: N/A ApplicationMaster host: web12.example.com ApplicationMaster RPC port: 0 queue: root.fcalvill start time: 1436797065603 final status: SUCCEEDED tracking URL: http://web12.example.com:8088/proxy/application_1434142292832_0016/A user: fcalvill Clean aborted job completed. data_processing_CLI finished with state SUCCESS
EDP: CleanAbortedJobsConfig{}
Ping checking the DP components
./data_processing_CLI --pingCheck
... [2015-07-14T14:52:32.270-04:00] [DataProcessing] [INFO] [] [com.oracle.endeca.pdi.logging.ProvisioningLogger] [tid:main] [userID:fcalvill] Ping check time elapsed: 7 ms data_processing_CLI finished with state SUCCESS