This section describes how to configure and use the Data Processing Command Line Interface utility.
DP CLI overview
The DP CLI (Command Line Interface) shell utility is used to launch Data Processing workflows, either manually or via a cron job.
DP CLI configuration
The DP CLI has a configuration file, edp.properties, that sets its default properties.
DP CLI flags
The DP CLI has a number of runtime flags that control its behavior.
Using whitelists and blacklists
A whitelist specifies which Hive tables should be processed in Big Data Discovery, while a blacklist specifies which Hive tables should be ignored during data processing.
DP CLI cron job
You can have the BDD installer create a cron job that runs the DP CLI.