Data set settings controlled by edp-cli.properties or in Studio

In this release, most of a data set's workflow configuration is controlled by the Workflow Manager Service and its configuration file, edp.properties. A few settings are exceptions: you control them in other locations. This topic summarizes these exceptions.

In this release, a data set's configuration is set in three locations:

  1. The edp.properties file in the Workflow Manager Service directory, $BDD_HOME/workflowmanager/dp/config. This file is newly added in this release, along with the Workflow Manager Service.
  2. The edp-cli.properties file in the CLI directory, $BDD_HOME/dataprocessing/edp_cli/config.
    Note: In this release, this file is renamed. In the previous release, its name was edp.properties.
  3. The Data Processing Settings page in the Control Panel in Studio.
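
As a rough illustration of the two file-based locations listed above, the following Python sketch builds both paths from a BDD_HOME value and reports whether each file is present. The function name config_locations and the fallback directory /opt/bdd are hypothetical and used only for illustration; the directory and file names come from the list above.

```python
import os
from pathlib import Path


def config_locations(bdd_home):
    """Return the two properties files named in this topic, relative to BDD_HOME."""
    home = Path(bdd_home)
    return {
        # Workflow Manager Service configuration (new in this release)
        "workflow_manager": home / "workflowmanager" / "dp" / "config" / "edp.properties",
        # DP CLI configuration (named edp.properties in the previous release)
        "dp_cli": home / "dataprocessing" / "edp_cli" / "config" / "edp-cli.properties",
    }


if __name__ == "__main__":
    # /opt/bdd is an assumed placeholder, used only when BDD_HOME is not set.
    bdd_home = os.environ.get("BDD_HOME", "/opt/bdd")
    for name, path in config_locations(bdd_home).items():
        status = "found" if path.exists() else "missing"
        print(f"{name}: {path} ({status})")
```

The third location, the Data Processing Settings page in Studio, is not a file and is therefore not covered by this sketch.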

This table lists the exceptions: the data set workflow settings that you change in edp-cli.properties or in Studio. (All other data set workflows and their settings are set in the edp.properties file in the Workflow Manager Service directory.)

| Data set workflow setting | If loaded or reloaded in Studio | If created or refreshed from DP CLI |
|---|---|---|
| Sample size | Change bdd.sampleSize on the Data Processing Settings page in the Control Panel on each Studio node. | Change maxRecordsForNewDataSet in edp-cli.properties, or override it with the --maxRecords flag on the command line. |
| Language | Change the data set's language when loading a new file in Studio. | Change the defaultLanguage property in edp-cli.properties. |
| Whether to run enrichments | Change bdd.enableEnrichments on the Data Processing Settings page in the Control Panel on each Studio node. | Change runEnrichment in edp-cli.properties, or override it with the --runEnrichment flag on the command line. |
| Access type | The data set is always loaded as private and you cannot change this setting. However, you can add the data set to a project and share the project with others in your group. | Change datasetAccessType in edp-cli.properties. |
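
The DP CLI column above describes a precedence rule: a value in edp-cli.properties applies unless a command-line flag overrides it. The following Python sketch models that rule only; it is not how the DP CLI is implemented. The property and flag names come from the table, but every value in SAMPLE_PROPERTIES is an illustrative placeholder, the helper names are hypothetical, and the sketch assumes that --maxRecords takes a number and that --runEnrichment is a switch with no argument.

```python
import argparse

# Illustrative values only; check the Data Processing Guide for real defaults.
SAMPLE_PROPERTIES = """\
# edp-cli.properties (fragment)
maxRecordsForNewDataSet=1000000
defaultLanguage=en
runEnrichment=false
datasetAccessType=private
"""


def parse_properties(text):
    """Parse simple key=value lines, skipping blank lines and comments."""
    props = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith(("#", "!")):
            continue
        key, _, value = line.partition("=")
        props[key.strip()] = value.strip()
    return props


def effective_settings(props, argv):
    """Apply command-line overrides on top of the file values."""
    parser = argparse.ArgumentParser(add_help=False)
    # Assumption: --maxRecords takes a numeric argument.
    parser.add_argument("--maxRecords")
    # Assumption: --runEnrichment is a switch; None means "not given".
    parser.add_argument("--runEnrichment", action="store_true", default=None)
    args, _unknown = parser.parse_known_args(argv)

    settings = dict(props)
    if args.maxRecords is not None:
        settings["maxRecordsForNewDataSet"] = args.maxRecords
    if args.runEnrichment:
        settings["runEnrichment"] = "true"
    return settings


if __name__ == "__main__":
    props = parse_properties(SAMPLE_PROPERTIES)
    print(effective_settings(props, ["--maxRecords", "5000", "--runEnrichment"]))
```

In this model, defaultLanguage and datasetAccessType have no command-line override, which matches the table: those two settings are changed only in edp-cli.properties.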

For complete information on using these configuration files, see the Data Processing Guide.

For information on changing Data Processing settings in Studio, see the Administrator's Guide.