In this release, a data set's workflow configuration is controlled mostly by the Workflow Manager Service and its configuration file, edp.properties. In a few cases, however, you control the data set's workflow characteristics in other locations. This topic summarizes these exceptions.
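In this release, three locations set a data set's configuration: edp.properties, the Studio settings, and the DP CLI configuration. The following table lists the settings that you control in Studio or through the DP CLI rather than in edp.properties: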
| Data set workflow setting | If loaded or reloaded in Studio | If created or refreshed from DP CLI |
|---|---|---|
| Sample size | Change bdd.sampleSize in Data Processing Settings in the Control Panel on each Studio node. | Change maxRecordsForNewDataSet in edp-cli.properties, or override it with the --maxRecords flag on the command line. |
| Language | Change the data set's language when loading a new file in Studio. | Change the defaultLanguage property in edp-cli.properties. |
| Whether to run enrichments | Change bdd.enableEnrichments on the Data Processing Settings page in the Control Panel on each Studio node. | Change runEnrichment in edp-cli.properties, or override it with the --runEnrichment flag on the command line. |
| Access type | The data set is always loaded as private, and you cannot change this setting. However, you can add the data set to a project and share the project with others in your group. | Change datasetAccessType in edp-cli.properties. |
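
As a rough illustration of the DP CLI column, the four properties named in the table might appear together in edp-cli.properties as in the fragment below. The property names come from the table; the values are placeholders chosen for illustration only and are assumptions, not documented defaults. Check the Data Processing Guide for the values each property accepts.

```properties
# Sketch of the data set workflow settings in edp-cli.properties.
# All values below are illustrative assumptions, not documented defaults.

# Sample size: maximum number of records for a new data set.
# Can be overridden per run with the --maxRecords flag.
maxRecordsForNewDataSet=1000000

# Language assigned to data sets created from the DP CLI.
defaultLanguage=en

# Whether to run enrichments.
# Can be overridden per run with the --runEnrichment flag.
runEnrichment=false

# Access type of the data set created from the DP CLI.
datasetAccessType=private
```

Because the two command-line flags override their properties only for the run in which they are supplied, the file is the natural place for values you want applied to every DP CLI workflow.
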
For complete information on using these configuration files, see the Data Processing Guide.
For information on changing Data Processing settings in Studio, see the Administrator's Guide.