Component: RUN DQ RULE

The following table describes the fields in the Run DQ Rule.

Table 11-14 Fields in the Run DQ Rule and their Descriptions

Property Description
DQ Group Name Refers to the Data Quality Groups consisting of associated Data Quality Rule definition(s). Select the required DQ Group from the drop-down list.
Rejection Threshold Specify the percentage of Rejection Threshold (%) limit in numeric value. This refers to the maximum percentage of records that can be rejected in a job. If the percentage of failed records exceeds the Rejection Threshold, the job will fail. If the field is left blank, the default the value is set to 100%.
Additional Parameters

Specify the Additional Parameters as filtering criteria for execution in the pattern Key#Data type#Value; Key#Data type#Value;…etc.

Here the Data type of the value should be “V” for Varchar/Char, or “D” for Date with “MM/DD/YYYY” format, or “N” for numeric data. For example, if you want to filter some specific region codes, you can specify the Additional Parameters value as $REGION_CODE#V#US;$CREATION_DATE#D#07/06/1983;$ACCOUNT

_BAL#N#10000.50;

Note: In case the Additional Parameters are not specified, the default value is fetched from the corresponding table in configuration schema for execution.

Parameters

Comma separated parameters where first value is considered as the threshold percentage, followed by additional parameters which are a combination of three tokens. Example, “90”,”PARAM1”,”D”,”VALUE1”,”PARAM2”,”V”,”VALUE2”.

Note: Parameter ‘Fail if threshold is breached” is defaulted to “Yes” for RRF executions.

Optional Parameter

For DQ Rule execution on Spark, specify EXECUTION_VENUE=Spark in this field.

Note that, you should have registered a cluster from DMT Configurations > Register Cluster window with the following details:

  • Name- Enter name of the Hive information domain.
  • Description- Enter a description for the cluster.
  • Livy Service URL- Enter the Livy Service URL used to connect to Spark from OFSAA.