Constrained frequency analysis reports list the frequencies of various data values found in the specified fields based on defined rules. For example, you can define rules that will only include certain patterns or that will exclude certain values. This topic includes two constrained analysis definitions along with corresponding sample reports.
<ConstrainedFrequencyAnalysis> <fields> <field fieldName="Person.SSN"/> </fields> <ruleList> <rule> <dataLength fieldName="Person.SSN" len="10" more="false"/> </rule> </ruleList> </ConstrainedFrequencyAnalysis> |
The above analysis generates a report for social security numbers with less than 10 characters (which means the hyphens are likely missing). Below is a sample output.
CF_PROFILE_CONSTRAINED_FRQ_1_1–100000.csv
PERSON.SSN |
FREQUENCY |
300555444 |
1 |
299557777 |
1 |
822331111 |
2 |
999999999 |
98 |
000000000 |
115 |
The following analysis generates a report for dates of birth that are prior to 01/01/1899 (which means they likely contain typographical errors). Below is a sample output.
<ConstrainedFrequencyAnalysis> <fields> <field fieldName="Person.DOB"/> </fields> <ruleList> <rule> <dataRange fieldName="Person.DOB" min="01/01/0001" max="01/01/1899"/> </rule> </ruleList> </ConstrainedFrequencyAnalysis> |
CF_PROFILE_CONSTRAINED_FRQ_2_1–100000.csv
PERSON.DOB |
FREQUENCY |
07/08/53 |
1 |
10/04/51 |
1 |
09/16/1682 |
1 |
12/28/1680 |
2 |
05/09/1898 |
2 |