1.3.7.12 Record Completeness Profiler

The Record Completeness Profiler allows you to get an overview of how complete, or otherwise, your data is. It informs you how many of the attributes that constitute the record contain data, and bands the records according to the number of complete attributes.

Use the Record Completeness Profiler to find dummy records that have been entered without capturing the proper information. When this occurs, users usually enter the minimum amount of data that they have to, meaning the record will be less complete than a full valid record.

The following table describes the configuration options:

Configuration Description

Inputs

Specify any number of attributes required in completeness analysis.

Options

None.

Outputs

Describes any data attribute or flag attribute outputs.

Data Attributes

None.

Flags

The following flags are output:

  • PercentPopulated: indicates the percentage of the selected attributes which are populated. Possible values are a number between 0 and 100.

  • PopulatedAttributes: indicates the number of the selected attributes which are populated, out of the total number of attributes selected. Possible values are 'Number' of 'Number'.

The following table describes the statistics produced by the profiler:

Statistic Description

Record completeness %

Each distinct percentage completeness found, calculated in terms of the percentage of attributes analyzed that were not null.

Complete attributes

How many attributes were not null, and how many attributes were analyzed.

Matching records

The number of records with the given percentage completeness.

Note:

The Record Completeness Profiler assesses whether each attribute value is Null or Not Null. Empty Strings, or values that contain only spaces or other non-printing characters are, by default, converted to Null values in the Reader, so that you have a consistent view of whether or not each value contains any meaningful data. However, if such No Data values are not converted to Nulls in either the snapshot or the Reader, they will be considered as 'complete'.

Example

In this example, the Record Completeness Profiler assesses completeness across 4 attributes in a table of Customer records, producing the following summary results:

Record Completeness % Complete Attributes Matching Records

50.0

2 of 4

4

75.0

3 of 4

130

100.0

4 of 4

866

Drill down to find records with a specific level of completeness. For example, to see the record above with 75% record completeness drill down on that row of the summary grid.

CU_NO CU_ACCOUNT Title Gender PercentPopulated PopulatedAttributes

13815

00-23615-PB

M

75

3 of 4

13840

00-23631-JD

Miss

75

3 of 4

13913

00-23719-LR

M

75

3 of 4

13989

00-23817-LR

Ms

75

3 of 4

14130

00-23900-JD

Ms

75

3 of 4

14166

00-23945-LR

Mr

75

3 of 4