8.1 Managing Data Analysis Tool

The Data Analysis Tool enables you to determine how well a customer has populated the Production Data Model.

By reviewing the quality of data in each of the tables that the schema identifies, the Data Analysis Tool indicates how well the supplied data can support scenarios. The tool does not make assumptions about data quality. Rather, it provides a repeatable way to run a set of analytical queries across the data. You can then use the results to direct further analysis.

The following are the key features of the Data Analysis Tool:
  • Counts all table rows in the schema.
  • Identifies unique values and their distribution against the table.
  • Determines the number of null occurrences for a specified column.
  • Determines the number of padded spaces that occur for a specified column.
  • Checks referential integrity between tables.
The tool provides its results in either a text or Hypertext Markup Language (HTML) file. You can then use these results to direct an investigation for data quality.

Note:

To use the Data Analysis Tool effectively, you must have basic knowledge of Structured Query Language (SQL) and Extensible Markup Language (XML).
The following sections provide instructions for using the tool: