1.3.7.8 Min/Max Profiler

The Max/Min Profiler examines the extremes of the data in each attribute, and returns:

  • The shortest value

  • the longest value

  • the 'lowest' value

  • the 'highest' value

Use the Max/Min Profiler to gain an initial understanding of your data. The Max/Min Profiler gives you a quick overview of whether or not your data conforms to its length and valid value restrictions, and allows you to find 'outliers'; that is, values that are clearly out of range, such as number amounts that are larger or smaller than expected, date values that are earlier or later than expected, or text values that consist only of invalid characters such as '#', or of data cheats such as 'aaa' or 'zzz'.

The following table describes the configuration options:

Configuration Description

Inputs

Specify any attributes in which you want to find Data Maxima and Minima.

Options

None.

Outputs

Describes any data attribute or flag attribute outputs.

Data Attributes

None.

Flags

The following flag is output:

  • [Attribute Name].CharLength: indicates the number of characters in the attribute. Possible values are the Number of characters.

The Max/Min Profiler requires a batch of records to produce useful statistics. It must therefore run to completion before its results are available, and is not suitable for a process that requires a real time response.

When executed against a batch of transactions from a real time data source, it will finish its processing when the commit point (transaction or time limit) configured on the Read Processor is reached.

The following table describes the statistics produced by the profiler for each attribute.

Statistic Description

Minimum Length

The number of characters of the shortest value in the attribute, in number of characters.

Maximum Length

The number of characters of the longest value in the attribute, in number of characters.

Minimum Value

The 'lowest' value in the attribute.

For Number attributes, this is the lowest numeric value.

For Date attributes, this is the earliest date.

For Text attributes, this is the first value alphabetically.

Note that Null values are ignored in this analysis, but other types of No Data (for example, values consisting only of spaces) are not.

Maximum Value

The 'highest' value in the attribute.

For Number attributes, this is the highest numeric value.

For Date attributes, this is the latest date.

For Text attributes, this is the last value alphabetically.

Note that Null values are ignored in this analysis, but other types of No Data (for example, values consisting only of spaces) are not.

Clicking on the Additional Information button shows the number and percentage of records with the minimum length, maximum length, minimum value and maximum value, alongside the above statistics.

Example

In this example, the Max/Min Profiler examines all attributes in a table of Customer records:

Table 1-122 Max/Min Profiler

Input Field Total number Minimum Length Maximum Length Minimum Value Maximum Value

CU_NO

2010

2

6

10

875825

CU_ACCOUNT

2010

7

12

00-0-XX

OO-24282-LR

TITLE

2010

1

12

1

The Reverend

NAME

2010

4

29

# ADAMS

aaaaaaaaa

GENDER

2010

1

1

1

M

BUSINESS

2010

2

41

Stoke Newington Town Hall

e-sites.co.uk

ADDRESS1

2010

1

50

(Brassfounders) LD, Coursington Road

kjhkg

ADDRESS2

2010

1

31

WARRINGTON

jhgfhj

ADDRESS3

2010

1

22

Aberdeen

jhvgj

POSTCODE

2010

1

8

1P1 3HS

gjhgj

AREA_CODE

2010

1

4

0

2920

TEL_NO

2010

1

7

1

4227051

EMAIL

2010

1

50

5

zoe.peckham@btopenworld.com

ACC_MGR

2010

2

3

22

WH

DT_PURCHASED

2010

5

10

01/01/1995

Brian

DT_ACC_OPEN

2010

5

10

01/01/1995

Brian

DT_LAST_PAYMENT

2010

19

19

01-Jan-1970 00:00:00

21-Mar-2004 00:00:00

DT_LAST_PO_RAISED

2010

19

19

01-Jan-1970 00:00:00

14-Feb-2004 00:00:00

BALANCE

2010

1

10

-999999

410.5