Group Shuffle

Purpose

The Group Shuffle masking format enables you to randomly reorder (shuffle) column data within discrete units, or groups, where there is a relationship among the members of each group.

Inputs

Grouping Columns (Optional): One or more reference columns that should be used to group the values in the column to be masked. The grouping columns and the column to be masked must belong to the same table.

Supported Data Types

Character
Numeric
Date

Characteristics

Supports Double-Byte Characters: Yes
Combinable: No
Deterministic: No
Reversible: No
Uniqueness: Yes, this masking format ensures uniqueness for columns that have unique constraints

Example

Suppose you have two groups of employees: managers (M) and workers (W). You want to shuffle all the salaries, but you do not want the salaries of the managers getting mixed into the salaries of the workers. You can use the Group Shuffle masking format to shuffle the SALARY column within each group, which is derived from the unique values in the JOB_CATEGORY column.

The following table illustrates a group shuffle on the SALARY column, where the JOB_CATEGORY column is the grouping column. The rows with JOB_CATEGORY = M belong to one group and the SALARY values belonging to this group are shuffled within the group. Similarly, the rows with JOB_CATEGORY = W belong to another group and the SALARY values belonging to this group are shuffled within the group.

EMPLOYEE	JOB_CATEGORY	SALARY	SHUFFLED_SALARY
Alice	M	90	88
Bill	M	88	90
Carol	W	72	70
Denise	W	57	45
Eddie	W	70	57
Frank	W	45	72