Group Shuffle
Purpose
The Group Shuffle masking format enables you to randomly reorder (shuffle) column data within discrete units, or groups, where there is a relationship among the members of each group.
Inputs
- Grouping Columns (Optional): One or more reference columns that should be used to group the values in the column to be masked. The grouping columns and the column to be masked must belong to the same table.
Supported Data Types
- Character
- Numeric
- Date
Characteristics
- Supports Double-Byte Characters: Yes
- Combinable: No
- Deterministic: No
- Reversible: No
- Uniqueness: Yes, this masking format ensures uniqueness for columns that have unique constraints
Example
Suppose you have two groups of employees: managers (M) and workers (W). You want to shuffle all the salaries, but you do not want the salaries of the managers getting mixed into the salaries of the workers. You can use the Group Shuffle masking format to shuffle the SALARY
column within each group, which is derived from the unique values in the JOB_CATEGORY
column.
The following table illustrates a group shuffle on the SALARY
column, where the JOB_CATEGORY
column is the grouping column. The rows with JOB_CATEGORY
= M belong to one group and the SALARY
values belonging to this group are shuffled within the group. Similarly, the rows with JOB_CATEGORY
= W belong to another group and the SALARY
values belonging to this group are shuffled within the group.
EMPLOYEE | JOB_CATEGORY | SALARY | SHUFFLED_SALARY |
---|---|---|---|
Alice | M | 90 | 88 |
Bill | M | 88 | 90 |
Carol | W | 72 | 70 |
Denise | W | 57 | 45 |
Eddie | W | 70 | 57 |
Frank | W | 45 | 72 |