Create a Bin Column in a Data Flow

Use a bin to categorize your data by creating a new column based on the value of a measure. For example, you might categorize values for RISK into three bins for low, medium, and high.

Use the Bin step in the data flow editor.
  1. Click Add a step (+), and select Bin.
    You also create bins when you add columns using the Add Column step.
  2. Select the column whose values you want to categorize.
  3. Use the options on the Bin pane to configure your bin:
    Field Description
    Bin You'll see the column that you selected in Step 2. To categorize values in a different column, click the column name and select a different column.
    Method Specify how the data boundaries are calculated.
    • In the Manual method, the range is divided by the number of bins.
    • In the Equal Width method, the histogram range is divided into intervals of the same size. For equal width binning, the column values are measured, and the range is divided into equal-sized intervals. The edge bins can accommodate very low or very high values in the column.
    • In the Equal Height method, the height of each bin is same or very slightly different but the histogram range is equal. For equal height or frequency binning, the intervals of each bin is based on each interval containing approximately the equal number of elements (that is, records). Equal Height method is preferred specifically for the skewed data.
    Histogram View Based on the Method selected, the histogram range (width) and histogram count (height) of the bins are updated.
    List View If you select the Manual method, you can change the name of the bins, and you can define the range for each bin.

    Based on your changes, the data preview (for example, the bin column name) is updated.