Data Preparation for O-Cluster

Use Automatic Data Preparation (ADP) for binning and handling missing values, ensuring optimal clustering performance.

ADP bins numerical attributes for O-Cluster. It uses a specialized form of equi-width binning that computes the number of bins per attribute automatically. Numerical columns with all nulls or a single value are removed. O-Cluster handles missing values naturally as missing at random.

Note:

O-Cluster does not support nested columns, sparse data, or unstructured text.

Related Topics