Data Preparation for O-Cluster
Use Automatic Data Preparation (ADP) for binning and handling missing values, ensuring optimal clustering performance.
ADP bins numerical attributes for O-Cluster. It uses a specialized form of equi-width binning that computes the number of bins per attribute automatically. Numerical columns with all nulls or a single value are removed. O-Cluster handles missing values naturally as missing at random.
Note:
O-Cluster does not support nested columns, sparse data, or unstructured text.
Related Topics