Add Database Analytics to a Data Flow

Database analytics enable you to detect anomalies, cluster data, sample data, and unpivot data. Database analytics are executed in the database, not in Oracle Analytics, so you must be connected to an Oracle database or Oracle Autonomous Data Warehouse.

Use the Database Analytics step in the data flow editor.
Before you start, create a connection to your Oracle database or Oracle Autonomous Data Warehouse and use it to create a data set.
  1. In the data flow editor, click Add a step (+), and select Database Analytics.
    If you aren't connected to an Oracle database or Oracle Autonomous Data Warehouse, you won't see the Database Analytics option.
  2. On the Select Database Analytics page, select a function type, then click OK.
    Function types and their descriptions:

    Dynamic Anomaly Detection: Detect anomalies in your input data without a predefined model. For example, you might want to highlight unusual financial transactions.
    When you deploy this function with large data sets, configure the partition columns to maximize performance.

    Dynamic Clustering: Cluster your input data without a predefined model. For example, you might want to characterize and discover customer segments for marketing purposes.
    When you deploy this function with large data sets, configure the partition columns to maximize performance.

    Un-pivoting Data: Transpose data that's stored in columns into row format. For example, you might want to transpose multiple columns showing a revenue metric value for each year to a single revenue column with multiple value rows for the year dimension. You select the metric columns to transpose and specify a name for the new column. You'll get a new data set with fewer columns and more rows.

    Sampling Data: Select a random percentage of the data in a table. You specify the percentage of data you want to sample. For example, you might want to randomly sample ten percent of your data.
  3. On the Analytics Operation <type> pane, configure the operation.
    • Use the Outputs area to specify the data columns to analyze.
    • Use the Parameters area to configure options for the operation.
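
To make the Un-pivoting Data and Sampling Data function types more concrete, the following Python sketch mimics their effect on a small in-memory table. It is only an illustration of the transformations described above, not how Oracle Analytics or the database implements them. The function names (unpivot, sample_percent), the column names, and the revenue figures are all hypothetical, and the database's own sampling may select rows differently (for example, it may not return an exact row count).

```python
import random


def unpivot(rows, id_columns, metric_columns, name_column, value_column):
    """Turn each selected metric column into its own row (wide to long)."""
    long_rows = []
    for row in rows:
        for metric in metric_columns:
            # Keep the identifying columns, then add one name/value pair.
            new_row = {col: row[col] for col in id_columns}
            new_row[name_column] = metric
            new_row[value_column] = row[metric]
            long_rows.append(new_row)
    return long_rows


def sample_percent(rows, percent, seed=None):
    """Return a random sample containing about `percent` percent of the rows."""
    rng = random.Random(seed)
    k = round(len(rows) * percent / 100)
    return rng.sample(rows, k)


# Hypothetical input: one revenue column per year.
wide = [
    {"product": "A", "2021": 100, "2022": 120, "2023": 150},
    {"product": "B", "2021": 80, "2022": 90, "2023": 95},
]

# Unpivot the three year columns into a "year" column and a "revenue" column.
long_rows = unpivot(wide, ["product"], ["2021", "2022", "2023"], "year", "revenue")

# Randomly sample 50 percent of the unpivoted rows.
sampled = sample_percent(long_rows, 50, seed=0)

for row in long_rows:
    print(row)
print(len(sampled), "rows sampled")
```

In this sketch, the two input rows with four columns each become six rows with three columns each (product, year, revenue), which matches the "fewer columns and more rows" outcome described for Un-pivoting Data.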