Predictive Query nodes enable you to score data dynamically without a predefined model. Predictive Queries use in-database scoring technology.
Note:
Predictive Query Nodes require Oracle 12c Release 1 or later.
Scoring using Predictive Query nodes has the following limitations:
The transient models created during the running of Predictive Query node are not available for inspection or fine tuning.
If it is necessary to inspect the model, correlate scoring results with the model, specify special algorithm settings, or run multiple scoring queries that use the same model, then a predefined model must be created.
The output of a Predictive Query is the output of an Apply operation.
See Also:
There are several Predictive Query nodes:
An Anomaly Detection Query node analyses the input for anomalies. That is, it detects unusual cases in data.
Note:
Predictive Query Nodes require Oracle 12c Release 1 or later.
Anomaly Detection Query can run in parallel.
Related Topics
You create an Anomaly Detection Query node to build an Anomaly Detection model to analyze and detect anomalous occurrences such as fraud.
In the an Anomaly Detection Query Node dialog box, you can specify or change the characteristics of the models to build.
To edit an Anomaly Detection Query node:
See Also:
Oracle Data Miner automatically selects output for query. You can select and edit the parameters of the output functions.
The default output is listed in the Anomaly Prediction Outputs section in the Anomaly Predictions tab. The defaults are:
Prediction
Prediction Details
Prediction Probability
You can select Prediction Set and edit parameters of the output functions. You can perform the following tasks:
Delete: To delete an output, select the output and click .
Add: To add an output, click . Use the Add Anomaly Function dialog box to select an output.
Edit: To edit an output, either double-click the function or select the function and click . Use Edit Anomaly Function dialog box to make changes.
The output of a Predictive Query is the output of an Apply (Scoring) operation.
See Also:
You can add an anomaly function in the Anomaly Function dialog box.
To add anomaly function:
You can edit the anomaly function in the Edit Anomaly Function dialog box.
To edit an anomaly function:
In the Properties pane, you can examine and change the characteristics or properties of a node.
To view the properties of a node, click the node and click Properties. If the Properties pane is closed, then go to View and click Properties. Alternately, right-click the node and click Go to Properties.
The context menu options depend on the type of the node. It provides the shortcut to perform various tasks and view information related to the node.
To view the Anomaly Detection Query node context menu, right-click the node. The following options are available in the context menu:
Edit. Opens Edit an Anomaly Detection Query dialog box.
View Data. It performs two functions:
Runs the node on a small sample of the data.
Opens the View Data for a Predictive Query dialog box.
Performance Settings. This opens the Edit Selected Node Settings dialog box, where you can set Parallel Settings and In-Memory settings for the node.
Show Runtime Errors. Displayed only if there is an error.
Show Validation Errors. Displayed if there are validation errors.
A Clustering Query node returns the clusters in the input.
Note:
Predictive Query nodes require Oracle 12c Release 1 or later.
A Clustering Query can run in parallel.
Related Topics
You create a Clustering Node to build clustering models.
To create a Clustering Query in an existing workflow:
In the Edit Clustering Query Node dialog box, you can specify or change the characteristics of the models to build.
To edit a Clustering Query node:
See Also:
The output of a Predictive Query is the output of an Apply (scoring) operation.
Oracle Data Miner automatically selects output for query. The default outputs are listed in the Cluster Prediction Outputs section in the Cluster Predictions tab. The defaults are:
Cluster Details
Cluster Distance
Cluster ID
Cluster Probability
You can also select Cluster Set, and edit parameters of the output functions. You can perform the following tasks:
Delete: To delete an output, select the output and click .
Add: To add an output, click . Use Add Cluster Function dialog box to select an output.
Edit: To edit an output, either double-click the function or select the function and click . Use the Edit Cluster Function dialog box to make changes.
See Also:
In the Add Cluster Function dialog box, you can add cluster functions. To add Cluster function:
In the Properties pane, you can examine and change the characteristics or properties of a node.
To view the properties of a node, click the node and click Properties. If the Properties pane is closed, then go to View and click Properties. Alternately, right-click the node and click Go to Properties.
The Clustering Query Properties pane has these sections:
The context menu options depend on the type of the node. It provides the shortcut to perform various tasks and view information related to the node.
To view the Clustering Query node context menu, right-click the node. The following options are available in the context menu:
Edit. Opens the Edit a Clustering Query dialog box.
View Data. It perform two functions:
Runs the node on a small sample of data.
Opens the View Data for a Predictive Query dialog box.
Performance Settings: This opens the Edit Selected Node Settings dialog box, where you can set Parallel Settings and In-Memory settings for the node.
Show Runtime Errors. Displayed only if there is an error.
Show Validation Errors. Displayed if there are validation errors.
A Feature Extraction Query extracts features from the input.
Note:
Predictive Query nodes require Oracle 12c Release 1 or later.
A Feature Extraction Query node can run in parallel.
You create a Feature Extraction Query note to extract features from the data source or input.
To create a Feature Extraction Query to an existing workflow:
In the Feature Extraction Query Node dialog box, you can specify or change the characteristics of the models to build.
To edit a Feature Extraction Query Node:
See Also:
The output of a Predictive Query is the output of an Apply (scoring) operation. Oracle Data Miner automatically selects output for query. The default output is Feature Set. You can also select feature ID, feature details, and feature value. You can edit the parameters of the functions and perform the following tasks:
Delete: To delete an output, select the output and click .
Add: To add an output, click . Use the Add Feature Function dialog box to select an output.
Edit: To edit an output, either double-click the function or select the function and click . Use the Edit Feature Function dialog box to make changes.
See Also:
In the Properties pane, you can examine and change the characteristics or properties of a node.
To view the properties of a node, click the node and click Properties. If the Properties pane is closed, then go to View and click Properties. Alternately, right-click the node and click Go to Properties.The Feature Extraction Query properties enables you to view and change information about this Predictive Query node. If the Properties pane is closed, then go to View and Properties. Alternately, right-click the node and click Go to Properties.
The Feature Extraction Query Properties has these sections:
The context menu options depend on the type of the node. It provides the shortcut to perform various tasks and view information related to the node.
To view the Feature Extraction Query node context menu, right-click the node. The following options are available in the context menu:
Edit. Opens Edit Feature Extraction Query dialog box.
View Data. It perform two functions:
Runs the node on a small sample of the data.
Opens the View Data for a Predictive Query dialog box.
Performance Settings. This opens the Edit Selected Node Settings dialog box, where you can set Parallel Settings and In-Memory settings for the node.
Show Runtime Errors. Displayed only if there is an error.
Show Validation Errors. Displayed if there are validation errors.
A Prediction Query node performs classification and regression using the input.
The data type of the target determines whether classification or regression is performed. A Prediction Query can run in parallel.
Note:
Predictive Query nodes require Oracle 12c Release 1 or later.
This section on Prediction Query node contains the following topics:
See Also:
You create a Prediction Query node to perform the data mining functions Classification or Regression on the input data, depending on the type of input data.
To create a Prediction Query in an existing workflow:
In the Edit Prediction Query Node dialog box, you can specify or change the characteristics of the models to build.
To edit a Prediction Query node:
You must add at least one target. Targets can have different mining types.
To add a target:
Partitioning columns result in building a virtual model for each unique partition. Because the virtual model uses data only from a specific partition, it can potentially predict cases more accurately than if you did not select a partition.
In addition to selecting attributes, you can specify partitioning expressions. Partitioning expressions are concatenated and the result expression is the same for all predictive functions.
Optionally, you can add partitioning expressions.
Use Expression Builder to create an expression.
To specify a partitioning expression, click .
Suppose one of the partitions is AGE. Here is a sample partitioning expression:
CASE WHEN AGE < 20 THEN 1 WHEN AGE >=20 AND AGE < 40 THEN 2 WHEN AGE >=40 AND AGE < 60 THEN 3 ELSE 4 END
Suppose this expression is named Expression_1
. After you run the node, the output includes a column titled Expression_1
. This column will contain the value 1
if AGE
is less than 20
, 2
if AGE
is 20
or larger but less than 40
, and so forth.
See Also:
The output of a Predictive Query is the output of an Apply (Scoring) operation. Oracle Data Miner automatically selects output for the target. The default output is listed in the Prediction Outputs section in the Predictions tab.
For Classification, the default output are:
Prediction
Prediction Details
Prediction Probability
You can also select Prediction Set.
For Regression, the default outputs are:
Prediction
Prediction Details
For Classification or Regression, you can edit the parameters of functions and the output for each target one at a time. You can perform the following tasks:
To edit Prediction Output Function, select a target and click . You edit output for each target one at a time. Use the Edit Prediction Function dialog box to make changes.
To delete an output, select the output and click .
To add an output, select the target in the Targets section and click . Use the Add Prediction Output Function dialog box to select an output.
See Also:
To add Prediction Output Function:
The Input tab shows all columns that are used as input for the Predictive Query.
Target (for Prediction Query) and Case ID columns are identified with special icons. The Rule column in the grid explains why an attribute is not used.
By default, Determine inputs automatically (using heuristics) is selected. After you run the node, you can click the link to View Heuristic Results Details. To change the inputs, deselect Determine inputs automatically (using heuristics). You can perform the following tasks:
Override defaults, and add or remove input columns.
Change Mining Types: To change the mining type of a column, click the Mining Type entry for the column, and select a new mining type from the drop-down list.
Ignore columns: If you do not want to use a column as input, click the Input entry for the attribute and select from the drop-down list. It ignored the selected column, and not used for input. To use a column, select
from the drop-down list.
Search column: To search for columns, use the Find field.
The View Heuristic Results Details dialog box provides detailed information about automatic changes made to the input.
For example, the mining type is changed to Categorical
when the number of unique values is less than the threshold value of 5.
A column that has a constant value is excluded (not used as input).
The Output tab shows the columns that will be used in the output to identify the prediction data.
By default, all target columns for Prediction Query, the Case ID column, and partitioning columns are automatically added to Additional Output. You can perform the following tasks:
Add Additional Output: To add additional output, click Automatic to turn off automatic selection. Then click . Use the Add Supplemental dialog to add columns to the output.
Remove Output Columns: To remove columns, select the column and click .
You must run a Predictive Query node to view the data.
To run Predictive Query nodes, right-click the node and select either Run or View Data. Virtual models may take a while to be formulated. The View Data option generates a small sample output of the query.
Regardless of how you run it, select View Data to view the results of the query.
See Also:
For Predictive Query Nodes, the View Data viewer displays the output from a node is run. It also displays the results when the query is applied to a small subset of the data.
You can view prediction details to see Prediction Details in a separate window.
See Also:
"Data Source Node Viewer"In the Properties pane, you can examine and change the characteristics or properties of a node.
To view the properties of a node, click the node and click Properties. If the Properties pane is closed, then go to View and click Properties. Alternately, right-click the node and click Go to Properties.
The Prediction Query Node Properties has these sections:
The context menu options depend on the type of the node. It provides the shortcut to perform various tasks and view information related to the node.
To view the Prediction Query node context menu, right-click the node. The following options are available in the context menu:
Edit. Opens Edit a Prediction Query dialog box.
View Data. It perform two functions:
Runs the node on a small sample of the data.
Opens the View Data for a Predictive Query dialog box.
Performance Settings. This opens the dialog box, where you can set Parallel Settings and In-Memory settings for the node.
Show Runtime Errors. Displayed only if there is an error.
Show Validation Errors. Displayed if there are validation errors.