The solution uses in-database OML4SQL, which is available by default with database version 19c and above.
Multi-Class Classification
The machine learning framework builds individual model, for each of the following industry accepted algorithms,
Decision Tree
Naive Bayes
Random Forest
Support Vector Machines
Neural Network
Generalized Linear Models
When we compare models, we select the one that best represents the underlying data.
Model Metrics used to compare models,
AUC(Area Under the Curve) – The model’s robustness is measured by this value, which is between 0 and 1. The closure the value is to 1, the better the model.
Accuracy – Measured as a function of predicted vis-à-vis actual in the testing dataset.
The important attributes, their individual weights and constant value used in building the model are stored in the database.
Regression
The machine learning framework builds individual model, for each of the following industry accepted algorithms,
Support Vector Machines
Neural Network
Generalized Linear Models
The models are compared, and the best model is selected which best represents the underlying data.
Model Metrics used to compare models,
RMSE(Root Mean Square Error)
The important attributes, their individual weights and constant value used in building the model are stored in the database.
Note:
Overall, there are one multi-classification model and three regression models for this solution