8 Cross-Validate Models
Cross-validation is a model improvement technique that avoids the limitations of a single train-and-test experiment by building and testing multiple models through repeated sampling from the available data.
Predictive models are usually built on given data and verified on held-aside or unseen data. The purpose of cross-validation is to offer better insight into how well the model would generalize to new data and to avoid over-fitting and deriving wrong conclusions from misleading peculiarities of the seen data.
The ore.CV
utility R function uses Oracle Machine Learning for R for performing cross-validation of regression and classification models.
For a select set of algorithms and cases, the function ore.CV
performs cross-validation for models that were generated by OML4R regression and classification functions using in-database data.
The ore.CV
function works with models generated by the following OML4R functions:
-
ore.odmDT
-
ore.odmGLM
-
ore.odmNB
-
ore.odmSVM
You can also use ore.CV
to cross-validate models generated with some R regression functions through OML4R embedded R execution. Those R functions are the following:
-
lm
-
glm
-
svm
To download the function ore.CV
, see define the function and run the function.