Changes in This Release for Oracle R Enterprise

Releases of Oracle R Enterprise often contain new features. The features in the current release and in some previous releases are described in the following topics:

Changes in Oracle R Enterprise 1.5.1

Oracle R Enterprise 1.5.1 has some new features that are compatible with Oracle Database 12c, Release 12.1.0.2 and earlier, and other new features compatible with Oracle Database 12c, Release 12.2.

New Features for Oracle Database Release 12.1.0.2 and Earlier

Oracle R Enterprise 1.5.1 has the new OREdplyr package, improved performance of row ordering in ore.frame objects, and faster loading of the Oracle R Enterprise packages.

OREdplyr Package for Data Manipulation

The dplyr package provides a grammar of data manipulation functions for data.frame objects and numeric objects. The new OREdplyr package implements much of this functionality for ore.frame and ore.numeric objects. This enables in-database execution of dplyr functionality such as selecting, filtering, ordering, and grouping columns and rows, and joining, summarizing, sampling, and ranking rows.

New Features for Oracle Database Release 12.2.0.1

Oracle R Enterprise 1.5.1 has the new graph analytics package OAAgraph and has new functions in the Oracle R Enterprise Data Mining package OREdm.

OAAgraph Package

The OAAgraph package provides an R interface to the powerful Oracle Spatial and Graph Property Graph In-Memory Analyst (PGX) for use in combination with Oracle R Enterprise and database tables.

The package provides a single, unified interface supporting the complementary use of machine learning and graph analytics technologies.

Graph analytics use graph representations of data, in which data entities are nodes and relationships are edges. Machine learning produces models that identify patterns in data for both descriptive and predictive analytics. Together, these technologies complement and augment one another.

New Features of the OREdm Package

The OREdm package has some new functions that use in-database Oracle Data Mining algorithms to create models in the database and new arguments for some functions.

New Functions in the OREdm Package

New functions in the OREdm Oracle Data Mining package that use in-database algorithms are the following:

  • ore.odmEM, Expectation Maximization Models

  • ore.odmESA, Explicit Semantic Analysis Models

  • ore.odmRAlg, Extensible R Algorithm Models

  • ore.odmSVD, Singular Value Decomposition Models

The ore.odmRAlg enables users to use registered R scripts to create models that use the Oracle Data Mining in-database model framework.

Other new functions are the following:

  • partitions, which returns partition names from a partitioned model

  • settings, which returns the Oracle Data Mining parameter settings used to build the model.

New Arguments to Some Functions for Oracle Data Mining Model Build Configuration and Text Processing

The new arguments for some of the data mining model functions are:

  • odm.setting

  • ctx.setting

odm.setting

The odm.setting value is a list that specifies Oracle Data Mining parameter settings. Both Oracle Data Mining global and algorithm-specific parameters can be specified to configure the model build. Some new features are enabled through the parameter settings. For example, you can use this argument to specify the creation of a partitioned model, which is an ensemble model that consists of multiple sub-models. When you specify the parameter ODMS_PARTITION_COLUMNS and the names of the columns by which to partition the input data, the function returns a model with a sub-model for each partition. The partitions are based on the unique values found in the columns.

Partitioned models can automate scoring by allowing you to reference the top-level model only, which causes the proper sub-model to be chosen based on the values of the partitioned column or columns for each row of data to be scored.

ctx.setting

With this argument, you can specify Oracle Text attribute-specific settings. You specify the columns that should be treated as text and the type of text transformation to apply.

This argument applies to the following functions:

  • ore.odmESA, Explicit Semantic Analysis

  • ore.odmGLM, Generalized Linear Models

  • ore.odmKMeans, k-Means

  • ore.odmNMF, Non-Negative Matrix Factorization

  • ore.odmSVD, Singular Value Decomposition

  • ore.odmSVM, Support Vector Machines

Note:

To create an Oracle Text policy, the user must have the CTXSYS.CTX_DDL privilege.

Changes in Oracle R Enterprise 1.5

Oracle R Enterprise 1.5 introduces functions for managing Oracle R Enterprise datastores and scripts in the Oracle Database script repository. It also contains a new function in the OREmodel package, new transparency layer methods for some functions in the stats package, and other enhancements.

New Features in Oracle R Enterprise 1.5

Oracle R Enterprise 1.5 has new R functions and SQL procedures, new transparency layer methods, and enhancements to some functions.

Oracle R Enterprise 1.5 has new R functions and SQL procedures for managing Oracle R Enterprise datastores and the Oracle R Enterprise R script repository. Users can now share with other users access to datastores and registered R scripts. This release also has a new modeling function, ore.randomForest, new svd and prcomp statistical function methods that take ore.frame objects and can use parallel processing, and other enhancements.

The following topics briefly describe the new features.

R for Datastore and Script Repository Management

Oracle R Enterprise 1.5 provides new R functions for managing Oracle R Enterprise datastores and the R script repository.

The owner of a datastore or registered R script can now share with other users read privilege access to the datastore or script. This release also has new arguments to functions that create datastores and scripts and that give information about them. This datastore and script management functionality has both R and SQL interfaces.

The R functions for managing datastores are the following:

  • ore.delete, which deletes a datastore, is unchanged.

  • ore.grant is a new function that grants read privilege access to a datastore or script.

  • ore.load, which loads objects from a datastore into an R environment, has the new argument owner that specifies the datastore owner.

  • ore.revoke is a new function that revokes read privilege access to a datastore or script.

  • ore.save, which creates a datastore, has the new argument grantable that specifies whether read access can be granted to the datastore.

  • ore.datastore, which lists information about a datastore, has the new argument type that specifies the type of datastore. The values of type are the character strings user (the default), grant, granted, and all.

  • ore.datastoreSummary, which provides detailed information about datastores, has the new argument owner that specifies a datastore owner.

The R functions for managing scripts are the following:

  • ore.grant is a new function that grants read privilege access to a datastore or script.

  • ore.revoke is a new function that revokes read privilege access to a datastore or script.

  • ore.scriptCreate, which adds an R function to the script repository, has the new arguments global and overwrite. The argument global specifies whether the script is public or private. A public script is available to all users. If global = FALSE, then access to the script must be granted by the owner to other users. The argument overwrite specifies whether the content of a script can be replaced.

  • ore.scriptDrop, which deletes a script, has the new arguments global, which specifies whether the script to drop is public or not, and silent, which specifies whether to report an error if the script cannot be dropped.

  • ore.scriptList, which lists information about scripts, has the new argument type that specifies the type of script. The values of type are the character strings user (the default), global, grant, granted, and all.

  • ore.scriptLoad is a new function that loads a script into the R environment.

PL/SQL and Data Dictionary Views for Datastore and Script Repository Management

Oracle R Enterprise 1.5 provides new PL/SQL procedures for managing Oracle R Enterprise datastores and the R script repository.

The owner of a datastore or registered R script can now share with other users read privilege access to the datastore or script. This release also has new arguments to procedures that create datastores and scripts and that give information about them. This functionality has both R and SQL interfaces. Oracle Database data dictionary views provide information about datastores and scripts.

The SQL procedures for controlling access to Oracle R Enterprise datastores and registered R scripts are described in the following sections:

PL/SQL Procedures for Managing Datastores

The PL/SQL procedures for managing datastores are the following:

  • rqDropDataStore, which deletes a datastore, is unchanged.

  • rqGrant is a new procedure that grants read privilege access to a datastore or script.

  • rqRevoke is a new procedure that revokes read privilege access to a datastore or script.

PL/SQL Procedures for Managing Scripts

The PL/SQL procedures for managing scripts are the following:

  • rqGrant is a new procedure that grants read privilege access to a datastore or script.

  • rqRevoke is a new procedure that revokes read privilege access to a datastore or script.

  • rqScriptCreate has the new arguments global and overwrite. The argument global specifies whether the script is global or private. A global script is a public script that is available to all users. If global = FALSE, then access to the script must be granted by the owner to other users. The argument overwrite specifies whether the content of a script can be replaced.

  • rqScriptDrop has the new arguments global, which specifies whether the script to drop is global or not, and silent, which specifies whether to report an error if a script cannot be dropped.

Data Dictionary Views for Datastores

The Oracle Database dictionary views related to datastores are the following:

  • ALL_RQ_DATASTORES

  • RQUSER_DATASTORECONTENTS

  • USER_RQ_DATASTORES

  • USER_RQ_DATASTORE_PRIVS

Data Dictionary Views for Scripts

The Oracle Database dictionary views related to scripts are the following:

  • ALL_RQ_SCRIPTS

  • USER_RQ_SCRIPTS

  • USER_RQ_SCRIPT_PRIVS

ore.groupApply Function Changes

Function ore.groupApply now supports partitioning on multiple columns.

The INDEX argument can now take an ore.vector or ore.frame object that contains ore.factor objects or columns, each of which is the same length as argument X. Function ore.groupApply uses the INDEX object to partition the data in X before sending it to function FUN.

For an example of the use of the INDEX object to partition the data, see “Partitioning on Multiple Columns”.

ore.randomForest Modeling Function

The ore.randomforest function builds a random forest model on data in an ore.frame object.

It uses embedded R execution to grow random forest trees in parallel in R sessions on the database server. It returns an ore.randomforest object. In Oracle R Enterprise 1.5, function ore.randomForest supports classification but not regression.

The ore.randomforest function uses the same algorithm as that adopted by the CRAN R randomForest package but it has better runtime memory usage as well as ensemble tree size.

The scoring method predict on an ore.randomforest model also runs in parallel. Oracle recommends that you set the cache.model argument to TRUE when sufficient memory is available. Otherwise, you should set cache.model to FALSE to prevent memory overuse.

To use ore.randomforest, you must install either Oracle R Distribution (ORD) 3.2 or the CRAN R randomForest package. Oracle recommends that you use the function ore.randomforest in ORD 3.2, which offers better performance and scalability than the CRAN R randomForest. If you only install the R randomForest package, ore.randomForest issues a warning message at run time. The CRAN R randomForest package is one of the supporting packages in Oracle R Enterprise 1.5.

The global option ore.parallel determines the degree of parallelism to use in the Oracle R Enterprise server. The argument groups controls the granularity of the ore.randomForest model.

For an example of using ore.randomForest, see "Building a Random Forest Model."

ore.summary Function Changes

Function ore.summary has improved performance. It also has a different signature and the data types for some arguments have changed.

The function’s syntax is now the following:

ore.summary(data, var, stats = c("n", "mean", "min", "max"),
            class = NULL, types = NULL, ways = NULL, weight = NULL,
            order = NULL, maxid = NULL, minid = NULL, mu = 0,
            no.type = FALSE, no.freq = FALSE) 

The differences between the ore.summary in Oracle R Enterprise Release 1.5 and previous releases are the following:

  • The performance of ore.summary has improved; it now returns over an order of magnitude faster.

  • The arguments var, stats, and class now take a vector of character strings; previously, they took a concatenated string using a comma as the separator.

  • Argument types is now a list of character string vectors that specifies combinations of columns in the class argument.

  • Arguments maxid and minid are named vectors of character strings.

  • Arguments group.by and no.level are not supported.

  • Argument mu in previous releases was named mu0.

For examples of using ore.summary, see “Summarizing Data with ore.summary”.

Statistical Function Method Changes

Oracle R Enterprise 1.5 provides transparency layer methods for the stats package functions prcomp and svd.

The prcomp function performs principal components analysis and the svd function performs singular-value decomposition. Those functions now accept ore.frame objects and can use parallel execution in the database, which can improve scalability and performance.

Support for BLOB and CLOB Data Types

Some Oracle R Enterprise functions now support the Oracle Database data types BLOB and CLOB.

Functions ore.push and ore.pull now support the database data types BLOB and CLOB.

Embedded R execution R functions now support the database data types BLOB and CLOB for input and output objects.

For examples of using BLOB and CLOB data types, see "Example 6-11".

Changes in Oracle R Enterprise 1.4.1

The following topics describe the changes in Oracle R Enterprise 1.4.1:

New Features in Oracle R Enterprise 1.4.1

The following changes are in Oracle R Enterprise 1.4.1:

  • The ore.glm function now accepts offset terms in the model formula and the function can now be used to fit negative binomial and tweedie families of generalized linear models.

  • The ore.sync function has an additional optional argument, query, that creates an ore.frame object from an optimized SQL SELECT statement without creating a view in the database. You can use this argument to create a query even when you do not have the CREATE VIEW system privilege for the current schema.

  • The new global option for serialization, ore.envAsEmptyenv, specifies whether referenced environments in an object should be replaced with an empty environment during serialization to an Oracle Database. This option is used by the following functions:

    • ore.push, which for a list object accepts envAsEmptyenv as an optional argument

    • ore.save, which has envAsEmptyenv as a named argument

    • ore.doEval and the other embedded R execution functions, which accept ore.envAsEmptyenv as a control argument.

    The default values of the above arguments are regulated by the global option ore.envAsEmptyenv, but by using the argument you can override the global option value for a function.

    See Also:

Other Changes in Oracle R Enterprise 1.4.1

Other changes in this release are the following:

  • The arules and statmod packages from The Comprehensive R Archive Network (CRAN) are now included in the Oracle R Enterprise supporting packages.

Changes in Oracle R Enterprise 1.4

The following topics describe the changes in Oracle R Enterprise 1.4:

New Features in Oracle R Enterprise 1.4

The following changes are in Oracle R Enterprise 1.4:

  • Additions and improvements to data preparation functions:

    • The new factanal function performs factor analysis on a formula or an ore.frame object that contains numeric columns.

    • Both signatures of the princomp function support the scores, subset, and na.action arguments.

    • The new getXlevels function creates a list of factor levels that can be used in the xlev argument of a model.matrix call that involves an ore.frame object.

  • The new exploratory data analysis function ore.esm builds exponential smoothing models for time series data. The function builds a model using either the simple exponential smoothing method or the double exponential smoothing method. The function can preprocess the time series data with operations such as aggregation and the handling of missing values.

  • Additions and improvements to the Oracle R Enterprise regression and neural network modeling functions:

    • The new ore.glm function provides methods for fitting generalized linear models, which include logistic regression, probit regression, and poisson regression.

    • The ore.lm and ore.stepwise functions are no longer limited to a total of 1,000 columns when deriving columns in the model formula.

    • The ore.lm function now supports a weights argument for performing weighted least squares regression.

    • The anova function can now perform analysis of variance on an ore.lm object.

    • For the ore.stepwise function, the values for the direction argument have changed. The value "both" now prefers drops over adds. The new direction argument value "alternate" has the previous meaning of the "both" value.

    • The ore.neural function has several new arguments.

  • Additions and improvements to the Oracle Data Mining model algorithm functions:

    • The new ore.odmAssocRules function, which builds an Oracle Data Mining association model using the apriori algorithm.

    • The new ore.odmNMF function, which builds an Oracle Data Mining model for feature extraction using the Non-Negative Matrix Factorization (NMF) algorithm

    • The new ore.odmOC function, which builds an Oracle Data Mining model for clustering using the Orthogonal Partitioning Cluster (O-Cluster) algorithm.

  • An additional global option for Oracle R Enterprise, ore.parallel.

Changes in Oracle R Enterprise 1.3

The following topics describe the changes in Oracle R Enterprise 1.3:

New Features in Oracle R Enterprise 1.3

The new features in Oracle R Enterprise 1.3 are the following:

  • Predicting with R models using in-database data with the OREpredict package

  • Ordering and indexing with row.names<-

  • Predicting with Oracle Data Mining models using the OREodm package

  • Saving and managing R objects in the database

  • Date and time data types

  • Sampling and partitioning

  • Long names for columns

  • Automatically connecting to an Oracle Database instance in embedded R scripts

  • Building an R neural network using in-database data with the ore.neural function

Other Changes in Oracle R Enterprise 1.3

Other changes in this release are the following:

  • Installation and administration information has moved from this manual to Oracle R Enterprise Installation and Administration Guide. New features related to installation and administration are described in that book.

Changes in Oracle R Enterprise 1.1

The new features in Oracle R Enterprise 1.1 are the following:

  • Support for additional operating systems:

    • Oracle R Distribution and Oracle R Enterprise are now supported IBM AIX 5.3 and higher and on 10 and higher for both 64-bit SPARC and 64-bit x386 (Intel) processors.

    • The Oracle R Enterprise Server now runs on 64-bit and 32-bit Windows operating systems.

  • Improved mathematics libraries in R:

    • You can now use the improved Oracle R Distribution with support for dynamically picking up either the Intel Math Kernel Library (MKL) or the AMD Core Math Library (ACML) with Oracle R Enterprise.

    • On Solaris, Oracle R Distribution dynamically links with Oracle SUN performance library for high speed BLAS and LAPACK operations.

  • Support for Oracle Wallet enables R scripts to no longer need to have database authentication credentials in clear text. Oracle R Enterprise is integrated with Oracle Wallet for that purpose.

  • Improved installation scripts provide more prerequisite checks and detailed error messages. Error messages provide specific instructions on remedial actions.