Oracle® R Enterprise User's Guide
Release 11.2 for Linux and Microsoft Windows

Part Number E26499-04

1 Overview of Oracle R Enterprise

R is an open source statistical programming language and environment. For information about R, see the R Project for Statistical Computing at http://www.r-project.org.

R provides an environment for statistical computing, including:

R's rapid adoption has earned it a reputation as a new statistical software standard.

Oracle R Enterprise is a component of the Oracle Advanced Analytics Option of Oracle Database Enterprise Edition. For detailed information about Oracle R Enterprise, including links to software downloads, go to Oracle R Enterprise at http://www.oracle.com/technetwork/database/options/advanced-analytics/r-enterprise/index.html.

Oracle R Enterprise allows users to perform statistical analysis on data stored in tables in an Oracle Database. Oracle R Enterprise has these components:

The components of Oracle R Enterprise are described in Chapter 3.

Oracle R Connector for Hadoop is a related product that is part of the Big Data Appliance.

Oracle R Enterprise also includes functions that perform most common or base statistical procedures; see Chapter 4 for more information.

The rest of this chapter describes Oracle R Enterprise Architecture, Oracle R Enterprise Data Types, and Oracle R Enterprise Supported Configurations.

Oracle R Enterprise Architecture

Oracle R Enterprise has these three components, which include the connector for Hadoop:

Description of the illustration oreug_vm_001.png

  1. The Client R Engine is a collection of R packages that allows you to connect to an Oracle Database and to interact with data in that database. The ORCH package allows you to connect to an Oracle Hadoop cluster and interact with data in HDFS files; the package also allows the execution of MapReduce jobs.

    You can use any R commands from the client. In addition, the client supplies these functions:

    • The R SQL Transparency framework intercepts R functions for scalable in-database execution

    • Functions intercept data transforms, statistical functions, and Oracle R Enterprise-specific functions

    • Interactive display of graphical results and flow control as in open source R

    • Submission of R closures (functions) for execution in the Oracle Database

  2. The Server is a collection of PL/SQL procedures and libraries that augment Oracle Database with the capabilities required to support an Oracle R Enterprise client. The R engine is also installed on Oracle Database to support embedded R execution. Oracle Database spawns R engines, which can provide data parallelism.

    The Oracle R Enterprise Database engine provides this functionality:

    • Scale to large datasets

    • Access to tables, views, and external tables in the database, as well as those accessible through database links

    • Use SQL query parallel execution

    • Use in-database statistical and data mining functionality

  3. R Engines spawned by Oracle Database support database-managed parallelism and provide lights-out scheduled execution of R scripts, that is, scheduling or triggering R scripts packaged inside a PL/SQL or SQL query. Oracle R Enterprise provides efficient data transfer to and from the spawned engines. Embedded R execution can be used to emulate MapReduce-style programming.
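The MapReduce-style pattern mentioned in item 3 can be pictured with a small, language-neutral sketch. The Python code below is not the Oracle R Enterprise API: the names group_apply, mean_delay, and flights are hypothetical, and worker threads merely stand in for the R engines that the database would spawn. It only shows the shape of the idea, assuming rows are partitioned by a key and each partition is handed to a separate worker.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor
from statistics import mean


def group_apply(rows, key, func, max_workers=4):
    """Partition rows by `key`, run `func` on each partition concurrently,
    and return a dict mapping each key value to the function's result."""
    partitions = defaultdict(list)
    for row in rows:
        # Route every row to the partition for its key value.
        partitions[row[key]].append(row)

    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Submit one task per partition (threads are a stand-in for spawned engines).
        futures = {group: pool.submit(func, part) for group, part in partitions.items()}
        # Collect the per-partition results into a single structure.
        return {group: future.result() for group, future in futures.items()}


def mean_delay(partition):
    """Per-partition function: average of the hypothetical `delay` field."""
    return mean(row["delay"] for row in partition)


# Hypothetical sample data, used only for illustration.
flights = [
    {"carrier": "AA", "delay": 10},
    {"carrier": "AA", "delay": 20},
    {"carrier": "UA", "delay": 5},
    {"carrier": "UA", "delay": 7},
    {"carrier": "DL", "delay": 0},
]

result = group_apply(flights, "carrier", mean_delay)
print(result)
```

In this sketch the partitioning step plays the role of the shuffle and the per-partition function plays the role of the reducer; in Oracle R Enterprise the database, not the client, manages the partitions and the engines.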

There are several data types specific to Oracle R Enterprise; see Oracle R Enterprise Data Types for details.

Oracle R Connector for Hadoop

The Oracle R Connector for Hadoop (ORCH) is an R package that allows an Oracle R Enterprise (ORE) client to interact with the Oracle Hadoop cluster and to execute MapReduce jobs on it.

For information about ORCH, see the Oracle Big Data Connectors User's Guide (http://docs.oracle.com/cd/E27101_01/doc.10/e27365/toc.htm), part of the Oracle Big Data Documentation library (http://docs.oracle.com/cd/E27101_01/index.htm).

Oracle R Enterprise Data Types

Oracle R Enterprise introduces a variant of many R data types. The name of the Oracle R Enterprise data type is the name of the corresponding R data type with the prefix "ore." (for example, ore.factor). These data types establish a mapping between an R object and a database table or view. The mapping tracks metadata of the Oracle object, which in turn aids in SQL query generation. These data types form the foundation of the Oracle R Enterprise transparency layer.

The following R data types have been overloaded for transparent in-database execution:

  • Character, Integer, Numeric and Logical vectors

  • Factors

  • Data Frame

  • Matrix is overloaded in two situations:

    • Linear algebra cross-products

    • Creating input matrices for advanced analytics
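The idea of an overloaded type that tracks table metadata and generates SQL can be illustrated with a minimal sketch. The Python code below is not Oracle R Enterprise code, and the classes Column and TableProxy are hypothetical names introduced only for illustration. It assumes a proxy object that holds a table name and column names rather than rows, and that turns a filter expression into a SQL query which runs in the database (an in-memory SQLite database stands in for Oracle Database here).

```python
import sqlite3


class Column:
    """Reference to a table column. A comparison returns a SQL fragment
    and its bind parameter instead of a boolean."""

    def __init__(self, name):
        self.name = name

    def __gt__(self, value):
        return (f'"{self.name}" > ?', value)

    def __lt__(self, value):
        return (f'"{self.name}" < ?', value)


class TableProxy:
    """Stand-in for a database table: stores metadata and filters, never rows."""

    def __init__(self, conn, table, columns, predicates=()):
        self.conn = conn
        self.table = table
        self.columns = tuple(columns)
        self.predicates = tuple(predicates)

    def __getitem__(self, item):
        if isinstance(item, str):
            # proxy["col"] yields a column reference, checked against the metadata.
            if item not in self.columns:
                raise KeyError(item)
            return Column(item)
        # proxy[predicate] yields a new proxy with one more filter; no query runs yet.
        return TableProxy(self.conn, self.table, self.columns,
                          self.predicates + (item,))

    def sql(self):
        """Build the SELECT statement and parameter list from the tracked metadata."""
        cols = ", ".join(f'"{c}"' for c in self.columns)
        query = f'SELECT {cols} FROM "{self.table}"'
        if self.predicates:
            query += " WHERE " + " AND ".join(p[0] for p in self.predicates)
        return query, [p[1] for p in self.predicates]

    def pull(self):
        """Execute the generated query in the database and fetch the result rows."""
        query, params = self.sql()
        return self.conn.execute(query, params).fetchall()


# Hypothetical sample table, used only for illustration.
conn = sqlite3.connect(":memory:")
conn.execute('CREATE TABLE "iris" ("species" TEXT, "petal_length" REAL)')
conn.executemany('INSERT INTO "iris" VALUES (?, ?)',
                 [("setosa", 1.4), ("versicolor", 4.7), ("virginica", 6.0)])

iris = TableProxy(conn, "iris", ["species", "petal_length"])
long_petals = iris[iris["petal_length"] > 4.0]   # builds SQL, transfers no data
query, params = long_petals.sql()
rows = long_petals.pull()                        # the filter runs inside the database
print(query, params)
print(rows)
```

The design point is that the filter is recorded on the proxy and evaluated by the database only when results are requested, so the client never has to hold the full table in memory.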

For more information and examples, see Oracle R Enterprise Transparency Framework.

Oracle R Enterprise Supported Configurations

Oracle R Enterprise consists of a client and a server. Both the client and the server run on Microsoft Windows, Oracle Linux, or Red Hat Linux. The server includes an Oracle Database, to which the client connects. Oracle R Enterprise also runs on Oracle Exadata machines with the Linux operating system. For details, see Prerequisites.

Installation of Oracle R Enterprise is described in Chapter 2.