Preface

This guide describes Oracle Big Data Appliance, which is used for acquiring, organizing, and analyzing very large data sets. It includes information about hardware operations, site planning and configuration, and physical, electrical, and environmental specifications.

This preface contains the following topics:

Audience

This guide is intended for Oracle Big Data Appliance customers and those responsible for data center site planning, installation, configuration, and maintenance of Oracle Big Data Appliance.

Documentation Accessibility

For information about Oracle's commitment to accessibility, visit the Oracle Accessibility Program website at http://www.oracle.com/pls/topic/lookup?ctx=acc&id=docacc.

Access to Oracle Support

Oracle customers that have purchased support have access to electronic support through My Oracle Support. For information, visit http://www.oracle.com/pls/topic/lookup?ctx=acc&id=info or visit http://www.oracle.com/pls/topic/lookup?ctx=acc&id=trs if you are hearing impaired.

Related Documentation

Oracle Big Data Appliance Documentation Library

In the Big Data portal on the Oracle Help Center, you can find a link to the complete Oracle Big Data Appliance library for your release of the product. The library includes the following documentation for Oracle Big Data Appliance as well as documentation for other Oracle big data products:

Note:

The Oracle Big Data Appliance Licensing Information User Manual is the consolidated reference for licensing information for Oracle and third-party software included in the Oracle Big Data Appliance product. Refer to this manual or contact Oracle Support if you have questions about licensing.

Help for Oracle Big Data Appliance in My Oracle Support

My Oracle Support (support.oracle.com) provides detailed “how to” instructions in the form of MOS notes (My Oracle Support notes) that are constantly updated. If you do not find information that you need in Oracle Big Data Appliance documentation library, check My Oracle Support. Use your Oracle account credentials to log in.

Your Oracle support representative may sometimes refer you to a MOS note for assistance with an issue.

In the search field, you can enter a specific MOS note number or any set of keywords.

Description of bigog_mos_search_field.png follows
Description of the illustration bigog_mos_search_field.png

See Also:

The Oracle Big Data Appliance Information Center

The Oracle Big Data Appliance Information Center (Doc ID 1445762.2) is an aggregation of MOS notes on many topics, such as

  • Site Checklists.

  • Service alerts.

  • The latest base image and how to use it for new installs and reprovisions.

  • Patches.

  • Certified CDH upgrades.

  • Tips on health checks as well as diagnostic and monitoring tools.

Documentation for Affiliated Hardware Products

The following Oracle libraries contain hardware information for Oracle Big Data Appliance.

Conventions

The following text conventions are used in this document:

Convention Meaning

boldface

Boldface type indicates graphical user interface elements associated with an action, or terms defined in text or the glossary.

italic

Italic type indicates book titles, emphasis, or placeholder variables for which you supply particular values.

monospace

Monospace type indicates commands within a paragraph, URLs, code in examples, text that appears on the screen, or text that you enter.

# prompt

The pound (#) prompt indicates a command that is run as the Linux root user.

Backus-Naur Form Syntax

The syntax in this reference is presented in a simple variation of Backus-Naur Form (BNF) that uses the following symbols and conventions:

Symbol or Convention Description

[ ]

Brackets enclose optional items.

{ }

Braces enclose a choice of items, only one of which is required.

|

A vertical bar separates alternatives within brackets or braces.

...

Ellipses indicate that the preceding syntactic element can be repeated.

delimiters

Delimiters other than brackets, braces, and vertical bars must be entered as shown.

boldface

Words appearing in boldface are keywords. They must be typed as shown. (Keywords are case-sensitive in some, but not all, operating systems.) Words that are not in boldface are placeholders for which you must substitute a name or value.

Changes in Oracle Big Data Appliance Release 5.2

Release 5.2 includes the following new features, software revisions, and other changes.

Cloudera 6.3.4 Support

Oracle Big Data Appliance 5.2 is based on Cloudera Enterprise 6.3.4. See the Cloudera web site for details about release 6.3.4.

Supported Upgrades

You can upgrade to Oracle Big Data Appliance Release 5.2 directly from Releases 4.12, 14.13, 4.14, and 5.1.

There are important prerequisites to fulfill before you can do the upgrade. See Upgrading the Software on Oracle Big Data Appliance in this guide for the prerequisites, the upgrade procedure, and some known issues.

Note:

Upgrade to Big Data Appliance 5.2 is supported for pre-existing Oracle Linux 6 clusters, but there is no option to create new Oracle Linux 6 clusters in the release.

Rolling Upgrades

Rolling upgrades (where nodes are upgraded one-after-the-other and downtime is avoided) are generally an option in Oracle Big Data Appliance releases. Rolling upgrades from Big Data Appliance 5.1 to 5.2 are supported. However, this option is not currently available for upgrades from Big Data Appliance 4.x releases to release 5.2. An upgrade from a 4.x release will require some cluster downtime.

Support for Migration from Oracle Linux 6 to Oracle Linux 7

Big Data Appliance 5.2 includes support for migrating cluster nodes from Oracle Linux 6 to Oracle Linux 7.

Oracle Linux 6 to Oracle Linux 7 migration is supported in releases 4.13 and greater. For any of these releases you can perform a migration from Oracle Linux 6 to Oracle Linux 7 either before or after the upgrade to Big Data Appliance 5.2. However, in the case of Big Data Appliance 4.13, if you choose to migrate to Oracle Linux 7 it is recommended that you perform the migration after the upgrade to Big Data Appliance 5.2. This is because some improvements to the process were introduced after release 4.13.

High Availability for HiveServer2 and Oozie Roles

For new Big Data Appliance 5.2 clusters only, HA HiverServer2 and HA Oozie are set up by default in the Mammoth installation. There is also an option to provide HA for the load balancer that handles traffic for these roles.

In clusters upgraded to 5.2 from earlier releases, High Availability cannot be provided for HiveServer2 and Oozie. This results in some differences in the role layouts between new and upgraded clusters.

Before You Create a New Cluster:

See For New Clusters: High Availability HiveServer2 and Oozie. This topic explains how High Availability works for these roles and shows the differences in role distribution between new and upgraded clusters.

Also see the Virtual IP Address (optional) subsection of Chapter 4: Using Oracle Big Data Appliance Configuration Generation Utility. This explains how to implement High Availability for the load balancer.

X8-2L Server Support

Big Data Appliance 5.2 racks are shipped with X8-2L servers, with the base image pre-installed. The OS level is Oracle Linux 7. Earlier server models are also supported.

You can integrate X8-2L servers into existing clusters consisting of X7-2L, X6-2L, and X5-2L servers. To do so, first ensure that the existing nodes of the cluster are running Big Data Appliance release 4.10 or higher.

X8 racks can be integrated into multirack configurations with existing X7, X6, or X5 racks.

Oracle Big Data SQL 4.1

Big Data SQL 4.1 is included as an optional package with the Big Data Appliance 5.2 release. Big Data SQL supports queries against non-relational data stored in multiple big data sources, including Apache Hive, HDFS, Oracle NoSQL Database, Apache Kafka, Apache HBase, and other NoSQL databases, as well as the Amazon S3 and Oracle Object Store.

A license is required in order to install Big Data SQL on Big Data Appliance. No additional license is required for the Oracle Database side of the installation. See the Oracle Big Data Appliance Licensing Information User Manual for details on licensing requirements.

Big Data SQL components must be installed on both Hadoop and Oracle Database. The Oracle Big Data SQL Installation Guide provides instructions for both parts of the installation, including generic instructions for installing the product on several supported Hadoop frameworks. However, on Oracle Big Data Appliance, most of the Hadoop-side installation is integrated with Mammoth, the Big Data Appliance installer. Installing Oracle Big Data SQL in this guide explains what is different about the Hadoop-side installation of Big Data SQL on Big Data Appliance.

See Also:

See the Big Data SQL 4.1 documentation.

Software Versions in This Release

  • Cloudera Enterprise 6.3.4, which includes major new component versions Apache Hive 2.1.1 with vectorization, Apache HBase 2.1.4 with performance and stability enhancements, Apache Oozie 5.1 with the ability to schedule recurring jobs, Apache Solr 7.4 with nested data types and JSON facet support. CDH 6 now includes Apache Kafka 2.2.1 and Apache Spark 2.4.0 as fully native components of the platform. Cloudera Enterprise 6.3.4 is the latest available release of Cloudera 6.

    The Cloudera parcels for Kudu, Kafka, and Key Trustee Server are included for your convenience, but are not deployed or configured by default.

    Note that in some earlier Oracle Big Data Appliance releases, the Mammoth installer deployed Apache Spark 2 automatically. In the upgrade to Oracle Big Data Appliance 5.2, this package is removed.

  • Oracle Big Data SQL 4.1. New features include Oracle SQL Access to Kafka enabling Oracle external tables to query data from Kafka topics. This release of Big Data SQL also supports:
    • Apache ORC file format
    • Complex data types with ORC, Parquet and Avro
    • Microsoft's Azure Blob Object Storage
    • Oracle Database 19c
  • Oracle Big Data Connectors 5.1

  • Oracle NoSQL Database 20.3.17.

  • MySQL 5.7.29

  • Oracle R Advanced Analytics for Hadoop (ORAAH) 2.8.1

  • Oracle's R Distribution (ORD) 3.3.0

  • Oracle Big Data Spatial & Graph 3.0.0

  • Java JDK 8u281

  • Oracle Linux 7 with UEK4 for new clusters. Oracle Linux 6 with UEK4 for Oracle Linux 6 cluster upgrades.

    Note:

    New Oracle Big Data Appliance racks are delivered with Oracle Linux 7.

    All servers in existing clusters (not in newly delivered racks) where you are installing Oracle Big Data Appliance 5.2 must first be updated to at least Oracle Big Data Appliance 4.10.0 before any X8-2L (or X7–2L) servers can be added as nodes in the cluster.

    For Big Data Appliance version 4.12.0 and higher, all new Cloudera Enterprise and Oracle NoSQL Database clusters will be deployed on Oracle Linux 7. Existing clusters on Oracle Linux 6 will remain on Oracle Linux 6 after upgrading.

    See Oracle Big Data Appliance Patch Set Master Note (Doc ID 1485745.1) in My Oracle Support for the base image download and instructions.

Software and Features Not Supported in This Release

Because of the transition to Cloudera Enterprise 6.x, the following software and functionality cannot currently be supported in Oracle Big Data Appliance Release 5.2.

  • Oracle Big Data Discovery

    If Oracle Big Discovery is installed, uninstall it prior to the upgrade to Big Data Appliance 5.2. This product is not compatible with Cloudera 6.

  • The CDH 5-Compatible ODI (Orace Data Integrator) Agent
    The Oracle Big Data Appliance 5.2 installation does not include a compatible version of the ODI Agent at this time. The agent carried in previous releases is not compatible with Cloudera 6. Therefore, clusters cannot be upgraded to release 5.2 if the agent is enabled. Before you upgrade, check to ensure that the existing ODI Agent is disabled:
    # bdacli getinfo cluster_odi_enabled

    To remove the existing ODI Agent, use bdacli to disable Oracle Big Data Connectors. Then re-enable Big Data Connectors without including the ODI Agent -- bdacli disable bdc and then bdacli enable bdc.

    The Oracle Big Data Appliance Configuration Utility will not allow you to select the ODI Agent for installation. If you import a pre-existing master.xml file into the Configuration Utility ensure that the ODI Agent installation is de-selected before generating new configuration files. Likewise, if you use the bdacli utility to enable Oracle Big Data Connectors, the bdacli enable bdc command will not enable the ODI Agent.

    Note:

    After the Mammoth installation, you can separately install the standalone ODI Agent 12.2.1.4 using the instructions in the ODI product documentation. This version of the agent is certified as compatible with Cloudera 6. Support is provided by the Fusion Middleware product team.

  • About New Kafka Clusters and Upgrade or Extension of Existing Kafka Clusters

    Creation of new Kafka clusters is not supported in Oracle Big Data Appliance 5.2.

    Pre-existing Kafka clusters remain supported after the Release 5.2 installation. These can be upgraded and extended using the same mammoth -p and mammoth -e command line options used to upgrade or extend CDH clusters. After an upgrade, the Kafka version is 2.2.1+cdh6.3.4

Recent Change History Prior to Oracle Big Data Appliance 5.2

The Recent Change History appendix to this guide describes changes in Releases 5.1 and 4.14. If you are upgrading from an older version of Oracle Big Data Appliance, then it may be informative to read this change summary.