Preface

This guide describes how to manage and use the installed Oracle Big Data Appliance software.

Note:

Oracle Big Data SQL is no longer documented within this guide. See the Oracle Big Data Appliance User's Guide for instructions on how to install and use Oracle Big Data SQL.

Audience

This guide is intended for users of Oracle Big Data Appliance including:

  • Application developers

  • Data analysts

  • Data scientists

  • Database administrators

  • System administrators

The Oracle Big Data Appliance Software User's Guide introduces Oracle Big Data Appliance installed software, features, concepts, and terminology. However, you must acquire the necessary information about administering Hadoop clusters and writing MapReduce programs from other sources.

Documentation Accessibility

For information about Oracle's commitment to accessibility, visit the Oracle Accessibility Program website at http://www.oracle.com/pls/topic/lookup?ctx=acc&id=docacc.

Access to Oracle Support

Oracle customers that have purchased support have access to electronic support through My Oracle Support. For information, visit http://www.oracle.com/pls/topic/lookup?ctx=acc&id=info or visit http://www.oracle.com/pls/topic/lookup?ctx=acc&id=trs if you are hearing impaired.

Conventions

The following text conventions are used in this document:

Convention Meaning

boldface

Boldface type indicates graphical user interface elements associated with an action, or terms defined in text or the glossary.

italic

Italic type indicates book titles, emphasis, or placeholder variables for which you supply particular values.

monospace

Monospace type indicates commands within a paragraph, URLs, code in examples, text that appears on the screen, or text that you enter.

# prompt

The pound (#) prompt indicates a command that is run as the Linux root user.

Backus-Naur Form Syntax

The syntax in this reference is presented in a simple variation of Backus-Naur Form (BNF) that uses the following symbols and conventions:

Symbol or Convention Description

[ ]

Brackets enclose optional items.

{ }

Braces enclose a choice of items, only one of which is required.

|

A vertical bar separates alternatives within brackets or braces.

...

Ellipses indicate that the preceding syntactic element can be repeated.

delimiters

Delimiters other than brackets, braces, and vertical bars must be entered as shown.

boldface

Words appearing in boldface are keywords. They must be typed as shown. (Keywords are case-sensitive in some, but not all, operating systems.) Words that are not in boldface are placeholders for which you must substitute a name or value.

Changes in Oracle Big Data Appliance Release 4 (4.8)

Release 4.8 includes following feature changes and software updates.

Software Updates

  • CDH (Cloudera's Distribution including Apache Hadoop) 5.10.1

  • CM (Cloudera Manager) 5.10.1

  • Oracle Big Data Connectors 4.8

  • Big Data SQL 3.1 (earlier releases are not supported on Oracle Big Data Appliance 4.8)

  • MySQL Enterprise Edition 5.7.17

  • Perfect Balance 2.10.0

  • Java JDK 8u121

  • Oracle Linux 6.8 UEK 4 (Oracle Linux Unbreakable Kernel, Release 4)

  • Oracle Data Integrator Agent 12.2.1.1 (for Oracle Big Data Connectors)

  • Oracle R Advanced Analytics for Hadoop (ORAAH) 2.7.0 

  • Oracle's R Distribution (ORD) 3.2.0

  • Oracle NoSQL Community Edition or Oracle NoSQL Database Enterprise Edition 4.3.10. Both optional.

    Note that support is no longer available for Oracle NoSQL Database Community Edition.

  • Kudu 1.2

  • Kafka 2.1

  • Spark 2.0

  • Cloudera Key Trustee Server 5.10.1

The Cloudera parcels for Spark, Kudu, Kafka, and Key Trustee Server are included for your convenience, but are not deployed or configured by default.

See the Cloudera Documentation for information about CDH and Cloudera Manager 5.10.1

Oracle Big Data Appliance Release 4.8 includes Oracle Big Data SQL 3.1 as a Mammoth installation option. You do not need to download this package from the Oracle Software Delivery Cloud.

New Features

In addition to software upgrades and extra Cloudera parcels, Oracle Big Data Appliance provides these new features.

  • Automated LAG Setup

    The Oracle Big Data Appliance Configuration Utility provides an option to assign groups of connectors to LAGs (Link Aggregation Groups) for cluster client network connections. LAGs can provide faster uplink times and reduce the points of potential failure in the network. See Configuring for LACP/LAG Connections in the Oracle Big Data Appliance Owner’s Guide for details.

  • MIT Kerberos Security Option for Oracle NoSQL Enterprise installations

    You can select MIT Kerberos security for an Oracle NoSQL cluster via the Oracle Big Data Appliance Configuration Utility.

  • Oracle Big Data SQL can Connect to the Exadata Database Machine Over Ethernet

    Ethernet is now supported for Oracle Big Data SQL networking between Oracle Exadata Database Machine and Oracle Big Data Appliance. This enables you to use Oracle Big Data SQL with these two Engineered Systems in environments where InfiniBand is not feasible, such as when the two systems are geographically distant from each other.

    The Oracle Big Data Appliance Configuration Utility now provides an option to choose between Ethernet and InfiniBand for Oracle Big Data SQL when you are setting up the configuration for a cluster.

    Ethernet connections for Oracle Big Data SQL are not supported for networking between Oracle Big Data Appliance and Oracle SPARC SuperCluster.

  • Oracle Big Data SQL Networking is Switchable Between Ethernet and InfiniBand

    If needed, you can switch between InfiniBand and Ethernet after the Mammoth installation and after the Oracle Big Data SQL installation has been completed. See Choosing Between Ethernet and InfiniBand Connections For Oracle Big Data SQL in this guide for details.

Oracle Big Data SQL 3.1 can be Enabled/Disabled Using bdacli

In Release 4.8, the Oracle Big Data Appliance Configuration Generation Utility provides an option to install Oracle Big Data SQL 3.1. If you choose this option, then the Hadoop side of Oracle Big Data SQL is enabled automatically.

If you do not choose to enable Oracle Big Data SQL in the Mammoth installation, you can enable it later using the bdacli command line interface:

# bdacli enable big_data_sql

You can also use bdacli to disable the Hadoop side Oracle Big Data SQL:

# bdacli disable big_data_sql

Note:

For Oracle Big Data Appliance Release 4.8, the instructions in the Oracle Big Data SQL 3.1 Installation Guide for downloading, installing, and uninstalling the software on the Hadoop cluster do not apply.
  • You do not need to download Oracle Big Data SQL 3.1 from the Oracle Software Delivery Cloud. The software is already included in the Mammoth bundle.

  • Do not install Oracle Big Data SQL 3.1 on Release 4.8 using ./setup-bds install bds-config.json.

    Mammoth performs this part of the installation for you. Likewise, do not use ./setup-bds uninstall bds-config.json to uninstall the software from the Hadoop cluster. To disable Oracle Big Data SQL after the Mammoth installation (or to enable it if you did not opt include it in the Mammoth installation), use the bdacli commands as described above.

Except for the install and uninstall commands, the instructions on using setup-bds in the Oracle Big Data SQL 3.1 Installation Guide apply to Oracle Big Data Appliance and should be used as documented. This is also true for the database side of Oracle Big Data SQL. Create the database installation bundle and install it on the nodes of the Oracle Database system as described in the guide.

In Release 4.8, Node Migration and the mammoth -e Command Require Extra Steps for Oracle Big Data SQL Support

  • If Oracle Big Data SQL is enabled, then you must disable it on the entire cluster prior to running bdacli admin_cluster migrate <node>. Re-enable it after the migration. Use the bdacli enable and disable commands.

    1. Disable Oracle Big Data SQL across the cluster: bdacli disable big_data_sql

    2. Perform the migration.

    3. Re-enable Oracle Big Data SQL: bdacli enable big_data_sql

  • The Mammoth command for extending an installation to additional servers, mammoth -e, does not make necessary Oracle Big Data SQL changes. Correct the problem by calling ./setup-bds extend bds-config.json from the node where Cloudera Manager is installed after mammoth -e completes.

TLS 1.0 (TLSv1) Disabled for Cloudera Configuration Manager and Hue – May Affect Client Access

In order to improve security, Oracle Big Data Appliance has disabled TLS 1.0 for Cloudera Manager and Hue and also in the system-wide Java configuration.

It is recommended that you reconfigure or upgrade clients using TLS 1.0 to use newer encryption, but if necessary you can re-enable TLS 1.0 for Cloudera Manager and Hue.

For details, log on to My Oracle Support and search for BDA 4.8 Disables TLSv1 by Default For Cloudera Manager/Hue/And in System-Wide Java Configurations (Doc ID 2250841.1).

Oracle Big Data Discovery Installation on Big Data Appliance 4.8 not Supported at This Time

A compatible build of Oracle Big Data Discovery is under development. This document will be updated when a compatible build is available.

Deprecated or Discontinued Features

  • The Perfect Balance automatic invocation feature is not supported in this release. If you having been using automatic invocation, please switch to the Perfect Balance API.

  • There is no longer an option to purchase support for Oracle NoSQL Database Community Edition.