10 Installing the Oracle Big Data Appliance Software

This chapter explains how to install, reinstall, and reconfigure the software on Oracle Big Data Appliance. It contains these sections:

Note:

If you did not enter the required passwords in the Oracle Big Data Appliance Configuration Generation Utility, then you are prompted to enter them during the software installation. Ensure that you know the current passwords for the operating system root and oracle users, the Cloudera Manager admin user, and the MySQL administrator. If you are installing or reinstalling Oracle Big Data Connectors, then you also need the MySQL password for Oracle Data Integrator.

10.1 About the Mammoth Utility

Mammoth is a command-line utility for installing and configuring the Oracle Big Data Appliance software. Using Mammoth, you can:

Set up a cluster for either CDH or Oracle NoSQL Database.
Create a cluster on one or more racks.
Create multiple clusters on an Oracle Big Data Appliance rack.
Extend a cluster to new servers on the same rack or a new rack.
Update a cluster with new software.

10.1.1 About Rolling Upgrades

A rolling upgrade reboots the nodes of a cluster in a sequence that always leaves one node out of three replicas active. This allows the cluster to stay on-line throughout the entire series of reboots. As a result, some (though not all) services of the cluster can remain accessible without interruption during the upgrade.

Rolling upgrades are only available for installations running CDH 4.1 or later.

Because sequential reboots of the nodes in pairs incrementally add time to the process, rolling upgrades take longer than a conventional Mammoth upgrade, in which all servers reboots are concurrent. Estimate an average of 8-10 minutes per pair of non-critical nodes, in addition to 4 minutes each for the Mammoth node and critical nodes (which are rebooted one at a time).

A rolling upgrade can be unattended. The reboot event will “roll” from one server to the next (or next pair) automatically if each server's post-upgrade reboot is successful. If the update detects error or warning conditions during a server reboot, then the process pauses and prompts you to continue.

Reboot of node mnode22bda08 had errors/warnings, do you wish to proceed anyway? [y/n]:

If you enter n:

The upgrade stops so that you can fix the problems.

When you are ready, resume the installation using mammoth -p.
if you enter y:

The upgrade continues.

This prompt is displayed once per node upgrade. If the node has already been rebooted (and the prompt has already been answered) then the upgrade will continue without displaying the prompt again.

When Rolling Upgrades are Possible

A rolling upgrade is an option for the following upgrade tasks described in this document:

For Adding Servers to a Cluster, there is normally no cluster downtime for the process. The only exception is that the first time a single-rack cluster is extended onto a second rack, there will be downtime for the cluster extension (as master roles are moved to the additional rack), but subsequent cluster extensions will not require any downtime.

In cases where rolling upgrade is an option, Mammoth displays the following message on the console and then prompts you to chose whether or not to proceed with the installation as a rolling upgrade:

You can choose a rolling upgrade or continue with a conventional upgrade. Rolling upgrades need a minimum replication factor of three (this is the default in Mammoth) to have full availability during the upgrade. Do you wish to perform a Rolling Upgrade? [y/n]:

Note:

If you enter y at the prompt to accept a rolling upgrade , the cluster is committed to the process and the upgrade of all nodes in the cluster must continue to completion uninterrupted.

Service Availability During a Rolling Upgrade

In a rolling upgrade, services that are configured to use HDFS High Availability (HDFS HA) can remain continuously available while each node in the cluster is upgraded and rebooted. These include HDFS, YARN, and Zookeeper. Keeping these services available during the upgrade requires a Hadoop replication factor of three or greater. This is because the rolling upgrade restarts nodes in pairs. The upgrade cannot predetermine if the replication factor meets this minimum, since global configuration may have been overwritten for individual files. An insufficient replication factor does not stop the upgrade, but in that case HDFS HA-enabled services may be subject to downtime.

In a rolling upgrade, Cloudera services are restarted in batches of two. This is what sustains access to HDFS and other services during an upgrade. Services that are not or cannot be configured to use HDFS HA may be unavailable during a rolling upgrade, regardless of the replication factor.

10.2 Installation Prerequisites

The Oracle Audit Vault, Auto Service Request, and Oracle Enterprise Manager options require software on a remote server on the same network as Oracle Big Data Appliance. Those installations must be complete before you run Mammoth, or it will fail.

Similarly, you must complete several steps for Kerberos if the key distribution center (KDC) is installed on a remote server. There are no preliminary steps if you install the KDC on Oracle Big Data Appliance.

The following list describes the prerequisites for all installation options.

Audit Vault Requirements:

Oracle Audit Vault and Database Firewall Server Release 12.1.1 must be up and running on a separate server on the same network as Oracle Big Data Appliance. No other version of Audit Value and Database Firewall is supported by Oracle Big Data Appliance.

Auto Service Request Requirements:

Your My Oracle Support account is set up.
ASR Manager is up and running.

Enterprise Manager Requirements:

Oracle Management System (OMS) version 12.1.0.4.0 or higher is up and running.
The OMS agent pull URL is working.
The OMS emcli download URL is working.
Both the HTTPS upload port and the console port are open.

Note:

Double-check the OMS credentials and ensure that you enter them correctly when running Mammoth. Invalid credentials are the primary cause of failure in the Enterprise Manager discovery process.

MIT Kerberos Requirements for a Remote KDC:

Add cloudera-scm/admin as a user to the KDC database by running the following command from kadmin:
```
addprinc -randkey cloudera-scm/admin@<REALM NAME>
```
Grant cloudera-scm/admin all permissions to the Kerberos database. It must be able to add, modify, remove, and list principals from the database.
Create the cmf.keytab file by running the following command from kadmin:
```
xst -k cmf.keytab cloudera-scm/admin@<REALM NAME> 
```
Move cmf.keytab to/opt/oracle/BDAMammoth.
To support Oozie and Hue, ensure that the remote KDC supports renewable tickets. If it does not, then follow these steps:
1. Open kdc.conf and set values for max_life and max_renewable_life. The max_life parameter defines the time period when the ticket is valid, and the max_renewable_life parameter defines the time period when users can renew a ticket.
2. Set maxrenewlife for the krbtgt principal. Use the following kadmin command, replacing duration with the time period and REALM NAME with the name of the realm:
```
modprinc -maxrenewlife duration krbtgt/REALM NAME
```
If the KDC does not support renewable tickets when Kerberos is configured, then Oozie and Hue might not work correctly.

Active Directory Kerberos Requirements for a Remote KDC

Active Directory Kerberos requirements are documented in MOS (My Oracle Support).

See MOS documents 2013585.1 and 2029378.1.

10.3 Downloading the Mammoth Software Deployment Bundle

The Mammoth bundle contains the installation files and the base image. Before you install the software, you must use Oracle Big Data Appliance Configuration Generation Utility to generate the configuration files, as described in "Generating the Configuration Files."

You use the same bundle for all procedures described in this chapter, regardless of rack size, and whether you are creating CDH or Oracle NoSQL Database clusters, or upgrading existing clusters.

To download the Mammoth bundle:

Locate the download site in either My Oracle Support or Automated Release Updates (ARU):

My Oracle Support
1. Log on to My Oracle Support and search for Oracle Big Data Appliance Patch Set Master Note (Doc ID 1485745.1).
2. Click the following link: Mammoth Bundle software version 4.6.0 for new installs (Oracle Linux 6), and upgrades (Oracle Linux 5).
3. Find the link to the appropriate Mammoth patch bundle in the table.
ARU
1. Connect to ARU.
2. On the Patches page, set Product to Big Data Appliance Integrated Software and Release to the appropriate release number.
3. Click Search.
Download the BDAMammoth ZIP files to any directory (such as /tmp) in the first node of the cluster. Depending on the configuration of the rack, this node can be the first, seventh, or thirteenth server from the bottom of the rack. For multirack clusters, this server is in the primary rack.

The patch consists of three files:
- p<patch_number>_Linux-x86-64_1of3.zip contains the Mammoth installation files.
- p<patch_number>_Linux-x86-64_2of3.zip contains the base image.
- p<patch_number>_Linux-x86-64_3of3.zip contains additional packages .
Log in to the first node of the cluster as root.

Extract all files from the all of the downloaded zip files.

$ unzip p<patch_number>_Linux-x86-64_1of3.zip
        Archive:  p<patch_number>_<version>_Linux-x86-64_1of3.zip
  inflating: README.txt
   creating: BDAMammoth-ol6-<version>/
 ...

Change to the BDAMammoth-<version> directory:
```
# cd BDAMammoth-ol6-<version>
```

Extract all files from BDAMammoth-<version>.run:

# ./BDAMammoth-ol6-<version>.run
 
Big Data Appliance Mammoth v<version> Self-extraction
 
Checking for and testing BDA Base Image in /tmp
 
BDABaseImage-<version>_RELEASE.iso: OK
 
Removing existing temporary files
 
Generating /tmp/BDAMammoth.tar
Verifying MD5 sum of /tmp/BDAMammoth.tar
/tmp/BDAMammoth.tar MD5 checksum matches
 
Extracting /tmp/BDAMammoth.tar to /opt/oracle/BDAMammoth
 
Extracting Base Image RPMs to bdarepo
Moving BDABaseImage into /opt/oracle/

Extracting BDA Extras
 
Removing temporary files
     .
     .
     .
Please "cd /opt/oracle/BDAMammoth" before running "./mammoth -i <rack_name>"
#

The new version of the Mammoth software is installed in /opt/oracle/BDAMammoth, and the previous version (if you are upgrading) is saved in /opt/oracle/BDAMammoth/previous-BDAMammoth.

Follow the specific instructions for the type of installation you are performing.

10.4 Installing the Software on a New Rack

Mammoth installs and configures the software on Oracle Big Data Appliance using the files generated by Oracle Big Data Appliance Configuration Generation Utility. A cluster can be dedicated to either CDH (Hadoop) or Oracle NoSQL Database.

For a CDH cluster, Mammoth installs and configures Cloudera's Distribution including Apache Hadoop. This includes all the Hadoop software and Cloudera Manager, which is the tool for administering your Hadoop cluster. If you have a license, Mammoth optionally installs and configures all components of Oracle Big Data Connectors.

For a NoSQL cluster, Mammoth installs Oracle NoSQL Database. CDH and Oracle NoSQL Database do not share a cluster, beginning with Oracle Big Data Appliance 2.2.

In addition to installing the software across all servers in the rack, Mammoth creates the required user accounts, starts the correct services, and sets the appropriate configuration parameters. When it is done, you have a fully functional, highly tuned, up and running Hadoop cluster.

10.4.1 Installing the Software

Follow this procedure to install and configure the software on one or more Oracle Big Data Appliance racks. You can configure one cluster on multiple racks in a single installation.

To install the software:

Verify that the Oracle Big Data Appliance rack is configured according to the custom network settings described in /opt/oracle/bda/network.json. If the rack is still configured to the factory default IP addresses, first perform the network configuration steps described in "Configuring the Network."
Verify that the software is not installed on the rack already. If the software is installed and you want to upgrade it, then use the mammoth -p option in Step 6.
Download and unzip the Mammoth bundle, as described in "Downloading the Mammoth Software Deployment Bundle." You must be logged in as root to the first server in the cluster.
Change to the BDAMammoth directory.
```
# cd /opt/oracle/BDAMammoth
```
Copy cluster_name-config.json to the current directory. See "About the Configuration Files."
Run the mammoth command with the appropriate options. See "Mammoth Software Installation and Configuration Utility." This sample command runs all steps:
```
./mammoth -i rack_name
```
After Mammoth completes Step 3 of the installation, it prompts you to restart, if it upgraded the base image.
If you installed support for Auto Service Request, then complete the steps in "Verifying the Auto Service Request Installation."
To configure another CDH cluster on the server:
1. Copy the BDAMammoth ZIP file to any directory on the first server of the cluster, which is either server 7 or server 13.
2. Repeat Steps 3 to 7. Each cluster has its own cluster_name-config.json file. Oracle Big Data Appliance Configuration Generation Utility creates the files in separate directories named for the clusters.

Note:

Mammoth stores the current configuration in the /opt/oracle/bda/install/state directory. Do not delete the files in this directory. Mammoth fails without this information if you must use it again, such as adding a rack to the cluster.

Verifying the Auto Service Request Installation

Log in to My Oracle Support at http://support.oracle.com.
Search for document ID 2103715.1, Engineered Systems ASR Configuration Check tool (asrexacheck version 4.x), and download the asrexacheck script.

Although this check was originally intended for Oracle Exadata Database Machine, it is now supported on Oracle Big Data Appliance.
Copy asrexacheck to a server in the Oracle Big Data Appliance cluster.
Log in to the server as root.
Copy asrexacheck to all servers in the cluster:
```
# dcli -C -f asrexacheck -d /opt/oracle.SupportTools
```
See Executing Commands Across a Cluster Using the dcli Utility for information about dcli.

Change the file access permissions:

# dcli -C chmod 755 /opt/oracle.SupportTools/asrexacheck

Run the script:

# dcli -C /opt/oracle.SupportTools/asrexacheck

File an Oracle Support Request (SR) to validate the ASR installation. Note the following choices:
1. Under "What is the Problem?" click the Hardware tab.
2. For Products Grouped By, select Hardware License.
3. For Problem Type, select My - Auto Service Request (ASR) Installation and Configuration Issues.
4. Include the output of asrexacheck.
Continue with Step 8 of the software installation procedure.

10.5 What If an Error Occurs During the Installation?

If the Mammoth utility fails, take these steps to resolve the problem:

Read the error message to see if it suggests a cause or a resolution.
Make the recommended changes and rerun the step.
If the error message does not recommend a resolution, or it does not resolve the problem:
- File a service request (SR) with My Oracle Support.
- Upload the diagnostic zip file, which Mammoth generates when an error occurs.

10.6 Adding Servers to a Cluster

To add servers to an existing cluster, use the Mammoth Utility to extend the cluster onto the new servers. Do not use the Oracle Big Data Appliance Configuration Generation Utility for this purpose.

For each rack in a cluster, the minimum number of servers that the rack can contribute to the cluster is three. Therefore, initially you must add a group of at least three. After the minimum number is fulfilled, there are no further constraints. You can then add more servers from the rack to the cluster individually or in groups of any number. You can also extend the cluster to all of the servers on the rack at the same time.

If you are adding servers to same rack (and to the same cluster) then there is no cluster downtime for the process. The first time a single-rack cluster is extended onto a second rack, there will be downtime for the cluster extension (as master roles are moved to the additional rack) but subsequent cluster extensions will not require any downtime. (See About Rolling Upgrades.)

There is no significant difference between how new server nodes and old nodes are configured. However, when a new node has a larger disk capacity, the data partitions created are correspondingly larger in order to use the entire disk. HDFS handles the fact that some data nodes have more space available than other nodes. Oracle Big Data Appliance performs no additional special configuration.

To install the software on additional servers in a cluster:

Ensure that all servers are running the same software version. The additional servers must not have an Oracle Big Data Appliance base image that is newer than the existing cluster. See "About Software Version Differences".
Ensure that all racks that form a single Hadoop cluster are cabled together. See Connecting Multiple Oracle Big Data Appliance Racks.
Connect as root to node01 of the primary rack and change to the BDAMammoth directory:
```
cd /opt/oracle/BDAMammoth
```
Note: Always start Mammoth from the primary rack.
Generate a parameter file for the server group. The following example adds six servers beginning with node13:
```
./mammoth -e node13 node14 node15 node16 node17 node18
```
The servers can be on the same rack or multiple racks. See the -e option in "Mammoth Software Installation and Configuration Utility."

After Mammoth completes Step 3 of the installation, it prompts you to restart, if it upgraded the base image.
If you are using Oracle Enterprise Manager Cloud Control to monitor Oracle Big Data Appliance, then run rediscovery to identify the hardware and software changes.

If you have a license for Oracle Big Data Connectors, then they are installed on all nodes of the non-primary racks, although the services do not run on them. Oracle Data Integrator agent still runs on node03 of the primary rack.

Mammoth obtains the current configuration from the files stored in /opt/oracle/bda/install/state. If those files are missing or if any of the services have been moved manually to run on other nodes, then Mammoth fails.

About Software Version Differences

All servers configured as one Hadoop cluster must have the same image. A new Oracle Big Data Appliance rack might be factory-installed with a newer base image than the previously installed racks. Use the imageinfo utility on any server to get the image version. When all servers of a single Hadoop cluster have the same image version, you can install the software.

To synchronize the new servers with the rest of the Hadoop cluster, either upgrade the existing cluster to the latest image version or downgrade the image version of the new servers.

To upgrade the image version:

Run Mammoth with the -p option to upgrade the software on the cluster to the latest version. See "Upgrading the Software on Oracle Big Data Appliance."

To downgrade the image version:

Reimage the new rack to the older version installed on the cluster. See My Oracle Support Information Center ID 1445745.2.
Use the old version of the Mammoth utility, which is on the first server of the existing cluster, to extend the cluster onto the new rack.

If you add a newer server model, then you can downgrade only to the first software version available for that model. For example, if you add Sun Server X3-2L servers, then you must install Oracle Big Data Appliance software version 2.3 or higher.

10.7 Upgrading the Software on Oracle Big Data Appliance

The procedure for upgrading the software is the same whether you are upgrading from one major release to another or just applying a patch set. The procedure is also the same whether your Hadoop cluster consists of one Oracle Big Data Appliance rack or multiple racks.

The process upgrades all components of the software stack including the firmware, Oracle Linux Unbreakable Enterprise Kernel (UEK), CDH, JDK, and Oracle Big Data Connectors (if previously installed). Two versions of the software bundle are available, one for Oracle Linux 5 and the other for Oracle Linux 6.

To upgrade only Oracle Big Data Connectors, and no other components of the software stack, contact Oracle Support for assistance.

Software downgrades are not supported.

Note:

The upgrade will prompt you to either choose a rolling upgrade or to continue with a conventional upgrade. A conventional upgrade process automatically stops and starts services as needed. Therefore the cluster is unavailable while the mammoth command is executing. The advantage of a rolling upgrade is that it may allow uninterrupted access to some services throughout the upgrade process. However, a rolling upgrade adds some extra time per server because the reboots are not concurrent.

Currently, Oracle Big Data Appliance software upgrades do not require corresponding upgrades of the InfiniBand or Cisco switches.

10.7.1 About the Operating System Versions

Oracle Big Data Appliance 2.3 and later software runs on either Oracle Linux version 5 or version 6. When upgrading the software, you can choose whether to upgrade the operating system also:

Retaining Oracle Linux 5: Download the Mammoth bundle for Oracle Linux 5. It retains Oracle Linux 5, but upgrades Oracle Unbreakable Enterprise Kernel (UEK) to version 2. It installs all of the Oracle Big Data Appliance software for this release.
Upgrading to Oracle Linux 6: Download the Mammoth bundle for Oracle Linux 6. To upgrade the operating system, you must first reimage the servers. Reimaging erases all files and data, therefore Oracle does not recommend this type of upgrade for a production system, unless you have a second cluster functioning as a backup with the same data. After reimaging, you can install the software with Oracle Linux 6 and UEK version 2.

10.7.2 Upgrading the Software

Follow these procedures to upgrade the software on an Oracle Big Data Appliance cluster to the current version.

10.7.2.1 Prerequisites

You must know the passwords currently in effect for the cluster, which the Mammoth utility will prompt you for:

oracle
root
Cloudera Manager admin
MySQL Database admin
MySQL Database for Oracle Data Integrator (if Oracle Data Integrator agent is installed)

In Oracle Big Data Appliance 4.1, Apache Sentry changes from a file-based provider to the Sentry service. If Sentry is enabled when you upgrade the cluster, then the file-based provider is retained. To switch to the Sentry service, disable Sentry before you upgrade the software, and then re-enable Sentry afterward.

10.7.2.2 Upgrading to the Current Software Version

Making sure cluster services are healthy before upgrade, and especially after reboot is very important. This is even more important in the case of rolling upgrade. In the case of rolling upgrade, if previous errors exist, the parcel upgrade will stop (even if the parcel is already upgraded). This will prevent services from starting. Manual steps will be needed to resume.

Upgrade the Oracle Big Data Appliance software to the current software version as follows. This is a summary. Refer to MOS Doc ID 2101906.1 for details, including prerequisites, further information on the steps below, and known issues.

Note:

All Oozie jobs should be stopped before the upgrade. Failure to do this may cause the upgrade to fail.

Download and unzip the Mammoth bundle, as described in "Downloading the Mammoth Software Deployment Bundle." You must be logged in as root to node 1 of the primary rack.
Change to the BDAMammoth directory.
```
# cd /opt/oracle/BDAMammoth
```
Run the mammoth command with the -p option:
```
# ./mammoth -p
```
Mammoth automatically upgrades the base image if necessary.
After all nodes reboot, perform the following checks.
1. Check uptime.
  # dcli -C uptime
2. Check /root/BDA_REBOOT_SUCCEEDED.
  # dcli -C ls -ltr /root/BDA_REBOOT_SUCCEEDED
  
  Note: If there is no BDA_REBOOT_SUCCEEDED, check for /root/BDA_REBOOT_* and /root/bda_reboot_status.
3. Verify that the kernel and JDK are upgraded.
```
# dcli -C uname -a 
# dcli -C java -version
```
4. Check that all Cloudera Configuration Manager services are healthy. You may need to manually restart some services.
  
  Note:
  During any upgrade, but especially a rolling upgrade, it is crucial that all services in CM are healthy after the reboot before continuing. Failure to do so will result in upgrade failures.
After the reboot and the post reboot checks, log on to node 1 of the primary rack and rerun mammoth -p in order to resume the upgrade.
```
# cd /opt/oracle/BDAMammoth
# ./mammoth -p
```
When the upgrade is complete, perform the post-upgrade validation steps described in the MOS document (2101906.1).

10.7.2.3 Performing Parcel Upgrades Outside of the One-Off Patch Mechanism

Oracle one-off patches for Oracle Big Data Appliance software are by definition supported. Parcel upgrades that are not part of a one-off patch are unsupported unless affirmed by Oracle Support.

The CDH 5.5.2 Parcel Upgrade

Mammoth bundles include a recent version of the parcel for Cloudera's Distribution including Apache Hadoop (CDH). However, CDH is released on Cloudera’s schedule and sometimes a newer CDH release than the one in the current Mammoth bundle may be available. A new CDH version may be supported for integration with the current Oracle Big Data Appliance release. Consult with Oracle Support if you have questions.

Oracle Big Data Appliance Release 4.4 includes CDH 5.5.1. A parcel upgrade to CDH 5.5.2 in this release is supported. See MOS Doc ID 2105676.1 for instructions. This upgrade is optional. Before you chose to do the upgrade, consider the issues described in the Known Limitations sections of the MOS note.

10.8 Changing the Configuration of Optional Software

During the initial configuration of Oracle Big Data Appliance, the optional software components may or may not be installed. Using the bdacli utility, you can reverse some of those decisions. You must provide the relevant server names, ports, user names, and passwords.

This section provides examples of some reconfiguration options. It has the following topics:

Note:

See "bdacli" for the complete syntax.

10.8.1 Changing Support for Oracle Big Data Connectors

You can add or remove support for Oracle Big Data Connectors:

10.8.1.1 Adding Oracle Big Data Connectors

When adding support for Oracle Big Data Connectors, you can choose whether to install Oracle Data Integrator agent. If you do, then you must provide passwords for the MySQL Database root user and the Oracle Data Integrator user of MySQL Database (BDA_ODI_REPO). You must also know these passwords if they are not saved in cluster_name-config.json:

Cloudera Manager admin user
Oracle Audit Vault and Database Firewall, if enabled

The following procedure uses the bdacli utility.

To add Oracle Big Data Connectors to a cluster:

Enable Oracle Big Data Connectors:

# bdacli enable bdc
INFO: Logging all actions in /opt/oracle/BDAMammoth/bdaconfig/tmp/bda1node03-20140805110007.log and traces in /opt/oracle/BDAMammoth/bdaconfig/tmp/bda1node03-20140805110007.trc
INFO: This is the install of the primary rack
INFO: Checking if password-less ssh is set up
     .
     .
     .
Do you wish to enable ODI? [y/n]: y
Enter password for the BDA_ODI_REPO mysql user
Enter password: odi_password
Enter password again: odi_password
Enter password for the mysql root user
Enter password: root_password
Enter password again: root_password
WARNING: The password for the Cloudera Manager admin user is missing from the parameters file and is required for the installation.
Enter password: admin_password
Enter password again: admin_password
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
  0     2    0     2    0     0    181      0 --:--:-- --:--:-- --:--:--   250
INFO: The password for audit vault server is not needed since feature is not enabled
INFO: Creating environment.pp file ...
INFO: Making sure all puppet agents can be accessed.
INFO: Pinging puppet agents
INFO: Adding BDC to the cluster. This will take some time ...
     .
     .
     .
SUCCESS: Successfully reconfigured service

10.8.1.2 Removing Oracle Big Data Connectors

When removing support for Oracle Big Data Connectors, you must provide passwords for the following users, if the passwords are not saved in cluster_name-config.json:

Cloudera Manager admin user
MySQL Database root user, if Oracle Data Integrator agent is enabled
BDA_ODI_REPO user of MySQL Database, if Oracle Data Integrator agent is enabled
Oracle Audit Vault and Database Firewall, if enabled

The following procedure uses the bdacli utility.

To remove Oracle Big Data Connectors from a cluster:

Remove Oracle Big Data Connectors:

# bdacli disable bdc
INFO: Logging all actions in /opt/oracle/BDAMammoth/bdaconfig/tmp/bda1node03-20140805104603.log and traces in /opt/oracle/BDAMammoth/bdaconfig/tmp/bda1node03-20140805104603.trc
INFO: This is the install of the primary rack
INFO: Checking if password-less ssh is set up
INFO: Executing checkRoot.sh on nodes 
 
/opt/oracle/BDAMammoth/bdaconfig/tmp/all_nodes #Step -1#
SUCCESS: Executed checkRoot.sh on nodes 
 
/opt/oracle/BDAMammoth/bdaconfig/tmp/all_nodes #Step -1#
INFO: Executing checkSSHAllNodes.sh on nodes 
 
/opt/oracle/BDAMammoth/bdaconfig/tmp/all_nodes #Step -1#
SUCCESS: Executed checkSSHAllNodes.sh on nodes 
 
/opt/oracle/BDAMammoth/bdaconfig/tmp/all_nodes #Step -1#
INFO: Reading component versions from 
 
/opt/oracle/BDAMammoth/bdaconfig/COMPONENTS
INFO: Creating nodelist files...
WARNING: The password for the Cloudera Manager admin user is missing from the parameters file and is required for the installation.
Enter password: admin_password
Enter password again: admin_password
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
  0     2    0     2    0     0    184      0 --:--:-- --:--:-- --:--:--   250
WARNING: The password for the MySQL root user is missing from the parameters file and is required for the installation.
Enter password: root_password
Enter password again: root_password
INFO: Executing verifyMySQLPasswd.sh on nodes 
/opt/oracle/BDAMammoth/bdaconfig/tmp/all_nodes #Step -1#
SUCCESS: Executed verifyMySQLPasswd.sh on nodes 
 
/opt/oracle/BDAMammoth/bdaconfig/tmp/all_nodes #Step -1#
WARNING: The password for the MySQL BDA_ODI_REPO user is missing from the  parameters file and is required for the installation.
Enter password: odi_password
Enter password again: odi_password
INFO: Executing verifyMySQLPasswd.sh on nodes 
/opt/oracle/BDAMammoth/bdaconfig/tmp/all_nodes #Step -1#
INFO: The password for audit vault server is not needed since feature is not enabled
INFO: Creating environment.pp file ...
INFO: Making sure all puppet agents can be accessed.
INFO: Pinging puppet agents
INFO: Removing big data connectors. This will take some time ...
     .
     .
     .
SUCCESS: Successfully reconfigured service

10.8.2 Adding Support for Auto Service Request

The following procedure shows how to add support for Auto Service Request.

To support Auto Service Request:

Set up your My Oracle Support account and install ASR Manager. You must do this before activating Auto Service Request on Oracle Big Data Appliance. See Setting Up Auto Service Request.

Turn on Auto Service Request monitoring and activate the assets:

# bdacli enable asr
INFO: Logging all actions in /opt/oracle/BDAMammoth/bdaconfig/tmp/bda1node01-20130205075303.log and traces in /opt/oracle/BDAMammoth/bdaconfig/tmp/bda1node01-20130205075303.trc
     .
     .
     .
Enter the value for ASR_HOST [Default: ]: asr-host.example.com
Enter the value for ASR_PORT [Default: 162]:
Enter the value for ASR_SERVER_USER: jdoe
 
Please Check the values you entered for the ASR parameters
 
ASR_HOST = asr-host.example.com
ASR_PORT = 162
ASR_SERVER_USER = jdoe
 
Are these values correct (y/n): y
Enter password for user jdoe on machine asr-host.example.com
Enter password: password
Enter password again: password
INFO: Creating environment.pp file ...
INFO: Making sure all puppet agents can be accessed.
INFO: Pinging puppet agents
INFO: Setting up ASR on all nodes. This will take some time ...
     .
     .
     .

Complete the steps in "Verifying the Auto Service Request Installation."

10.8.3 Adding Support for Oracle Enterprise Manager Cloud Control

Big Data Appliance supports two versions of the Enterprise Manager System Monitoring plug-in for Oracle Big Data Appliance – 13.1 and 12.c. Install either Enterprise Manager System Monitoring plug-in for Oracle Big Data Appliance in an Oracle Enterprise Manager Cloud Control installation on the same network.

To support Oracle Enterprise Manager Cloud Control:

Install the preliminary patches.

To enable system monitoring over HTTPS for either the 13.1 or 12.c plug-in, you must first install the patches listed in the table below. You can download these patches from the Oracle Automated Release Updates (ARU) site. the 13.2 plug-in does not require additional patches.

Enterprise Manager System Monitoring Plug-In	Patch Prerequisites for Support of the Plug-In on Oracle Big Data Appliance
13.2 Enterprise Manager System Monitoring Plug-In	No patches needed.
13.1 Enterprise Manager System Monitoring Plug-In	OMS side patch: 23294904 Agent plugin : 23294778 Discovery plugin : 23294785
12.c Enterprise Manager System Monitoring Plug-In	OMS side patch: 23218275 BDA Plugin Discovery OH patches: 23218115 and 23218109.

Install the Enterprise Manager System Monitoring plug-in.

For instructions on installing the version 13.1 plug-in, see Oracle Enterprise Manager System Monitoring Plug-in Installation Guide for Oracle Big Data Appliance . You can use these same instructions to install version 13.2.

For instructions on installing the 12.c plug-in, see this My Oracle Support Document:Download/Deployment of the 12c EM May Bundle Patch to Support Discovery/Monitoring of Cloudera Manager Admin Console HTTPS URLS (Doc ID 2148611.1).

10.8.4 Adding Support for Oracle Audit Vault and Database Firewall

Only Release 12.1.1 of Oracle Audit Vault and Database Firewall Server is supported for use with Oracle Big Data Appliance at this time.

Before installing support on Oracle Big Data Appliance, ensure that Oracle Audit Vault and Database Firewall Server is up and running. The software must be installed on a separate server on the same network as Oracle Big Data Appliance.

You must also have the following information about the Audit Vault Server installation:

Audit Vault Server administration user name and password
Database service name
IP address
Port number

To add support for Oracle Audit Vault and Database Firewall:

Add support for Oracle Audit Vault and Database Firewall:

# bdacli enable auditvault
INFO: Logging all actions in /opt/oracle/BDAMammoth/bdaconfig/tmp/ bda1node01-20140805072714.log and traces in /opt/oracle/BDAMammoth/bdaconfig/tmp/ bda1node01-20140805072714.trc
INFO: This is the install of the primary rack
INFO: Checking if password-less ssh is set up
INFO: Executing checkRoot.sh on nodes 
 
/opt/oracle/BDAMammoth/bdaconfig/tmp/all_nodes #Step -1#
SUCCESS: Executed checkRoot.sh on nodes 
 
/opt/oracle/BDAMammoth/bdaconfig/tmp/all_nodes #Step -1#
INFO: Executing checkSSHAllNodes.sh on nodes 
 
/opt/oracle/BDAMammoth/bdaconfig/tmp/all_nodes #Step -1#
SUCCESS: Executed checkSSHAllNodes.sh on nodes 
 
/opt/oracle/BDAMammoth/bdaconfig/tmp/all_nodes #Step -1#
INFO: Reading component versions from 
 
/opt/oracle/BDAMammoth/bdaconfig/COMPONENTS
INFO: Creating nodelist files...
Please enter the Audit Vault Server Admin Username: admin_username
Please enter the Audit Vault Server Admin Password: admin_password
Enter password again: admin_password
Please enter the Audit Vault Server Database Service Name: service_name
Please enter the Audit Vault Server IP Address: IP address
Please enter the Audit Vault Server Port: port_number
INFO: Creating environment.pp file ...
INFO: Making sure all puppet agents can be accessed.
INFO: Pinging puppet agents
INFO: Adding audit Vault Service. This will take some time ...
     .
     .
     .

10.8.5 Adding Kerberos Authentication

The following procedure configures Kerberos authentication.

Note:

If you choose to enable Active Directory Kerberos, first read MOS (My Oracle Support) documents 2029378.1 and 2013585.1. These documents explain required preliminary steps and provide important information on known issues.

To support Kerberos authentication:

Ensure that you complete the Kerberos prerequisites listed in "Installation Prerequisites."
Log into the first NameNode (node01) of the primary rack.

Configure Kerberos:

# bdacli enable kerberos
INFO: Logging all actions in /opt/oracle/BDAMammoth/bdaconfig/tmp/bda1node01-20131104072502.log and traces in /opt/oracle/BDAMammoth/bdaconfig/tmp/bda1node01-20131104072502.trc
     .
     .
     .
INFO: Executing checkRoot.sh on nodes /opt/oracle/BDAMammoth/bdaconfig/tmp/all_nodes #Step -1#
SUCCESS: Executed checkRoot.sh on nodes /opt/oracle/BDAMammoth/bdaconfig/tmp/all_nodes #Step -1#
SUCCESS: Password-less root SSH is setup.
 
 Do you want to setup KDC on a BDA node (yes/no): yes
 
 Please enter the realm name: EXAMPLE.COM
 
 Enter password for Kerberos database: password
 Enter password again: password
INFO: Executing changekrb5_kdc.sh on nodes bda1node01 #Step -1#
SUCCESS: Executed changekrb5_kdc.sh on nodes bda1node01 #Step -1#
SUCCESS: Successfully set the Kerberos configuration on the KDC
INFO: Setting up Master KDC
INFO: Executing setupKDC.sh on nodes bda1node01 #Step -1#
     .
     .
     .

10.8.6 Installing Oracle Big Data Discovery

Oracle Big Data Discovery is an end-to-end product for visual analysis of Big Data. You can enable or disable Oracle Big Data Discovery on the Oracle Big Data Appliance if this product has been licensed.

Use only the methods described in this document to install and uninstall Oracle Big Data Discovery on the Oracle Big Data Appliance. The installation procedure described in the Oracle Big Data Discovery Installation Guide should not be used in this context. The Oracle Big Data Appliance does not support the installation of WebLogic Server, Studio, and Dgraph on separate nodes of a CDH cluster.

If you are installing Big Data Discovery on an edge node, the Oracle Big Data Appliance installation process for Oracle Big Data Discovery does NOT apply. In the case of edge nodes, you should instead follow the normal installation process described in the Oracle Big Data Discovery Installation Guide

10.8.6.1 How Oracle Big Data Discovery is Deployed on Oracle Big Data Appliance

All Oracle Big Data Discovery (BDD) components are installed on a single node. Oracle Big Data Discovery will be installed on the fifth node of the cluster (referred to as “Node 5” in these instructions). After enabling BDD, the cluster will consist of five nodes: 1, 2, 3, 4, 6. If necessary, you can later add Node 5 back into the CDH cluster by disabling Oracle Big Data Discovery. If you cannot afford to dedicate a node from the Hadoop cluster, you can install the software on a server external to Oracle Big Data Appliance and use the Oracle Big Data Appliance cluster for data processing. In that case, do not use the instructions in this section. Instead, see the Oracle Big Data Discovery product documentation in the Oracle Help Center for other installation options.

Note:

This installation is not supported on Oracle Big Data Appliances running Oracle Linux 5.

To re-purpose Node 5 for use by Big Data Discovery, the installation removes all data currently on the server’s drive array. It first checks that the required disk space exists elsewhere within the cluster. If this requirement is met, it then redistributes all Hadoop blocks on the server to the other nodes. No Hadoop data is lost. The installation then deprovisions the DataNode and NodeManager on the server and erases the data partitions (/u03 to /u12). Only the /boot, root (/) and the first two data partitions (/u01 and /u02) are left intact.

Important:

It is the customer’s responsibility to back up or move any non-Hadoop data that should be preserved prior to installing Oracle Big Data Discovery. The installation does not move any non-Hadoop data stored on Node 5. Non-Hadoop data present in data partitions /u03-/u12 on Node 5 are lost when the installation erases those data partitions.

10.8.6.2 Installing and Enabling Oracle Big Data Discovery

Oracle Big Data Appliance 4.6 supports both the 1.2.2 and 1.3.2 versions of Oracle Big Data Discovery.

To install Oracle Big Data Discovery 1.2.2 or 1.3.2 on Oracle Big Data Appliance 4.6, use the instructions provided in the support document 2150755.1 – How to Enable/Disable Oracle Big Data Discovery 1.2.2 on Oracle Big Data Appliance V4.5/CDH5.7.1/OL6 with bdacli in My Oracle Support. Although these instructions are for installing on Oracle Big Data Appliance 4.5, they also work for the installation of either Oracle Big Data Discovery 1.2.2 or 1.3.2 on Oracle Big Data Appliance 4.6 . However you need to adapt these steps as follows:

Ignore instructions to upgrade CDH 5.7 to 5.7.1. Oracle Big Data Appliance 4.6 already includes CDH 5.8.0. CDH 5.8.2 is also supported for this installation.
In Step 4 in Prerequisites for Node Selection and Preparation, copy the Avro, Hadoop, Hive, and Spark packages from /opt/oracle/BDAMammoth/source/CDH-P/5.8.x-ol5 from Node 1 to /opt/oracle/BDAMammoth/source/CDH-P/5.8.x-ol5 on Node 5 (if these files do not already exist at the given path on Node 5).
In the section Prerequisites for Package Download, note that the Oracle Software Deliver Cloud offers version 1.3.2 by default. If you want version 1.2.2 instead, click Select Alternate Release.
Step 2 in the section Enable BDD describes how to copy and rename the download package filenames for verson 1.2.2. In the version 1.3.2 bundle, the filenames are V77911*-01.zip, as shown below, with the exception of the JAR file which is the same as in Release 1.2.2. For 1.3.2, copy and rename the files as follows:
```
# cp /tmp/V779116-01.zip /opt/oracle/bdd/installer.zip 
# cp /tmp/V779114-01.zip /opt/oracle/bdd/bdd1.zip 
# cp /tmp/V779115-01.zip /opt/oracle/bdd/bdd2.zip 
# cp /tmp/fmw_12.1.3.0.0_wls.jar  /opt/oracle/bdd/.
```
Extract the JAR file above from V44413-01.zip

Follow the rest of the instructions as documented in the My Oracle Support note.

Important:

Do not use the installation instructions provided here to upgrade an existing Oracle Big Data Discovery installation to version 1.3.2. Follow the upgrade instructions in the Oracle Big Data Discovery Upgrade Guide.

10.8.6.3 Using Oracle Big Data Discovery

Refer to the Oracle Big Data Discovery documentation in the Oracle Help Center for instructions on using the software. Note that the site contains instructions for uninstalling, re-installing, upgrading, or otherwise reconfiguring Oracle Big Data Discovery that should not be used with the installation of the software on Oracle Big Data Appliance. Contact Oracle Support if you have questions about this.

10.9 Reinstalling the Base Image

The operating system and various utilities are factory installed on Oracle Big Data Appliance, as described in "Oracle Big Data Appliance Management Software". You may need to reinstall this base image if, for example, you want to return Oracle Big Data Appliance to its original state, or you replaced a failed server. Mammoth automatically upgrades the base image as necessary before upgrading the Oracle Big Data Appliance software to a newer version.

You can reimage all or part of a rack. However, all the servers in a cluster must have the same image. Follow the appropriate instructions:

10.9.1 Reimaging a Single Oracle Big Data Appliance Server

Follow this procedure to reimage one server, for example, following replacement of a failed server.

Caution:

If you reimage a server, then all files and data are erased.

To reinstall the base image on one server:

Download the base image patch from My Oracle Support or Oracle Automated Release Updates (ARU), and copy it to the server being reimaged. You can use the base image downloaded with the Mammoth bundle, or download the separate base image patch.

Caution:

Use the most recent 4.x version of the base image. Do not use the version included in the Mammoth bundle.

See "Downloading the Mammoth Software Deployment Bundle." You can take the same basic steps to download the base image patch from My Oracle Support.
If you are reimaging the server to the current customer settings, verify that /opt/oracle/bda/network.json reflects the intended network configuration. If it does not, then generate a new file using Oracle Big Data Appliance Configuration Generation Utility. See "Generating the Configuration Files."

Ensure that 4 GB or more disk space are free in the partition:

$ df -h
Filesystem            Size  Used Avail Use% Mounted on
/dev/md2              161G   23G  130G  15% /
/dev/md0              194M   40M  145M  22% /boot
tmpfs                  24G     0   24G   0% /dev/shm
/dev/sda4             1.7T  197M  1.7T   1% /u01
/dev/sdb4             1.7T  197M  1.7T   1% /u02
/dev/sdc1             1.8T  199M  1.8T   1% /u03
/dev/sdd1             1.8T  199M  1.8T   1% /u04
     .
     .
     .

Unzip the downloaded base image ZIP file. The following example shows a partial listing of the output to the command line:

# unzip p24449886_460_Linux-x86-64.zip
Archive: p24449886_460_Linux-x86-64.zip
creating: BDABaseImage-ol6-4.6.0_RELEASE/
creating: BDABaseImage-ol6-4.6.0_RELEASE/Extras/
creating: BDABaseImage-ol6-4.6.0_RELEASE/Extras/BALANCER/
extracting: BDABaseImage-ol6-4.6.0_RELEASE/Extras/BALANCER/orabalancer-2.8.0-h2.zip
inflating: BDABaseImage-ol6-4.6.0_RELEASE/Extras/BALANCER/orabalancer-2.8.0-h2.noarch.rpm
extracting: BDABaseImage-ol6-4.6.0_RELEASE/Extras/BALANCER/orabalancerdemo-2.8.0-h2.zip
creating: BDABaseImage-ol6-4.6.0_RELEASE/Extras/CELL/
inflating: BDABaseImage-ol6-4.6.0_RELEASE/Extras/CELL/bd_cell-12.1.2.0.100_LINUX.X64_160506.11
...

Change to the subdirectory created in the previous step:
```
# cd BDABaseImage-ol6-4.6.0_RELEASE
 
```
Reimage the server using the makebdaimage command. The following example reimages server 4, including the internal USB, from the 4.6.0 base image to the custom settings in network.json. You must be logged in to the server being reimaged, which is server 4 in this example.
```
./makebdaimage -usbint BDABaseImage-4.6.0_RELEASE.iso /opt/oracle/bda/BdaDeploy.json 4
```
If the makebdaimage command succeeds without errors, then restart the server.

10.9.2 Reimaging an Oracle Big Data Appliance Rack

Follow this procedure to reimage an entire rack.

Caution:

If you reimage an entire rack, then all clusters, files, and data on the rack are erased. Reimaging is not required for a software upgrade.

To reinstall the base image on all servers in a rack:

If the Oracle Big Data Appliance software was installed previously on the rack, then save the /opt/oracle/BDAMammoth/cluster_name-config.json file to a safe place outside Oracle Big Data Appliance.
Download the most recent base image patch from My Oracle Support or Oracle Automated Release Updates (ARU), and copy it to the first (bottom) server of the rack being reimaged. You can use the base image downloaded with the Mammoth bundle, or download the separate base image patch.

See "Downloading the Mammoth Software Deployment Bundle." You can take the same basic steps to download the base image patch from My Oracle Support.

Caution:

Use the most recent 4.x version of the base image. Do not use the version included in the Mammoth bundle.
Establish an SSH connection to the first server and log in as root.
If you are reimaging to existing customer network settings, then verify that /opt/oracle/bda/network.json reflects the intended network configuration. If it does not, then generate a new file using Oracle Big Data Appliance Configuration Generation Utility. See "Generating the Configuration Files."
Ensure that passwordless SSH is set up:
```
# dcli hostname
192.168.41.37: bda1node01.example.com
192.168.41.38: bda1node02.example.com
192.168.41.39: bda1node03.example.com
     .
     .
     .
```
This command must run without errors and return the host names of all Oracle Big Data Appliance servers. If not, then follow the steps in "Setting Up Passwordless SSH". Do not continue until the dcli hostname command runs successfully on all servers.
Check all Oracle Big Data Appliance servers for hardware issues:
```
# dcli bdacheckhw | grep -v SUCCESS
```
Resolve any hardware errors and warnings before reimaging the rack.
Verify that at least 4 GB are available in the root (/) partition of all servers:

# dcli df -h /
192.168.41.37: Filesystem            Size  Used Avail Use% Mounted on
192.168.41.37: /dev/md2              161G   21G  132G  14% /
192.168.41.38: Filesystem            Size  Used Avail Use% Mounted on
192.168.41.38: /dev/md2              161G   19G  135G  12% /
192.168.41.39: Filesystem            Size  Used Avail Use% Mounted on
192.168.41.39: /dev/md2              161G   23G  131G  15% /
     .
     .
     .

Unzip the downloaded base image ZIP file. For example:

# unzip p22118555_420_Linux-x86-64.zip
Archive:  p22118555_420_Linux-x86-64.zip    
creating: BDABaseImage-ol6-4.2.0_RELEASE/   
inflating: BDABaseImage-ol6-4.2.0_RELEASE/biosconfig     
inflating: BDABaseImage-ol6-4.2.0_RELEASE/makebdaimage    
extracting: BDABaseImage-ol6-4.2.0_RELEASE/BDABaseImage-ol6-4.2.0_RELEASE.md5sum     
inflating: BDABaseImage-ol6-4.2.0_RELEASE/reimagerack     
inflating: BDABaseImage-ol6-4.2.0_RELEASE/reimagecluster     
inflating: BDABaseImage-ol6-4.2.0_RELEASE/BDABaseImage-ol6-4.2.0_RELEASE.iso      
creating: BDABaseImage-ol6-4.2.0_RELEASE/Extras/    
creating: BDABaseImage-ol6-4.2.0_RELEASE/Extras/ODI/   
inflating: BDABaseImage-ol6-4.2.0_RELEASE/Extras/ODI/odi_generic.jar      
creating: BDABaseImage-ol6-4.2.0_RELEASE/Extras/NOSQL/   
inflating: BDABaseImage-ol6-4.2.0_RELEASE/Extras/NOSQL/kv-ee-3.2.5-0.noarch.rpm     
inflating: BDABaseImage-ol6-4.2.0_RELEASE/Extras/NOSQL/kv-ce-3.2.5-0.noarch.rpm      
creating: BDABaseImage-ol6-4.2.0_RELEASE/Extras/BALANCER/   
inflating: BDABaseImage-ol6-4.2.0_RELEASE/Extras/BALANCER/orabalancer-2.4.0-h2.noarch.rpm    
extracting: BDABaseImage-ol6-4.2.0_RELEASE/Extras/BALANCER/orabalancerdemo-2.4.0-h2.zip    
extracting: BDABaseImage-ol6-4.2.0_RELEASE/Extras/BALANCER/orabalancer-2.4.0-h2.zip      
creating: BDABaseImage-ol6-4.2.0_RELEASE/Extras/CELL/   
inflating: BDABaseImage-ol6-4.2.0_RELEASE/Extras/CELL/bd_cell-12.1.2.0.100_LINUX.X64_150225.1100-1.x86_64.rpm     
inflating: BDABaseImage-ol6-4.2.0_RELEASE/Extras/CELL/exadata-oswatcher-12.1.2.2.0.150512-1.noarch.rpm     
inflating: BDABaseImage-ol6-4.2.0_RELEASE/README.txt     
inflating: BDABaseImage-ol6-4.2.0_RELEASE/ubiosconfig     
inflating: BDABaseImage-ol6-4.2.0_RELEASE/json-select

Change to the subdirectory created in the previous step:
```
# cd BDABaseImage-ol6-4.2.0_RELEASE 
```
Complete one of the following procedures:
- To reimage an Oracle Big Data Appliance that was configured for a customer network to the same customer network settings, execute the ./reimagerack command.
- To reimage an appliance that still has the factory settings:
  1. Ensure that /opt/oracle/bda/network.json does not exist.
  2. Execute the ./reimagerack command.
- To restore the factory network settings on a rack configured with custom network settings:
  1. Copy /opt/oracle/bda/network.json to a safe location outside Oracle Big Data Appliance.
  2. Ensure that /opt/oracle/bda/BdaShip.json exists.
  3. Reimage the rack:
```
./reimagerack deploy ship
```
See reimagerack for the complete syntax of the command.
Run Mammoth. See "Installing the Software."

10.9.3 Reimaging an Oracle Big Data Appliance Cluster

Follow this procedure to reimage a group of servers that the Mammoth utility has deployed as a single cluster. This can only be used on a group of servers that has been deployed as a single cluster using Mammoth. You can either reapply the network settings that are in currently in effect or revert to factory settings.

Caution:

If you reimage a cluster, then all files and data on the cluster are deleted. Reimaging is not required for a software upgrade.

Mandatory Prerequisites

You must rename the file <rack_name>-network.json generated by the Oracle Big Data Appliance configuration utility to network.json. Before reimaging, /opt/oracle/bda/network.json must exist on all nodes of the cluster.

To reinstall the base image on all servers in a cluster:

Tip:

The reimagecluster utility creates an installer on the internal USB drive of each server in the rack, initializes the installation, and reboots each server. While the servers are down for reboot, you can log on to Integrated Lights Out Manager (ILOM) on each server to monitor progress:

<server name>-ilom.<domain>.com

Click on Remote Control -> Redirection -> Launch Remote Console to launch a Java console where you can monitor the re-imaging.

If the Oracle Big Data Appliance software was installed previously on the rack, then save the/opt/oracle/BDAMammoth/mammoth-<rack_name>.params file (or /opt/oracle/BDAMammoth/<cluster_name>-config.json) to a safe place outside Oracle Big Data Appliance. If the cluster has been upgraded, the file may exist in /opt/oracle/BDAMammoth/previous-BDAMammoth directory.
Download the latest Oracle Big Data Appliance Base Image tarball from My Oracle Support to a temporary directory. Copy it onto the first (bottom) server of the cluster to be re-imaged.

For Oracle Big Data Appliance 4.6, use the 4.5 base image, which is MOS Patch Patch 23317761
If re-imaging to your current settings, verify that /opt/oracle/bda/network.json reflects the intended network configuration. If it does not, then generate a new file using Oracle Big Data Appliance Configuration Generation Utility.
Establish an SSH connection to the first server and log in as root.
Ensure that password-less SSH has been setup on the cluster (dcli -C hostname should run without errors). If it has not been setup, run setup-root-ssh -C.
```
# setup-root-ssh -C 
Enter root password: 
... 
... 
setup-root-ssh succeeded
```
dcli -C hostname must run without errors and return the host names of all Oracle Big Data Appliance servers in the cluster. If not, then follow the steps in "Setting Up Passwordless SSH". Do not continue until the dcli -C hostname command runs successfully on all servers.

Ensure that there is at least 4 GB free in the / partition of all servers:

# dcli -C df -h /
10.xxx.xxx.xx: Filesystem      Size  Used Avail Use% Mounted on 
10.xxx.xxx.xx: /dev/md2        459G   49G  387G  12% / 
10.xxx.xxx.xx: Filesystem      Size  Used Avail Use% Mounted on
...

Unzip the downloaded base image ZIP file. For example:

# unzip p23317761_450_Linux-x86-64.zip
Archive: p23317761_450_Linux-x86-64.zip
creating: BDABaseImage-ol6-4.5.0_RELEASE/
creating: BDABaseImage-ol6-4.5.0_RELEASE/Extras/
creating: BDABaseImage-ol6-4.5.0_RELEASE/Extras/BALANCER/
extracting: BDABaseImage-ol6-4.5.0_RELEASE/Extras/BALANCER/orabalancer-2.7.0-h2.zip
inflating: BDABaseImage-ol6-4.5.0_RELEASE/Extras/BALANCER/orabalancer-2.7.0-h2.noarch.rpm
extracting: BDABaseImage-ol6-4.5.0_RELEASE/Extras/BALANCER/orabalancerdemo-2.7.0-h2.zip
creating: BDABaseImage-ol6-4.5.0_RELEASE/Extras/CELL/
inflating: BDABaseImage-ol6-4.5.0_RELEASE/Extras/CELL/bd_cell-12.1.2.0.100_LINUX.X64_160506.11   
...

Change to the subdirectory created in the previous step:
```
# cd BDABaseImage-ol6-4.5.0_RELEASE 
```
Use one of the following procedures to re-image the entire Oracle Big Data Appliance. The first alternative is to retain your existing configuration . The second is to restore to factory settings (prior to Oracle Big Data Appliance Configuration Utility setup) .

Reimage and Retain Existing Network Settings
1. At /opt/oracle/bda/, check that both network.json and BdaShip.json are listed.
```
# ls *.json
network.json BdaShip.json
```
  If network.json is not listed, locate the customer-specific network.json file (the one that describes the configuration you want to keep). First check to ensure that this file defines correct network settings. If it does not, generate a new file using Oracle Big Data Appliance Configuration Generation Utility. Then copy the file to /opt/oracle/bda and rename it to network.json.
2. Use dcli to copy network.json to all nodes of the cluster.
```
#  dcli -C -f /opt/oracle/bda/network.json -d /opt/oracle/bda/
```
3. Execute the ./reimagecluster command. Partial output of the command is shown below.
```
# ./reimagerack
Passwordless SSH set up to all hosts : xxx.xxx.xx.xx xxx.xxx.xx.xx xxx.xxx.xx.xx xxx.xxx.xx.xx xxx.xxx.xx.xx 
 xxx.xxx.xx.xx 
Using /tmp/BDABaseImage-ol6-4.5.0_RELEASE/BDABaseImage-ol6-4.5.0_RELEASE.iso to re-image
BDABaseImage-ol6-4.5.0_RELEASE.iso: OK
Re-imaging to existing customer network settings

FINAL CONFIRMATION : All of the following hosts will be completely erased, re-imaged and initialized including resetting of the iloms: xxx.xxx.xx.xx xxx.xxx.xx.xx xxx.xxx.xx.xx xxx.xxx.xx.xx xxx.xxx.xx.xx xxx.xxx.xx.xx
Proceed [y/n]?
y
Running hardware checks on all nodes...
...
Rebooting all servers  
Broadcast message from root@bda1node01.example.com (unknown) at 15:58 ...  
The system is going down for reboot NOW! 
```
To Reimage to Factory Network Settings
1. As described in Step 1, check that /opt/oracle/bda/network.json has been copied to a safe location outside Oracle Big Data Appliance and that /opt/oracle/bda/BdaShip.json exists.
2. Reimage the cluster to factory-shipped settings with the following command:
```
# ./reimagecluster deploy ship
```
Whether you re-imaged with existing network settings or restored to factory settings, the final steps are the same. After reboot from reimagecluster completes, run setup-root-ssh to re-establish a connection to all the servers in the rack. This is necessary so that the dcli command can be run to reboot all the servers again.
```
# setup-root-ssh
Enter root password:
...
setup-root-ssh succeeded
```
Reboot all servers once again using the dcli reboot command.
```
# dcli reboot
```
Verify that the files /root/BDA_IMAGING_SUCCEEDED and /root/BDA_REBOOT_SUCCEEDED exist on each server in the rack.
```
# dcli ls -lrt /root/BDA_* 
```
Check that the date and timestamp of /root/BDA_REBOOT_SUCCEEDED correspond to the time of the reboot in the reimaging process. The file may be zero-byte or may contain some INFO messages.

Verify the installed base image version:

# imageinfo
Big Data Appliance Image Info
IMAGE_VERSION : 4.5.0
IMAGE_CREATION_DATE : Wed May 18 20:19:54 UTC 2016
IMAGE_LABEL : BDA_4.5_LINUX.X64_RELEASE
LINUX_VERSION : Oracle Linux Server release 6.7
KERNEL_VERSION : 2.6.39-400.264.1.el6uek.x86_64
BDA_RPM_VERSION : bda-4.5.0-1.el6.x86_64
OFED_VERSION : OFED-IOV-1.5.5-2.0.0088
JDK_VERSION : jdk1.8.0_92-1.8.0_92-fcs.x86_64

You can now run the Mammoth utility to bring the system up to the latest Oracle Big Data Appliance version. See About the Mammoth Utility.

10.10 Installing a One-Off Patch

One-off patch bundles provide a fix to specific bugs in one or more releases. You use Mammoth to apply the patch to your cluster.

To install a one-off patch bundle:

Download the patch bundle from the Automated Release Update (ARU) system to a directory such as /tmp on the first node of the Oracle Big Data Appliance cluster.

The file is named BDA-patch-release-patch.zip. The examples in this procedure use the name BDA-patch-4.3.0-123456.zip.

Unzip the file. For example:

# unzip BDA-patch-4.3.0-123456.zip
Archive:  BDA-patch-4.3.0-123456.zip
   creating: BDA-patch-4.3.0-123456/
  inflating: BDA-patch-4.3.0-123456/BDA-patch-4.3.0-123456.run 
  inflating: BDA-patch-4.3.0-123456/README.txt

Change to the patch directory created in Step 2. For example:
```
$ cd BDA-patch-4.3.0-123456
```

Extract the contents of the run file. For example:

$ ./BDA-patch-4.3.0-123456.run
Big Data Appliance one-off patch 123456 for v4.3.0 Self-extraction
 
Removing existing temporary files
 
Generating /tmp/BDA-patch-4.3.0-123456.tar
Verifying MD5 sum of /tmp/BDA-patch-4.3.0-123456.tar
/tmp/BDA-patch-4.3.0-123456.tar MD5 checksum matches
 
Extracting /tmp/BDA-patch-4.3.0-123456.tar to /opt/oracle/BDAMammoth/patches/123456
Removing temporary files
 
Please "cd /opt/oracle/BDAMammoth" before running "./mammoth -p 123456"

Change to the BDAMammoth directory:
```
$ cd /opt/oracle/BDAMammoth
```
Install the patch. For example:
```
$ ./mammoth -p 123456
```
Mammoth prompts you to choose whether or not you want to load the patch as rolling upgrade. (See "About Rolling Upgrades")

Alternatively, you can use the bdacli command. See "bdacli".

10.11 Mammoth Software Installation and Configuration Utility

You must log in as root on the first server and change to the /opt/oracle/BDAMammoth directory to use Mammoth. It has this syntax:

./mammoth option [cluster_name] ]

In this command, cluster_name is the name of the cluster You must enter cluster_name in the first command exactly as it appears in (cluster_name-config.json). Afterward, cluster_name defaults to the rack specified in a previous mammoth command.

You must finish installing one rack before starting the installation of another rack.

Example 10-1 Mammoth Syntax Examples

This command displays Help for the Mammoth utility:

./mammoth -h

This command does a complete install on rack bda3:

./mammoth -i bda3

This command runs steps 2 through 6 on the rack being set up:

./mammoth -r 2-6

This command generates a parameter file to add six new servers over a starter rack, beginning with node07, to an existing cluster:

./mammoth -e node07 node08 node09 node10 node11 node12

10.11.1 Mammoth Options

The syntax of the mammoth command supports the configuration of new clusters. You can also use the Mammoth bundle to upgrade from earlier releases.

-c

Run the Oracle Big Data Appliance cluster checks.

-e newnode1 newnode2 newnode3...

Generates a parameter file for a group of servers being added to a cluster. The file is named cluster_name-config.json if the new servers are in a rack outside the cluster.

To identify the new servers, list them on the command line. The servers can be in the same rack or multiple racks.

No passwords are included in the parameter file, so you must enter them when running Mammoth.

On Oracle NoSQL Database clusters, Mammoth prompts for the kind of zone for the new nodes. You can choose from an existing zone, a new primary zone, or a new secondary zone. When adding to an existing zone, Mammoth lists the zones that you can use. When creating a new zone, Mammoth prompts for the zone name and replication factor.

-h

Displays command Help including command usage and a list of steps.

-i cluster_name

Runs all mandatory steps on the cluster, equivalent to -r 1-18 for a full rack. Use this option when configuring a new rack or adding a group of servers to a cluster.

-l

Lists the steps of the Mammoth utility.

-p

Upgrades the software on the cluster to the current version or installs a one-off patch.

-r n-N

Runs steps n through N of the Mammoth while no errors occur.

-s n [cluster_name]

Runs step n. Enter cluster_name to identify another cluster on the same rack. See the -e option.

-v

Displays the version number of the Mammoth.

10.11.2 Mammoth Installation Steps

Following are descriptions of the steps that the Mammoth performs when installing the software.

Step 1 PreinstallChecks

This step performs several tasks:

Validates the configuration files and prompts for the passwords.
Sets up a Secure Shell (SSH) for the root user so you can connect to all addresses on the administrative network without entering a password.
Sets up passwordless SSH for the root user on the InfiniBand network.
Generates /etc/hosts from the configuration file and copies it to all servers so they use the InfiniBand connections to communicate internally. The file maps private IP addresses to public host names.
Sets up an alias to identify the node where the Mammoth is run as the puppet master node. For example, if you run the Mammoth from bda1node01 with an IP address 192.168.41.1, then a list of aliases for that IP address includes bda1node01-master. The Mammoth uses Puppet for the software installation.
Checks the network timing on all nodes. If the timing checks fail, then there are unresolved names and IP addresses that will prevent the installation from running correctly. Fix these issues before continuing with the installation.

This step also performs a variety of hardware and software checks. A failure in any of these checks causes the Mammoth to fail:

The ARP cache querying time is 2 seconds or less.
All server clocks are synchronized within 10 seconds of the current server.
All servers succeeded on the last restart and generated a /root/BDA_REBOOT_SUCCEEDED file.
The bdacheckhw utility succeeds.
The bdachecksw utility succeeds.

Step 2 SetupPuppet

This step configures puppet agents on all nodes and start them, configures a puppet master on the node where the Mammoth is being run, waits for the agents to submit their certificates, and automates their signing. This step also changes the root password on all nodes (optional). After this step is completed, Puppet can deploy the software.

Puppet is a distributed configuration management tool that is commonly used for managing Hadoop clusters. The puppet master is a parent service and maintains a Puppet repository. A puppet agent operates on each Hadoop node.

A file named /etc/puppet/puppet.conf resides on every server and identifies the location of the puppet master.

Puppet operates in two modes:

Periodic pull mode in which the puppet agents periodically contact the puppet master and asks for an update, or
Kick mode in which the puppet master alerts the puppet agents that a configuration update is available, and the agents then ask for the update. Puppet operates in kick mode during the Mammoth installation.

In both modes, the puppet master must trust the agent. To establish this trust, the agent sends a certificate to the puppet master node where the sys admin process signs it. When this transaction is complete, the puppet master sends the new configuration to the agent.

For subsequent steps, you can check the Puppet log files on each server, as described in "What If an Error Occurs During the Installation?".

Step 3 UpdateBaseImage

Installs the most recent Oracle Big Data Appliance image and system parameter settings. The system automatically restarts if the kernel is upgraded.

Step 4 PrepareBaseImage

Copies third-party licenses to /opt/oss/src/OSSLicenses.pdf on every server, as required by the licensing agreements.
Copies third-party software source code to /opt/oss/src/ on every server, as required by the licensing agreements.

Mammoth does not copy the source code to Oracle NoSQL Database clusters.
Creates the hdfs and mapred users, and the hadoop group. It also creates the oracle user and the dba and oinstall groups.

The various packages installed in later steps also create users and groups during their installation.

See Also:

Oracle Big Data Appliance Software User's Guide for more information about users and groups.
The NameNode data is copied to multiple places to prevent a loss of this critical information should a failure occur in either the disk or the entire node where they are set up.

Step 5 SetupMySQL

Installs and configures MySQL Database. This step creates the primary database and several databases on node03 for use by Cloudera Manager. It also sets up replication of the primary database to a backup database on node02.

Mammoth does not install MySQL Database on Oracle NoSQL Database clusters.

Step 6 SetupKDC

Configures the Kerberos Key Distribution Center (KDC) that is used to set up security on the cluster.

Step 7 InstallHadoop

Installs all packets in Cloudera's Distribution including Apache Hadoop (CDH) and Cloudera Manager. It then starts the Cloudera Manager server on node03 and configures the cluster.

Mammoth does not install CDH or Cloudera Manager on Oracle NoSQL Database clusters.

Step 8 StartHadoopServices

Starts the agents on all nodes and starts all CDH services. Also enables HTTPS for Cloudera Manager, Hue, and Oozie if HTTPS/ Network Encryption was enabled.

After this step, you have a fully functional Hadoop installation.

Cloudera Manager runs on port 7180 of node03. You can open it in a browser, for example:

http://bda1node03.example.com:7180

In this example, bda1node02 is the name of node02 and example.com is the domain. The default user name and password is admin. You are prompted to change the password at the end of this step.

Note that Mammoth does not install or start CDH services on Oracle NoSQL Database clusters.

Step 9 InstallBDASoftware

Installs the server-side components of Oracle Big Data Connectors, if this option was selected in Oracle Big Data Appliance Configuration Generation Utility. Oracle Big Data Connectors must be licensed separately. Optional.

Installs the server-side components of Oracle Big Data SQL, if this option was selected in Oracle Big Data Appliance Configuration Generation Utility. For Oracle Big Data SQL support of Oracle NoSQL Database, this step installs the client libraries (kvclient.jar) on the CDH nodes. Oracle Big Data SQL must be licensed separately. Optional.

Installs Oracle NoSQL Database on clusters allocated to its use. Enterprise Edition requires a separate license.

Step 10 SetupHadoopSecurity

Actions performed in this step are based upon the choices you made in the Oracle Big Data Appliance Configuration Utility prior to installation.

Configures HTTPS/ Network Encryption if this option was selected.
Configures HDFS Transparent Encryption if this option was selected.

Mammoth sets up the The Navigator Key Trustee services required by HDFS Transparent Encryption based on whether you chose local or external Key Trustee servers in the configuration utility prior to installation:
- If you had selected Setup Key Trustee Servers on the BDA in the configuration utility, Mammoth will setup up active and passive Key Trustee servers locally on the Oracle Big Data Appliance.
- If you chose to use external Key Trustee Servers, the connection to the external servers is configured in this setup. (Note that you must have already configured the external Navigator Key Trustee Servers as described in MOS Doc ID 2112644.1)
Configures Kerberos authentication if this option was selected. There are no prerequisites if you set up the key distribution center on Oracle Big Data Appliance. Otherwise, see "Installation Prerequisites."

Step 11 SetupASR

Installs and configures Auto Service Request (ASR). Optional.

This step does the following:

Installs the required software packages
Configures the trap destinations
Starts the monitoring daemon

To activate the assets from ASR Manager, see "Verifying ASR Assets".

Note:

For this step to run successfully, the ASR host system must be up with ASR Manager running and configured properly. See Setting Up Auto Service Request.

Step 12 CleanupInstall

Performs the following:

Changes the Cloudera Manager password if specified in the Installation Template.
Deletes temporary files created during the installation.
Copies log files from all nodes to subdirectories in /opt/oracle/bda/install/log.
Runs cluster verification checks, including TeraSort, to ensure that everything is working properly. It also generates an install summary. All logs are stored in a subdirectory under /opt/oracle/bda/install/log on node01.

Step 13 CleanupSSHroot (Optional)

Removes passwordless SSH for root that was set up in "PreinstallChecks".

10.12 Oracle Big Data Appliance Base Imaging Utilities

The following utilities are distributed in the base image bundle. To run the utilities, you must be logged in as root.

10.12.1 makebdaimage

Reinstalls the base image on a single server, a cluster, or an entire rack. Both reimagecluster and reimagerack call this utility.

The makedbaimage utility has this syntax:

makebdaimage [--usb | --usbint] [--noiloms] /path/BDABaseImage-version_RELEASE.iso /path/BDdaDeploy.json target_servers

Options

--usb | --usbint

Identifies the USB port that will be used for reimaging. Use --usb for the external USB drive, or --usbint for the internal drive. The internal USB drive contains a full Linux installation.

To use the --usbint option, you must be logged in to the target server; otherwise, you reimage the server using the wrong network information.

--noiloms

The reimaging process does not alter the Oracle ILOMs on the target servers.

target_servers

One or more servers where the image will be installed, which you specify using one of these formats:

node_number

Identifies one server for reimaging. Enter the position of the server in the rack, from 1 to 18, counting from bottom to top.
from.json to.json

Identifies the current configuration (from.json) and the desired configuration (to.json). The JSON files can be either the factory configuration in BdaShip.json or a custom configuration in network.json. You can find them in /opt/oracle/bda on configured servers.
to.json: Concatenation of JSON files similar to network.json and BdaShip.json, but containing an extra ETH0_MACS array entry. The first entry in the concatenated file with a matching eth0_mac entry is used when reimaging. The to.json file is used as is.

10.12.2 reimagecluster

Reimages all servers in the cluster in parallel using dcli and makebdaimage.

The reimagecluster utility has this syntax:

reimagecluster [--no-iloms] [from.json [to.json]]

Prerequisites

Verify that the following command returns the list of servers in the cluster:
```
$ dcli -C hostname
```
Ensure that exactly one BDABaseImage-version_RELEASE*.iso file is in the current directory.

Options

--no-iloms

The reimaging process does not alter the Oracle ILOMs on the target servers.

from.json

The full path to the current configuration file, either BdaShip.json or network.json.

This option defaults to /opt/oracle/bda/network.json. If network.json is missing, then the option defaults to /opt/oracle/bda/BdaShip.json. The servers must already be set to the values in the JSON file used by reimagecluster.

to.json

The full path to the new network configuration in effect after reimaging. It can be either a network.json or a BdaShip.json file.

10.12.3 reimagerack

Reimages all servers in the rack in parallel using dcli and makedbaimage.

The reimagerack utility has this syntax:

reimagerack [--no-iloms] [--no-macs] [--hosts n1, n2...] [from.json [to.json]]

Prerequisites

Verify that the following command returns the list of servers in the rack:
```
$ dcli -hostname
```
Ensure that exactly one BDABaseImage-version_RELEASE*.iso file is in the current directory.

Options

--no-iloms

The reimaging process does not alter the Oracle ILOMs on the target servers.

--no-macs

The utility does not retrieve the server MAC addresses. Instead, it uses the InfiniBand cabling. Use this option only when restoring the factory settings; you must include both JSON files (from.json and to.json) in the command line.

--hosts n1, n2...

Restricts reimaging to the servers identified by a comma-delimited list of host names or IP addresses, whichever one dcli accepts. All servers must be in the list of target servers identified by the dcli -t command.

This option enforces the --no-macs option, so its restrictions also apply.

from.json

The full path to the current configuration file, either BdaShip.json or network.json.

to.json

The full path to the new network configuration in effect after reimaging. It can be either a network.json or a BdaShip.json file.