Upgrade requirements

Before upgrading, make sure your system meets the following requirements.

Requirement Description
Hadoop BDD supports the following:
  • Cloudera Distribution for Hadoop (CDH) 5.10.x for BDA 4.8
  • Cloudera Distribution for Hadoop (CDH) 5.11.x for BDA 4.9
  • Cloudera Distribution for Hadoop (CDH) 5.12.x for BDA 4.10
  • Cloudera Distribution for Hadoop (CDH) 5.13.x for BDA 4.11
  • Cloudera Distribution for Hadoop (CDH) 5.14.x for BDA 4.12
  • Cloudera Distribution for Hadoop (CDH) 5.15.x for BDA 4.13
  • Cloudera Distribution for Hadoop (CDH) 5.16.x for BDA 4.14
  • Hortonworks Data Platform (HDP) 2.6
  • MapR 5.2.1+

If you don't have one of the above installed, upgrade your Hadoop cluster before upgrading BDD. For instructions, refer to the documentation for your Hadoop distribution.

Additionally, BDD requires Spark 1.6.x. Be sure to check your current version of Spark and upgrade it, if necessary.

Before upgrading your Hadoop cluster, be aware of the following:
  • You can't switch to a different Hadoop distribution without reinstalling BDD. For example, if you currently have CDH, you can't switch to HDP.
  • You should stop your BDD cluster before you upgrade Hadoop. Once Hadoop has been upgraded, follow the procedure described in "Switching Hadoop versions" in the Administrator's Guide to enable BDD to work with the new version.
  • After upgrading Hadoop, verify that the YARN configuration changes you made before installing BDD weren't reset. For more information, see "YARN setting changes" in the Installation Guide.
Security BDD can run on Hadoop clusters secured with TLS/SSL and HDFS data at rest encryption. If you don't currently have these enabled, you can configure them in your Hadoop cluster before upgrading BDD and then enable them for BDD at upgrade time.

See the section "Security options" in the Installation Guide for instructions. Also, be sure to set the HADOOP_CERTIFICATES_PATH property in bdd.conf before upgrading.

Operating system BDD supports the following operating systems:
  • Oracle Enterprise Linux 6.4+, 7.1, 7.2
  • Red Hat Enterprise Linux 6.4+, 7.1, 7.2

If you don't have one of the above installed, upgrade your OS before upgrading BDD.

Additionally, you must clear the $http_proxy environment variable. You can do this by running the following command:
export http_proxy=
OS user The entire upgrade process must be performed by the bdd user, the OS user that installed BDD and runs all BDD processes. This user must have the same UID on all nodes in the cluster, including Hadoop nodes.
Dgraph databases (index) The Dgraph database (index) directory must contain enough free space to double your databases, as this may occur temporarily during the upgrade process.

Although the current version of BDD enables you to store your Dgraph databases on HDFS, you can't move them there until after the upgrade. For instructions on doing this, see the Administrator's Guide.

Transform Service If you're upgrading from 1.1.x, the upgrade script will install the Transform Service on the nodes you specify in bdd.conf. For best performance, these should be Managed Servers.

The Transform Service requires at least 11GB of RAM and 39GB of virtual memory, and may require more depending on the size of its workloads. Be sure that the nodes you install it on meet these requirements.

Studio database Studio supports the following database versions:
  • Oracle 11g, 12c (12.1.0.1.0+)
  • MySQL 5.5.3+
Workflow Manager Service The upgrade script will install the Workflow Manager Service on the node you specify in bdd.conf. Note that you can't install it on more than one node.
The Workflow Manager Service requires a database (including username, password, and schema) of one of the following types, which you must create before upgrading:
  • Oracle 11g, 12c (12.1.0.1.0+)
  • MySQL 5.5.3+

For instructions, see "Workflow Manager Service database requirements" in the Installation Guide.

Perl modules Verify that the following Perl modules are installed on the Admin Server:
  • Mail::Address
  • XML::Parser

If any is missing, install it according to the instructions in "Installing the required Perl modules" in the Installation Guide.

Linux utilities
  • curl 7.19.7+ (with support for the --tlsv1.2 and --negotiate options) and Network Security Services (NSS) 3.16.1+ must be installed on all Studio nodes.
  • Apache Ant 1.7.1+ must be installed and added to the PATH on all nodes in the cluster, including Hadoop nodes.
Browser BDD supports the following browsers:
  • Microsoft Edge
  • Internet Explorer 11
  • Firefox ESR
  • Chrome for Business
  • Safari Mobile 9.x