Obtaining the Hadoop client libraries

Next, obtain the Hadoop client libraries and put them on the Admin Server.

BDD requires a number of client libraries to interact with Hadoop. In a normal Hadoop cluster, these libraries are spread out, making it difficult for BDD to find them all. To solve this issue, the upgrade script adds the required libraries to a single JAR, called the Hadoop fat JAR, and distributes it to all BDD nodes.

The specific libraries you need depend on your Hadoop distribution. The location you put them in is arbitrary, as you will define it in bdd.conf.

Note: If you're upgrading from BDD 1.0, be sure to obtain the libraries for one of the currently supported CDH versions, even though you haven't upgraded to it yet.

If you're upgrading from 1.0 or 1.1.x, you should now apply the upgrade hotfix. If you have 1.2.x, move on to Backing up your current cluster.