1 Overview of Oracle Big Data Manager

Oracle Big Data Manager makes it easy to copy data between data sources.

What is Oracle Big Data Manager?

Oracle Big Data Manager is a browser-based tool that gives you broad capabilities to manage data across your enterprise. You can use it to connect to and interconnect a range of supported Oracle and non-Oracle data storage providers, including Oracle Database, Oracle Object Store, MySQL, as well as Hadoop, S3, and GitHub. After you register storage providers with Big Data Manager, you can preview data and (depending upon the accessibility of each storage provider) compare, copy, and move data between them. With a Hadoop storage provider, you can also move data internally within HDFS, do data import/export and analytics with Apache Zeppelin, and import data into Hive tables. You can also upload data from your local computer to a selected storage provider.

Oracle Big Data Manager provides several methods for data transfer. You can use the console, which includes drag and drop data selection. Python and Java SDKs are available for building data management scripts and applications. There is also CLI for creating and administering data management jobs and tools for monitoring job status.

The Oracle Big Data Manager administrator can create other user accounts and assign roles to those accounts.

Feature Summary

The full list of Oracle Big Data Manager features is as follows:

  • The Oracle Big Data Manager console, accessible through a browser-based GUI.

  • Graphical tools for:

    • Comparing, copying, and moving data between storage providers.

    • Uploading files, extracting data from ZIP archives, and browsing data in Oracle Database and MySQL database.

    • Scheduling, managing, and monitoring copy, move, and compare jobs.

    • Importing data into Apache Hive.

    • Importing and exporting Apache Zeppelin notes; and creating and running notes.

    • Managing storage providers, users, and roles.

    • Monitoring the health of the cluster and the services running on it.

    • Processing and analyzing data via Apache Zeppelin notes.

  • The bdm-cli utility, for copying data and managing copy jobs from the command line.

  • Python and Java SDKs, for integrating Oracle Big Data Manager operations into applications

Supported Storage Providers

Oracle Big Data Manager supports the following storage providers, although not all tasks are supported in every provider:

  • Hadoop Distributed File System (HDFS)

  • Oracle Cloud Infrastructure Object Storage Classic

  • Amazon Simple Storage Service (S3)

  • Github

  • Oracle Database

  • Apache Hive

  • MySQL database

How is Oracle Big Data Manager Installed and Configured?

Oracle Big Data Manager is installed automatically by the Mammoth installation of the Oracle Big Data Appliance software release. By default, it is installed on the same node where Cloudera Configuration Manager runs (usually node 3). No manual configuration is needed except to register storage providers.

The default port is 8890. The default password for the administrative account is the same as the Configuration Manager password. These are specified by the BDP_PWD and BDM_PORT parameters in the <cluster name>-config.json file. This file is one of the outputs generated when you use the Oracle Big Data Appliance Configuration Generation Utility to define your cluster and rack configuration.

See Also:

The chapter Using the Oracle Big Data Appliance Configuration Utility in the Oracle Big Data Appliance Owner’s Guide describes <cluster name>-config.json , which contains the Mammoth installation parameters.

Note:

Oracle Big Data Manager now supports AD Kerberos. It also supports MIT Kerberos with KDCs that are internal or external to the cluster.