The following sections describe Oracle Big Data Discovery (BDD)
and how it integrates with other software products. They also describe some of
the different cluster configurations BDD supports.
Big Data Discovery overview
Oracle Big Data Discover is made up of a number of distinct components: three main ones and a number of others that work closely with them.
Integration with Hadoop
BDD runs on top of an existing Hadoop cluster, which provides a number of components and tools that BDD requires to process and manage data. For example, the source data you load into BDD is stored in HDFS and processed by Spark on YARN.
Integration with WebLogic
WebLogic Server provides a J2EE container for hosting and managing Studio and the Dgraph Gateway, which are J2EE applications. Additionally, WebLogic's Admin Server plays an important role in installing and administering BDD.
Integration with Jetty
Jetty provides open-source javax.servlet containers for hosting the Transform Service and the Workflow Manager Service.
Cluster configurations and diagrams
BDD supports many different cluster configurations. The following sections describe three suitable for demonstration, development, and production environments, and their possible variations.
A note about component names
Some of the installation files and scripts may contain references to the Endeca Server, which is a legacy name for the Dgraph Gateway. This document refers to the component as the Dgraph Gateway, and notes any discrepancies to avoid confusion.