10 Introduction to Integration with Oracle Big Data SQL

This section describes the integration of Oracle NoSQL Database with Oracle Big Data SQL version 4.x. The goal of the Oracle Big Data SQL product is to allow users to employ the power of the Oracle SQL SELECT statement to manage and manipulate data stored in a number of different locations. Specifically, Oracle Big Data SQL is designed to provide SQL access to data stored in Apache Hadoop Distributed File System (HDFS), Apache Hive, various NoSQL databases - including Oracle NoSQL Database - as well as various relational databases. Oracle Big Data SQL achieves this by presenting Hadoop HDFS, Apache Hive, Oracle NoSQL Database, and the various other data sources as enhanced Oracle external tables of the Oracle Relational Database Management System (RDBMS) (See Managing External Tables in the Oracle Database Administrator's Guide). Oracle Big Data SQL maps the external semantics of accessing data from those sources - horizontal parallelism, location, and schema - to the Oracle Relational Database Management System's internal semantics. For more information on creating external table for Oracle NoSQL Database in Oracle Big Data SQL, see Create an External Table for Oracle NoSQL Database section in the Oracle Big Data SQL User's Guide.

To use Oracle Big Data SQL SELECT statements to query data stored in an Oracle NoSQL Database table, an Oracle Big Data SQL enabled external table must be created over the Oracle NoSQL Database table via an Apache Hive external table. In addition to presenting the steps to take to create such external tables, this document also presents a number of Oracle Big Data SQL queries that can be run against example table data contained in an Oracle NoSQL Database store, where the store can be either secure or non-secure.