Query External Data on Oracle Compute Cloud at Customer

Describes packages and tools to allow the Autonomous AI Database deployments on Exadata Cloud@Customer to load, query, and manage unstructured data by building external tables and external partitioned tables from objects in Data Lakes.

You can use the DBMS_CLOUD package to configure Autonomous AI Database deployments on Exadata Cloud@Customer to access the unstructured data stored on the Oracle Compute Cloud at Customer (C3) Object Storage Bucket.

Overview

You require a data platform that can extract business insights from structured and unstructured data. The combination of structured data available in a data warehouse and unstructured data available in a Data Lake is known as Data Lakehouse.

An Autonomous Data Lakehouse allows data analytics on both structured and unstructured data. The structured data is stored in an Autonomous AI Lakehouse, while the unstructured data is stored in a Data Lake configured on C3 Object Storage Buckets. The Autonomous AI Lakehouse can query, load, and manage objects in the Data Lake by using a database package called DBMS_CLOUD. This package allows the Autonomous AI Database to create external and external partitioned tables on data lake objects such as csv, txt, Avro, and Parquet files. You can build Autonomous Data Lakehouses by combining the capabilities of Autonomous AI Database deployments on Exadata Cloud@Customer and Oracle C3.

An Autonomous Data Lakehouse has two main components: An Autonomous AI Lakehouse and a Data Lake. The Autonomous AI Lakehouse is built using Autonomous AI Database deployments on Exadata Cloud@Customer, while the Data Lake is built using C3 Object Storage Buckets. The Autonomous AI Database hosts the structured relational data, while the Data Lake hosts collections of unstructured data in the form of txt, csv, Avro, Parquet, and other file types. You can query the Autonomous AI Database, and the database determines where to find the results, whether in the structured data, the unstructured data, or a combination of the two. You can use the DBMS_CLOUD package to configure Autonomous AI Database deployments on Exadata Cloud@Customer to load, query, and manage unstructured data by building external and external partitioned tables from objects in Data Lakes.

Prerequisites

Follow the steps below to configure C3 Object Storage Buckets and to provision various Autonomous AI Database components. You need to finish these steps before configuring Autonomous AI Database to communicate with C3 Object Storage Buckets.

Configuring Autonomous AI Database to communicate with the C3 Object Storage Bucket

You need to complete the following two tasks for the Autonomous AI Database to communicate with the C3 Object Storage Bucket:

In Autonomous AI Databases, you cannot complete the above two tasks as a regular user. Oracle Cloud Operations needs to run these two tasks on behalf of the regular user. You need to log in to My Oracle Support (MOS) to create a new Support Request for Oracle Cloud Operations to perform the two tasks. You need to add the following information to the ticket. You will get these values after you complete the steps in .

After Cloud Operations completes the above tasks, log in to the Autonomous AI Database. Use the DBMS_CLOUD.CREATE_CREDENTIAL procedure to provide the Autonomous AI Database with the authentication information required to connect to the C3 Object Storage Bucket. See CREATE_CREDENTIAL for more details.