
Access Cross-Region Oracle Cloud Infrastructure Buckets from Oracle Big Data Service Cluster

Introduction

This tutorial shows how to access cross-region Oracle Cloud Infrastructure (OCI) buckets from an Oracle Big Data Service cluster. With cross-region bucket access enabled, an Oracle Big Data Service cluster can read from and write to buckets located in regions other than its own. This is useful for organizations with a multi-region architecture, where data must be shared, backed up, and managed across regions.

Objectives

Prerequisites

Task 1: Enable Cross-Region Bucket Access in Oracle Big Data Service

  1. In the Apache Ambari console, navigate to Services, HDFS, CONFIGS, and click ADVANCED.

  2. In the Custom core-site section, add the following property.

    fs.oci.client.multiregion.enabled=true
    


  3. Save the configuration and restart any necessary services in Apache Ambari.

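
After the restart, the setting can be checked from a cluster node. The following is a minimal sketch, assuming the hdfs client on the node reads the Ambari-managed core-site configuration. The is_enabled helper is a hypothetical name used only for illustration.

```shell
#!/usr/bin/env bash
# Hypothetical helper: succeeds only when the given property value is "true".
is_enabled() {
  [ "$1" = "true" ]
}

# Read the effective value from the client configuration when the hdfs CLI exists.
# Assumption: this runs on a cluster node whose client config is managed by Ambari.
if command -v hdfs >/dev/null 2>&1; then
  value="$(hdfs getconf -confKey fs.oci.client.multiregion.enabled 2>/dev/null)"
  if is_enabled "$value"; then
    echo "multi-region access is enabled"
  else
    echo "property is unset or not true: '${value}'"
  fi
else
  echo "hdfs CLI not found; run this on a cluster node"
fi
```

If the property is reported as unset, it may be that the configuration was not saved or that the affected services have not yet been restarted.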

Task 2: Test the Cross-Region Bucket Access

  1. SSH into any node of the Oracle Big Data Service cluster.

  2. Test the access to the cross-region bucket using the following HDFS command.

    hdfs dfs -ls oci://<bucket name>@<namespace>.<region-code>/
    

    Replace <bucket name>, <namespace>, and <region-code> with your bucket name, namespace, and the target region’s code (for example, us-phoenix-1), respectively.
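
The test command above can be wrapped in a small script so the URI is assembled consistently. This is a sketch only: the oci_uri helper and the bucket, namespace, and region values are placeholders introduced for illustration, not names from the product.

```shell
#!/usr/bin/env bash
# Hypothetical helper: build a URI in the oci://<bucket>@<namespace>.<region-code>/ form.
# Arguments: bucket, namespace, region code, optional path inside the bucket.
oci_uri() {
  printf 'oci://%s@%s.%s/%s\n' "$1" "$2" "$3" "${4:-}"
}

# Placeholder values; replace with your own.
BUCKET="my-bucket"
NAMESPACE="mynamespace"
REGION="us-phoenix-1"

uri="$(oci_uri "$BUCKET" "$NAMESPACE" "$REGION")"
echo "Listing ${uri}"

# Run the listing only where the hdfs CLI is available (a cluster node).
if command -v hdfs >/dev/null 2>&1; then
  hdfs dfs -ls "$uri" || echo "listing failed; check the bucket name, namespace, and region code" >&2
else
  echo "hdfs CLI not found; run this on a cluster node"
fi
```

A successful listing of a bucket in a region other than the cluster's own suggests that cross-region access is working.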

Troubleshooting and Tips

Next Steps

Consider automating cross-region data synchronization for backup or disaster recovery with a tool such as Apache Hadoop distributed copy (DistCp).
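
As an illustration, a one-way DistCp copy between buckets in two regions might look like the sketch below. It assumes cross-region access is already enabled as in Task 1. The distcp_cmd helper, the RUN variable, and both bucket URIs are hypothetical placeholders. The script prints the command by default and runs it only when RUN=1 is set.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print the DistCp command for a one-way copy between two URIs.
distcp_cmd() {
  # -update copies only files that are missing or differ at the destination.
  printf 'hadoop distcp -update %s %s\n' "$1" "$2"
}

# Placeholder URIs in the oci://<bucket>@<namespace>.<region-code>/ form; replace with your own.
SRC="oci://source-bucket@mynamespace.us-ashburn-1/data"
DST="oci://backup-bucket@mynamespace.us-phoenix-1/data"

# Dry run: show the command that would be executed.
distcp_cmd "$SRC" "$DST"

# Set RUN=1 to execute the copy; requires the hadoop CLI (a cluster node).
if [ "${RUN:-0}" = "1" ] && command -v hadoop >/dev/null 2>&1; then
  hadoop distcp -update "$SRC" "$DST"
fi
```

A script like this could be scheduled (for example, with cron) to keep a backup bucket in another region reasonably current. Whether -update alone is sufficient depends on your recovery requirements.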

Acknowledgements

More Learning Resources

Explore other labs on docs.oracle.com/learn or access more free learning content on the Oracle Learning YouTube channel. Additionally, visit education.oracle.com/learning-explorer to become an Oracle Learning Explorer.

For product documentation, visit Oracle Help Center.