Title and Copyright Information
Preface
- Audience
- Documentation Accessibility
- Conventions
- Related Information
1 Introducing Oracle GoldenGate for Big Data
- 1.1 What’s Supported in Oracle GoldenGate for Big Data?
  - 1.1.1 Verifying Certification, System, and Interoperability Requirements
  - 1.1.2 What are the Additional Support Considerations?
- 1.2 Setting Up Oracle GoldenGate for Big Data
- 1.3 Configuring Oracle GoldenGate for Big Data
2 Using the BigQuery Handler
- 2.1 Detailing the Functionality
- 2.2 Setting Up and Running the BigQuery Handler
3 Using the Cassandra Handler
- 3.1 Overview
- 3.2 Detailing the Functionality
- 3.3 Setting Up and Running the Cassandra Handler
- 3.4 About Automated DDL Handling
  - 3.4.1 About the Table Check and Reconciliation Process
  - 3.4.2 Capturing New Change Data
- 3.5 Performance Considerations
- 3.6 Additional Considerations
- 3.7 Troubleshooting
4 Using the Elasticsearch Handler
- 4.1 Overview
- 4.2 Detailing the Functionality
- 4.3 Setting Up and Running the Elasticsearch Handler
  - 4.3.1 Configuring the Elasticsearch Handler
  - 4.3.2 About the Transport Client Settings Properties File
- 4.4 Performance Consideration
- 4.5 About the Shield Plug-In Support
- 4.6 About DDL Handling
- 4.7 Troubleshooting
- 4.8 Logging
- 4.9 Known Issues in the Elasticsearch Handler
5 Using the File Writer Handler
- 5.1 Overview
6 Using the HDFS Event Handler
- 6.1 Detailing the Functionality
7 Using the Optimized Row Columnar Event Handler
- 7.1 Overview
- 7.2 Detailing the Functionality
8 Configuring the ORC Event Handler
9 Using the Oracle Cloud Infrastructure Event Handler
- 9.1 Overview
- 9.2 Detailing the Functionality
- 9.3 Configuring the Oracle Cloud Infrastructure Event Handler
- 9.4 Configuring Credentials for Oracle Cloud Infrastructure
- 9.5 Using Templated Strings
- 9.6 Troubleshooting
10 Using the Parquet Event Handler
- 10.1 Overview
- 10.2 Detailing the Functionality
- 10.3 Configuring the Parquet Event Handler
11 Using the S3 Event Handler
- 11.1 Overview
- 11.2 Detailing Functionality
- 11.3 Configuring the S3 Event Handler
12 Using the Command Event Handler
- 12.1 Overview - Command Event Handler
- 12.2 Configuring the Command Event Handler
- 12.3 Using Command Argument Template Strings
13 Using the Redshift Event Handler
- 13.1 Detailed Functionality
- 13.2 Operation Aggregation
  - 13.2.1 Aggregation In Memory
  - 13.2.2 Aggregation using SQL post loading data into the staging table
- 13.3 Unsupported Operations and Limitations
- 13.4 Uncompressed UPDATE records
- 13.5 Error During the Data Load Process
- 13.6 Troubleshooting and Diagnostics
- 13.7 Classpath
- 13.8 Configuration
- 13.9 Redshift COPY SQL Authorization
14 Using the Autonomous Data Warehouse Event Handler
- 14.1 Detailed Functionality
- 14.2 ADW Database Credential to Access OCI ObjectStore File
- 14.3 ADW Database User Privileges
- 14.4 Unsupported Operations/Limitations
- 14.5 Troubleshooting and Diagnostics
- 14.6 Classpath
- 14.7 Configuration
15 Using the HBase Handler
- 15.1 Overview
- 15.2 Detailed Functionality
- 15.3 Setting Up and Running the HBase Handler
- 15.4 Security
- 15.5 Metadata Change Events
- 15.6 Additional Considerations
- 15.7 Troubleshooting the HBase Handler
16 Using the HDFS Handler
- 16.1 Overview
- 16.2 Writing into HDFS in SequenceFile Format
  - 16.2.1 Integrating with Hive
  - 16.2.2 Understanding the Data Format
- 16.3 Setting Up and Running the HDFS Handler
- 16.4 Writing in HDFS in Avro Object Container File Format
- 16.5 Generating HDFS File Names Using Template Strings
- 16.6 Metadata Change Events
- 16.7 Partitioning
- 16.8 HDFS Additional Considerations
- 16.9 Best Practices
- 16.10 Troubleshooting the HDFS Handler
17 Using the Java Database Connectivity Handler
- 17.1 Overview
- 17.2 Detailed Functionality
- 17.3 Setting Up and Running the JDBC Handler
- 17.4 Sample Configurations
18 Using the Java Message Service Handler
- 18.1 Overview
- 18.2 Setting Up and Running the JMS Handler
19 Using the Kafka Handler
- 19.1 Overview
- 19.2 Detailed Functionality
- 19.3 Setting Up and Running the Kafka Handler
- 19.4 Schema Propagation
- 19.5 Performance Considerations
- 19.6 About Security
- 19.7 Metadata Change Events
- 19.8 Snappy Considerations
- 19.9 Kafka Interceptor Support
- 19.10 Kafka Partition Selection
- 19.11 Troubleshooting
20 Using the Kafka Connect Handler
- 20.1 Overview
- 20.2 Detailed Functionality
- 20.3 Setting Up and Running the Kafka Connect Handler
- 20.4 Kafka Connect Handler Performance Considerations
- 20.5 Kafka Interceptor Support
- 20.6 Kafka Partition Selection
- 20.7 Troubleshooting the Kafka Connect Handler
21 Using the Kafka REST Proxy Handler
- 21.1 Overview
- 21.2 Setting Up and Starting the Kafka REST Proxy Handler Services
- 21.3 Consuming the Records
- 21.4 Performance Considerations
- 21.5 Kafka REST Proxy Handler Metacolumns Template Property
22 Using the Kinesis Streams Handler
- 22.1 Overview
- 22.2 Detailed Functionality
  - 22.2.1 Amazon Kinesis Java SDK
  - 22.2.2 Kinesis Streams Input Limits
- 22.3 Setting Up and Running the Kinesis Streams Handler
- 22.4 Kinesis Handler Performance Considerations
- 22.5 Troubleshooting
23 Using the MongoDB Handler
- 23.1 Overview
- 23.2 Detailed Functionality
- 23.3 Setting Up and Running the MongoDB Handler
- 23.4 Reviewing Sample Configurations
24 Using the Metadata Providers
- 24.1 About the Metadata Providers
- 24.2 Avro Metadata Provider
- 24.3 Java Database Connectivity Metadata Provider
- 24.4 Hive Metadata Provider
25 Using the Oracle NoSQL Handler
- 25.1 Overview
- 25.2 Detailed Functionality
- 25.3 Oracle NoSQL Handler Configuration
- 25.4 Review a Sample Configuration
- 25.5 Performance Considerations
- 25.6 Full Image Data Requirements
26 Using the Pluggable Formatters
- 26.1 Using the Avro Formatter
- 26.2 Using the Delimited Text Formatter
  - 26.2.1 Using the Delimited Text Row Formatter
  - 26.2.2 Delimited Text Operation Formatter
- 26.3 Using the JSON Formatter
- 26.4 Using the Length Delimited Value Formatter
- 26.5 Using Operation-Based versus Row-Based Formatting
- 26.6 Using the XML Formatter
27 Using Oracle GoldenGate Capture for Cassandra
- 27.1 Overview
- 27.2 Setting Up Cassandra Change Data Capture
- 27.3 Deduplication
- 27.4 Topology Changes
- 27.5 Data Availability in the CDC Logs
- 27.6 Using Extract Initial Load
- 27.7 Using Change Data Capture Extract
- 27.8 Replicating to RDBMS Targets
- 27.9 Partition Update or Insert of Static Columns
- 27.10 Partition Delete
- 27.11 Security and Authentication
  - 27.11.1 Configuring SSL
- 27.12 Cleanup of CDC Commit Log Files
  - 27.12.1 Cassandra CDC Commit Log Purger
- 27.13 Multiple Extract Support
- 27.14 CDC Configuration Reference
- 27.15 Troubleshooting
28 Connecting to Microsoft Azure Data Lake
29 Connecting to Microsoft Azure Data Lake Gen 2
30 Connecting to Microsoft Azure Event Hubs
31 Connecting to Oracle Streaming Service
32 Stage and Merge Data Warehouse Replication
- 32.1 Steps for Stage and Merge
- 32.2 Snowflake Stage and Merge
  - 32.2.1 Configuration
- 32.3 Snowflake on AWS
  - 32.3.1 Data Flow
  - 32.3.2 Merge Script Variables
- 32.4 Snowflake on Azure
- 32.5 Google BigQuery Stage and Merge
- 32.6 Hive Stage and Merge
A Google BigQuery Dependencies
- A.1 BigQuery 1.11.1
B Cassandra Handler Client Dependencies
- B.1 Cassandra Datastax Java Driver 3.1.0
C Cassandra Capture Client Dependencies
D Elasticsearch Handler Transport Client Dependencies
- D.1 Elasticsearch 7.1.1 with X-Pack 7.1.1
- D.2 Elasticsearch 6.2.3 with X-Pack 6.2.3
- D.3 Elasticsearch 5.1.2 with X-Pack 5.1.2
E Elasticsearch High Level REST Client Dependencies
- E.1 Elasticsearch 7.6.1
F HBase Handler Client Dependencies
- F.1 HBase 2.2.0
- F.2 HBase 2.1.5
- F.3 HBase 2.0.5
- F.4 HBase 1.4.10
- F.5 HBase 1.3.3
- F.6 HBase 1.2.5
- F.7 HBase 1.1.1
- F.8 HBase 1.0.1.1
G HDFS Handler Client Dependencies
- G.1 Hadoop Client Dependencies
H Kafka Handler Client Dependencies
- H.1 Kafka 2.2.1
- H.2 Kafka 2.1.0
- H.3 Kafka 2.0.0
- H.4 Kafka 1.1.1
- H.5 Kafka 1.0.2
- H.6 Kafka 0.11.0.0
- H.7 Kafka 0.10.2.0
- H.8 Kafka 0.10.1.1
- H.9 Kafka 0.10.0.1
- H.10 Kafka 0.9.0.1
I Kafka Connect Handler Client Dependencies
- I.1 Kafka 2.2.1
- I.2 Kafka 2.1.1
- I.3 Kafka 2.0.1
- I.4 Kafka 1.1.1
- I.5 Kafka 1.0.2
- I.6 Kafka 0.11.0.0
- I.7 Kafka 0.10.2.0
- I.8 Kafka 0.10.2.0
- I.9 Kafka 0.10.0.0
- I.10 Kafka 0.9.0.1
- I.11 Confluent Dependencies
- I.12 Confluent 3.2.1
J MongoDB Handler Client Dependencies
- J.1 MongoDB Java Driver 3.4.3
K Optimized Row Columnar Event Handler Client Dependencies
- K.1 ORC Client 1.5.5
- K.2 ORC Client 1.4.0
L Parquet Event Handler Client Dependencies
- L.1 Parquet Client 1.10.1
- L.2 Parquet Client 1.9.0
M Velocity Dependencies
- M.1 Velocity 1.7
N OCI Dependencies
- N.1 OCI 1.13.2
- N.2 OCI: Proxy Settings Dependencies
O JMS Dependencies
- O.1 JMS 8.0