Replicate data from Autonomous AI Transaction Processing to Azure Data Lake Storage Gen 2
This quickstart demonstrates how to replicate data from Autonomous AI Transaction Processing to Azure Data Lake Storage Gen 2 using OCI GoldenGate.
Before you begin
To successfully complete this quickstart, you must have the following:
-
A source Autonomous AI Transaction Processing instance
Environment set up: Autonomous AI Transaction Processing
If you don’t already have a source database set up for replication, you can follow these steps to load a sample schema to use for this quickstart. This quickstart uses Autonomous AI Transaction Processing for the source database.
To set up the source Autonomous AI Transaction Processing:
-
Download and unzip the sample database schema.
-
In the Oracle Cloud console, select your Autonomous AI Transaction Processing (ATP) instance from the Autonomous AI Databases page to view its details and access Database Actions.
-
Unlock the GGADMIN user:
-
Select Database actions, then select Database Users.
-
Locate GGADMIN and then select its ellipsis menu (three dots) and select Edit.
-
In the Edit User panel, enter the GGADMIN password, confirm the password, and then deselect Account is Locked.
-
Select Apply Changes.
-
-
Load the source sample schema and data:
-
From the Database actions menu, under Development, select SQL.
-
Copy and paste the script from
OCIGGLL_OCIGGS_SETUP_USERS_ATP.sqlinto the SQL worksheet. -
Select Run Script. The Script Output tab displays confirmation messages.
-
Clear the SQL worksheet and then copy and paste the SQL script from
OCIGGLL_OCIGGS_SRC_USER_SEED_DATA.sql.Tip: You may need to run each statement separately for the SQL tool to execute the scripts successfully.
-
To verify that the tables were created successfully, close the SQL window and reopen it again. In the Navigator tab, look for the
SRC_OCIGGLLschema and then select tables from their respective dropdowns.
-
-
Enable supplemental logging:
-
Clear the SQL Worksheet.
-
Enter the following statement, and then select Run Statement:
ALTER PLUGGABLE DATABASE ADD SUPPLEMENTAL LOG DATA;
-
Task 1: Create the OCI GoldenGate resources
This quickstart example requires deployments and connections for both the source and target.
-
Create an Oracle deployment for the source Autonomous AI Transaction Processing instance.
-
Create a Big Data deployment for the target Azure Data Lake Storage.
-
Create a connection for the source Autonomous AI Transaction Processing instance.
-
Create a connection to GoldenGate, and then assign this connection to the source Oracle deployment.
-
Assign the Autonomous AI Transaction Processing connection to the source Oracle deployment.
-
Assign Azure Data Lake Storage connection to the target Big Data deployment.
Task 2: Add the Extract
-
On the Deployments page, select the source Autonomous AI Transaction Processing deployment.
-
On the deployment details page, select Launch Console.
-
Log in with the source deployment's administrator username and password.
Task 3: Add and run the Distribution Path
-
If using GoldenGate credential store, create a user for the Distribution Path in the target Big Data deployment, otherwise skip to Step 3.
-
In the source GoldenGate deployment console, add a Path Connection for the user created in Step 1.
-
In the source GoldenGate deployment console, select Path Connections in the left navigation.
-
Select Add Path Connection (plus icon), and then complete the following:
-
For Credential Alias, enter
GGSNetwork. -
For User ID, enter the name of the user created in Step 1.
-
Enter the user's password twice for verification.
-
-
Select Submit.
The path connection appears in the Path Connections list.
-
-
In the source deployment console, add a Distribution Path with the following values:
-
On the Source Options page:
-
For Source Extract, select the Extract created in Task 2.
-
For Trail Name, enter a two-character name, such as
E1.
-
-
On the Target Options page:
-
For Target Host, enter the host domain of the target deployment.
-
For Port Number, enter
443. -
For Trail Name, enter a two-character name, such as
E1. -
For Alias, enter the Credential Alias created in Step 2.
-
-
-
In the target Big Data deployment console, review the Receiver Path created as a result of the Distribution Path.
-
In the target Big Data deployment console, select Receiver Service.
-
Review the path details. This path was created as a result of the Distribution Path created in the previous step.
-
Task 4: Add and run the Replicat
-
In the target Big Data deployment console navigation menu, select Replicats.
-
Add a Replicat with the following values:
-
On the Replicat Information page, under Replicat type, select Classic Replicat, enter a Process Name, and then select Next.
-
On the Replication Options page:
-
For Name, enter the name of the Trail from Task 2.
-
For Domain, select a domain.
-
For Alias, select the Azure Data Lake Storage connection created in Task 1.
-
For Checkpoint Table, select the checkpoint table you created for the target deployment.
-
-
On the Replicat Parameters page, replace
MAP *.*, TARGET *.*;with:MAP SRC_OCIGGLL.*, TARGET *.*; -
On the Properties page, configure Azure Data Lake Storage properties.
-
(Required)
gg.eventhandler.abs.bucketMappingTemplate: Name of the Azure Data Lake Storage Container. If container is pre-configured, a static container name can be provided. If Azure authentication method permissions are provided, Template Keywords can be used for auto container creation by OCI GoldenGate. -
(Optional)
gg.handler.abs.format: Select how to format the output. JSON is the default setting. Available options include:
-
-
-
Select Create and Run.
You return to the Overview page, where you can review the Replicat details.
Task 5: Verify the replication
Perform some updates to the source Autonomous AI Transaction Processing instance to verify replication to Azure Data Lake Storage.
-
In the Oracle Cloud console, open the navigation menu, select Oracle AI Database, and then select Autonomous AI Transaction Processing.
-
In the list of Autonomous AI Transaction Processing instances, select your source instance to view its details.
-
On the database details page, select Database actions.
Note: You should be automatically logged in. If not, log in with the database credentials.
-
On the Database actions home page, select SQL.
-
Enter the following into the worksheet and select Run Script.
Insert into SRC_OCIGGLL.SRC_CITY (CITY_ID,CITY,REGION_ID,POPULATION) values (1000,'Houston',20,743113); Insert into SRC_OCIGGLL.SRC_CITY (CITY_ID,CITY,REGION_ID,POPULATION) values (1001,'Dallas',20,822416); Insert into SRC_OCIGGLL.SRC_CITY (CITY_ID,CITY,REGION_ID,POPULATION) values (1002,'San Francisco',21,157574); Insert into SRC_OCIGGLL.SRC_CITY (CITY_ID,CITY,REGION_ID,POPULATION) values (1003,'Los Angeles',21,743878); Insert into SRC_OCIGGLL.SRC_CITY (CITY_ID,CITY,REGION_ID,POPULATION) values (1004,'San Diego',21,840689); Insert into SRC_OCIGGLL.SRC_CITY (CITY_ID,CITY,REGION_ID,POPULATION) values (1005,'Chicago',23,616472); Insert into SRC_OCIGGLL.SRC_CITY (CITY_ID,CITY,REGION_ID,POPULATION) values (1006,'Memphis',23,580075); Insert into SRC_OCIGGLL.SRC_CITY (CITY_ID,CITY,REGION_ID,POPULATION) values (1007,'New York City',22,124434); Insert into SRC_OCIGGLL.SRC_CITY (CITY_ID,CITY,REGION_ID,POPULATION) values (1008,'Boston',22,275581); Insert into SRC_OCIGGLL.SRC_CITY (CITY_ID,CITY,REGION_ID,POPULATION) values (1009,'Washington D.C.',22,688002); -
In the source ATP deployment console, select the Extract name, and then select Statistics. Verify that SRC_OCIGGLL.SRC_CITY has 10 inserts.
-
In the target Big Data OCI GoldenGate deployment console, select the Replicat name, and then select Statistics. Verify that SRC_OCIGGLL.SRC_CITY has 10 inserts.
-
In the Oracle Cloud console, navigate to the Oracle Object Storage bucket and check its contents.
-
In Azure console, navigate to Azure BLOB Storage container and check its contents.