Stage and merge data into Autonomous AI Lakehouse using OCI GoldenGate

This quickstart guides you on how to stage and merge data from Autonomous AI Transaction Processing to Autonomous AI Lakehouse using an OCI GoldenGate Big Data deployment.

Before you begin

You must have the following in order to proceed:

Environment set up: Autonomous AI Databases

  1. Download and unzip the sample database schema.

  2. Set up the source Autonomous AI Transaction Processing:

    1. In the Oracle Cloud console, select your Autonomous AI Transaction Processing instance from the Autonomous AI Databases page to view its details and access Database Actions.

    2. Select Database Actions.

    3. Enable the GGADMIN user:

      1. Under Administration, select Database Users.

      2. Locate GGADMIN and then select its ellipsis menu (three dots) and select Edit.

      3. In the Edit User panel, enter the GGADMIN password, confirm the password, and then disable Account is Locked.

      4. Select Apply Changes.

    4. Load the load the source sample schema and data:

      1. From the Database Actions Selector menu, under Development, select SQL.

      2. Copy and paste the script from OCIGGLL_OCIGGS_SETUP_USERS_ATP.sql into the SQL worksheet.

      3. Select Run Script. The Script Output tab displays confirmation messages.

      4. Clear the SQL worksheet and then copy and paste the SQL script from OCIGGLL_OCIGGS_SRC_USER_SEED_DATA.sql.

      Tip: You may need to run each statement separately for the SQL tool to execute the scripts successfully.

    5. To verify that the tables were created successfully, close the SQL window and reopen it again. In the Navigator tab, look for the SRC_OCIGGLL schema and then select tables from their respective dropdowns.

    6. Enable supplemental logging:

      1. Clear the SQL Worksheet.

      2. Enter the following statement, and then select Run Statement:

        ALTER PLUGGABLE DATABASE ADD SUPPLEMENTAL LOG DATA;
  3. Set up the target Autonomous AI Lakehouse:

    1. In the Oracle Cloud console, select your Autonomous AI Lakehouse instance from the Autonomous AI Databases page to view its details and access DB tools.

    2. Select Database Actions.

    3. In the Database Actions menu, under Development, select SQL.

    4. Copy and paste the script from previously downloaded OCIGGLL_OCIGGS_SETUP_USERS_ADW.sql into the SQL worksheet.

    5. Select Run Script. The Script Output tab displays confirmation messages.

    6. Clear the SQL worksheet and then copy and paste the SQL script from OCIGGLL_OCIGGS_SRC_MIRROR_USER_SEED_DATA.sql

    7. Select Run Script.

Task 1: Create OCI GoldenGate resources

This quickstart example requires deployments and connections for both the source and target.

  1. Create an Oracle deployment for the source Autonomous AI Transaction Processing instance.

  2. Create a Big Data deployment for the target Autonomous AI Lakehouse.

  3. Create a connection for the source Autonomous AI Transaction Processing instance.

  4. Create a connection for the target Autonomous AI Lakehouse instance.

  5. Create connection for Oracle Object Storage.

  6. Create a connection to GoldenGate, and then assign this connection to the source Oracle deployment.

  7. Assign the Autonomous AI Transaction Processing connection to the source Oracle deployment.

  8. Assign the Autonomous AI Lakehouse connection the target Big Data deployment.

  9. Assign the Oracle Object Storage connection to the target Big Data deployment.

Task 2: Add the Extract

  1. On the Deployments page, select the source Autonomous AI Transaction Processing deployment.

  2. On the deployment details page, select Launch Console.

  3. Log in with the source deployment's administrator username and password.

  4. Add an Extract.

Task 3: Add and run a Distribution Path

  1. If using GoldenGate credential store, create a user for the Distribution Path in the target Big Data deployment, otherwise skip to Step 3.

  2. In the source GoldenGate deployment console, add a Path Connection for the user created in Step 1.

    1. In the source GoldenGate deployment console, select Path Connections in the left navigation.

    2. Select Add Path Connection (plus icon), and then complete the following:

      1. For Credential Alias, enter GGSNetwork.

      2. For User ID, enter the name of the user created in Step 1.

      3. Enter the user's password twice for verification.

    3. Select Submit.

      The path connection appears in the Path Connections list.

  3. In the source deployment console, add a Distribution Path with the following values:

    1. On the Source Options page:

      • For Source Extract, select the Extract created in Task 2.

      • For Trail Name, enter a two-character name, such as E1.

    2. On the Target Options page:

      • For Target Host, enter the host domain of the target deployment.

      • For Port Number, enter 443.

      • For Trail Name, enter a two-character name, such as E1.

      • For Alias, enter the Credential Alias created in Step 2.

  4. In the target Big Data deployment console, review the Receiver Path created as a result of the Distribution Path.

    1. In the target Big Data deployment console, select Receiver Service.

    2. Review the path details. This path was created as a result of the Distribution Path created in the previous step.

Task 4: Add and run the Replicat

  1. In the target Big Data deployment console, select Administrator Service, and then select Add Replicat (plus icon).

  2. Add a Replicat with the following values:

    1. On the Replicat Information page, under Replicat type, select Classic Replicat, and enter a Process Name.

    2. On the Replicat Options page:

      • For Name, enter the name of the Trail from Task 2.

      • For Domain, select a domain.

      • For Alias, select the Oracle Object Storage connection and the Autonomous AI Lakehouseconnection created in Task 1.

      • For Checkpoint Table, select the checkpoint table you created for the target deployment.

    3. On the Managed Options page, leave the fields as they are, and select Next.

    4. On the Replicat Parameters page, change the MAP line to the following:

      MAP SRC_OCIGGLL.*, TARGET SRCMIRROR_OCIGGLL.*;
  3. On the Properties page, configure the following properties:

    1. gg.eventhandler.oci.compartmentID: Add the OCID of the compartment in which the Oracle Object Storage bucket is stored.

    2. gg.eventhandler.oci.bucketMappingTemplate: Add the name of theOracle Object Storage bucket.

  4. Select Create and Run.

Task 5: Verify the replication

  1. In the Oracle Cloud console, from the navigation menu, select Oracle AI Database, and then select Autonomous AI Transaction Processing.

  2. In the list of Autonomous AI Transaction Processing instances, select your source instance to view its details.

  3. On the database details page, select Database Actions.

    Note: You should be automatically logged in. If not, log in with the database credentials.

  4. On the Database Actions home page, select SQL.

  5. Enter the following into the worksheet and select Run Script.

  6. In the source GoldenGate OCI GoldenGate deployment console, select the Extract name, and then select Statistics. Verify that SRC_OCIGGLL.SRC_CUSTOMER has 7 inserts.

    Insert into SRC_OCIGGLL.SRC_CUSTOMER (CUSTID,DEAR,LAST_NAME,FIRST_NAME,ADDRESS,CITY_ID,PHONE,AGE,SALES_PERS_ID) values (1001,0,'Brendt','Paul','10 Jasper Blvd.',107,'(212) 555 2146',19,10);
    Insert into SRC_OCIGGLL.SRC_CUSTOMER (CUSTID,DEAR,LAST_NAME,FIRST_NAME,ADDRESS,CITY_ID,PHONE,AGE,SALES_PERS_ID) values (1002,0,'McCarthy','Robin','27 Pasadena Drive',11,'(214) 555 3075',29,11);
    Insert into SRC_OCIGGLL.SRC_CUSTOMER (CUSTID,DEAR,LAST_NAME,FIRST_NAME,ADDRESS,CITY_ID,PHONE,AGE,SALES_PERS_ID) values (1003,0,'Travis','Peter','7835 Hartford Drive',12,'(510) 555 4448',34,12);
    Insert into SRC_OCIGGLL.SRC_CUSTOMER (CUSTID,DEAR,LAST_NAME,FIRST_NAME,ADDRESS,CITY_ID,PHONE,AGE,SALES_PERS_ID) values (1004,0,'Larson','Joe','87 Carmel Blvd.',13,'(213) 555 5095',45,13);
    Insert into SRC_OCIGGLL.SRC_CUSTOMER (CUSTID,DEAR,LAST_NAME,FIRST_NAME,ADDRESS,CITY_ID,PHONE,AGE,SALES_PERS_ID) values (1005,0,'Goldschmidt','Tony','91 Torre drive',14,'(619) 555 6529',55,20);
    Insert into SRC_OCIGGLL.SRC_CUSTOMER (CUSTID,DEAR,LAST_NAME,FIRST_NAME,ADDRESS,CITY_ID,PHONE,AGE,SALES_PERS_ID) values (1006,0,'Baker','William','2890 Grant Avenue',15,'(312) 555 7040',64,21);
    Insert into SRC_OCIGGLL.SRC_CUSTOMER (CUSTID,DEAR,LAST_NAME,FIRST_NAME,ADDRESS,CITY_ID,PHONE,AGE,SALES_PERS_ID) values (1007,0,'Swenson','Jack','64 Imagination Drive',19,'(202) 555 8125',74,22);
  7. In the target Big Data deployment console, select the Replicat name, and then select Statistics. Verify that SRC_OCIGGLL.SRC_CUSTOMER has 7 inserts.

  8. In target Autonomous AI Lakehouse Cloud SQL console, execute the following command to validate the data replicated:

    select * from SRCMIRROR_OCIGGLL.SRC_CUSTOMER;