Add a Replicat for Databricks
Learn to add and configure a Replicat process for a Databricks target.
Before you begin
Before you add the Replicat, ensure that you have the following:
-
A Databricks connection created and assigned to your target Big Data deployment.
-
Ensure that you review prerequisites specific to this target type.
Add a Replicat
-
In the OCI GoldenGate deployment console navigation menu, select Replicat.
-
On the Replicat page, select Add Replicat.
-
In the Add Replicat panel, on the Replicat Information page, complete the fields as needed, and then select Next:
-
For Replicat Type, select Classic Replicat.
-
Enter a Process Name, no more than 5 characters long.
-
Enter a Description, to help distinguish this process from others.
-
-
On the Replicat Options page, complete the fields as needed, and then select Next:
-
For Name, enter the name of the Trail from Task 2.
-
For Target, select Databricks.
-
For Available Aliases, select your Databricks connection.
-
ForAvailable Staging Location, select Azure Datalake Storage.
-
For via staging alias, select your Azure Datalake Storage connection.
-
-
On the Managed Options page, leave the fields as is, and then select Next.
-
On the Parameter File page, replace
MAP *.*, TARGET *.*;with the following, and then select Next:MAP SRC_OCIGGLL.SRC_CUSTOMER, TARGET <target_catalog_name>.<target_schema_name>.SRC_CUSTOMER -
On the Properties File page, configure the File Handler and OCI Event Handler properties as needed. Some properties to consider modifying include:
-
Provide the target Azure Datalake Storage container name in
gg.eventhandler.abs.bucketMappingTemplate. -
Add
gg.handler.databricks.fileRollInterval=5s.
For information on this target's properties, see Databricks in the Oracle GoldenGate for Distributed Applications and Analytics guide.
-
-
Select Create and Run. If you select Create, then you can manually start the Replicat later from the Replicats page.