Internal Sources

Oracle AI Data Platform supports ingestion from internal Oracle sources using built-in ingestion connectors. These connectors enable users to seamlessly extract data using Spark-based notebooks and integrate it into their workflows and data pipelines.

Ingestion connectors abstract the complexities of connection setup, providing optimized access patterns for both batch and near real-time ingestion from Oracle-native services.

AI Data Platform provides sample code templates, packaged as a ZIP file, for ingesting data from several internal sources using Spark in notebooks.

Table 14-1 Internal Sources

Source	Access Type	Integration Method	Description	External Catalog Support	Sample Code Available
Fusion	Extract Only	Preconfigured Spark Templates	Extracts data from Fusion SaaS applications via BICC into AI Data Platform tables or volumes.	No	Yes
REST Endpoints	Read Only	JDBC via Spark Notebook	Reads from APIs for ingesting semi-structured data like JSON.	No	Yes
MySQL HeatWave	Read Only	JDBC via Spark Notebook	Move data between AI Data Platform and MySQL HeatWave using JDBC.	No	Yes
Autonomous Data Warehouse (ADW)	Read/Write + Zero-Copy	JDBC or External Catalog	Ingest from or register ADW as an external catalog for querying data directly without duplication.	Yes	Yes
Autonomous Transaction Processing (ATP)	Read/Write + Zero-Copy	JDBC or External Catalog	Ingest from or register ATP as an external catalog for querying data directly without duplication.	Yes	Yes
Oracle DB	Read/Write	JDBC or External Catalog	Supports data ingestion from on-prem or OCI Oracle Databases.	Yes	Yes
Exadata	Read/Write	JDBC or External Catalog	Access Exadata systems for high-performance reads and writes using JDBC.	No	Yes
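
Several of the sources in Table 14-1 are reached over JDBC from a Spark notebook. The following is a minimal sketch of that pattern using Spark's generic JDBC data source, not the platform's own sample templates. The helper names `oracle_jdbc_options` and `read_oracle_table`, the host, the service name, and the fetch size are illustrative assumptions, and the sketch assumes the Oracle JDBC driver is already available to the notebook's Spark session. Autonomous Database connections typically need wallet or TLS settings that are not shown here.

```python
from typing import Dict


def oracle_jdbc_options(host: str, port: int, service_name: str, table: str,
                        user: str, password: str,
                        fetch_size: int = 1000) -> Dict[str, str]:
    """Build the option map for Spark's generic JDBC data source.

    All argument values are placeholders supplied by the caller; this helper
    is hypothetical and is not part of AI Data Platform.
    """
    return {
        # Oracle thin-driver URL in host:port/service_name form.
        "url": f"jdbc:oracle:thin:@//{host}:{port}/{service_name}",
        "dbtable": table,
        "user": user,
        "password": password,
        "driver": "oracle.jdbc.OracleDriver",
        # Rows fetched per round trip; Spark expects option values as strings.
        "fetchsize": str(fetch_size),
    }


def read_oracle_table(spark, options: Dict[str, str]):
    """Read one table into a DataFrame.

    `spark` is assumed to be the SparkSession that the notebook provides.
    """
    return spark.read.format("jdbc").options(**options).load()
```

In a notebook, the two helpers might be combined as `df = read_oracle_table(spark, oracle_jdbc_options(...))`. In practice, credentials would come from a secret store rather than literals in the notebook.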

Table 14-2 Spark SQL to Oracle Database, Autonomous Database, and Exadata Data Type Mapping

Spark SQL Type	Oracle Database, Autonomous Database, Exadata Data Type
ByteType	NUMBER(38,10)
ShortType	NUMBER(38,10)
IntegerType (INT)	NUMBER(38,10)
LongType	NUMBER(38,10)
FloatType	FLOAT(126)
DoubleType	NUMBER(38,10)
DecimalType(p,s)	NUMBER(p,s)
StringType	VARCHAR2(4000 CHAR)
BinaryType	BLOB
BooleanType	VARCHAR2(4000 CHAR)
DateType	DATE
TimestampType	TIMESTAMP(6)
ArrayType	VARCHAR2(4000 CHAR)
MapType	Not supported
StructType	VARCHAR2(4000 CHAR)
CalendarIntervalType	Supported if converted to String/VARCHAR2
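
As a quick way to check a schema against Table 14-2 before writing, the mapping can be expressed as a lookup. This is a sketch that transcribes the table as given; the names `SPARK_TO_ORACLE` and `oracle_type_for` are hypothetical, and the function works on Spark type names as plain strings rather than on Spark type objects.

```python
import re
from typing import Optional

# Table 14-2 as a dictionary. None marks a type with no direct mapping:
# MapType is not supported, and CalendarIntervalType must be converted
# to a string first (after which the StringType mapping applies).
SPARK_TO_ORACLE = {
    "ByteType": "NUMBER(38,10)",
    "ShortType": "NUMBER(38,10)",
    "IntegerType": "NUMBER(38,10)",
    "LongType": "NUMBER(38,10)",
    "FloatType": "FLOAT(126)",
    "DoubleType": "NUMBER(38,10)",
    "StringType": "VARCHAR2(4000 CHAR)",
    "BinaryType": "BLOB",
    "BooleanType": "VARCHAR2(4000 CHAR)",
    "DateType": "DATE",
    "TimestampType": "TIMESTAMP(6)",
    "ArrayType": "VARCHAR2(4000 CHAR)",
    "MapType": None,
    "StructType": "VARCHAR2(4000 CHAR)",
    "CalendarIntervalType": None,
}

# DecimalType carries its own precision and scale, e.g. "DecimalType(10,2)".
_DECIMAL = re.compile(r"DecimalType\((\d+),\s*(\d+)\)")


def oracle_type_for(spark_type: str) -> Optional[str]:
    """Return the Oracle column type for a Spark SQL type name.

    Returns None when the table lists no direct mapping, and raises
    KeyError for a name that does not appear in the table at all.
    """
    match = _DECIMAL.fullmatch(spark_type)
    if match:
        precision, scale = match.groups()
        return f"NUMBER({precision},{scale})"
    return SPARK_TO_ORACLE[spark_type]
```

For example, `oracle_type_for("DecimalType(10,2)")` returns `NUMBER(10,2)`, while `oracle_type_for("MapType")` returns `None`, signalling that the column needs to be dropped or converted before it is written.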