Internal Sources
Oracle AI Data Platform Workbench supports ingestion from internal Oracle sources using built-in ingestion connectors. These connectors enable users to seamlessly extract data using Spark-based notebooks and integrate it into their workflows and data pipelines.
Ingestion connectors abstract the complexities of connection setup, providing optimized access patterns for both batch and near real-time ingestion from Oracle-native services.
AI Data Platform Workbench provides sample code templates in the Oracle AI Data Platform Workbench Samples Git repository to support ingesting data from several internal sources using Spark in notebooks.
Table 14-1 Internal Sources
| Source | Access Type | Integration Method | Decription | External Catalog Support | Sample Code Available |
|---|---|---|---|---|---|
| Fusion | Extract Only | Preconfigured Spark Templates | Extracts data from Fusion SaaS applications via BICC into AI Data Platform Workbench tables or volumes. | No | Yes |
| REST Endpoints | Read Only | JDBC via Spark Notebook | Reads from APIs for ingesting semi-structured data like JSON. | No | Yes |
| MySQL HeatWave | Read Only | JDBC via Spark Notebook | Move data between AI Data Platform Workbench and MySQL HeatWave using JDBC. | No | Yes |
| Oracle Autonomous AI Lakehouse | Read/Write + Zero-Copy | JDBC or External Catalog | Ingest from or register Oracle Autonomous AI Lakehouse as an external catalog for querying data directly without duplication. | Yes | Yes |
| Oracle Autonomous AI Transaction Processing | Read/Write + Zero-Copy | JDBC or External Catalog | Ingest from or register as an external catalog for querying data directly without duplication. | Yes | Yes |
| Oracle Database | Read/Write | JDBC or External Catalog | Supports data ingestion from on-prem or OCI databases. | Yes | Yes |
| Exadata | Read/Write | JDBC or External Catalog | Access Exadata systems for high-performance reads and writes using JDBC. | No | Yes |
Table 14-2 Spark SQL to , Oracle Autonomous AI Database, Exadata Data Type mapping
| Spark SQL Type | Oracle AI Database, Oracle Autonomous AI Database, Exadata Data Type |
|---|---|
| ByteType | NUMBER(38,10) |
| ShortType | NUMBER(38,10) |
| IntegerType (INT) | NUMBER(38,10) |
| LongType | NUMBER(38,10) |
| FloatType | FLOAT(126) |
| DoubleType | NUMBER(38,10) |
| DecimalType(p,s) | NUMBER(p,s) |
| StringType | VARCHAR2(4000 CHAR) |
| BinaryType | BLOB |
| BooleanType | VARCHAR2(4000 CHAR) |
| DateType | DATE |
| TimestampType | TIMESTAMP(6) |
| ArrayType | VARCHAR2(4000 CHAR) |
| MapType | Not supported |
| StructType | VARCHAR2(4000 CHAR) |
| CalendarIntervalType | Supported if converted to String/VARCHAR2 |