OCI Generative AI Overview
OCI Generative AI is a fully managed Oracle Cloud Infrastructure service for building, deploying, and operating generative AI applications at enterprise scale.
Use OCI Generative AI to work with pretrained and custom models, build production-grade agents, and apply enterprise governance controls across access, networking, and AI behavior. The service supports core generative AI tasks such as chat, embeddings, rerank, and OpenAI-compatible APIs, while also providing enterprise capabilities for tools, memory, retrieval, and hosted agentic applications.
This documentation is organized around three main areas:
- Enterprise AI Models
- Enterprise AI Agents
- Enterprise AI Governance
Enterprise AI Models
Use Enterprise AI Models when you want model access for inference tasks such as conversational generation, semantic search, recommendation, classification, clustering, and relevance ranking.
OCI Generative AI supports:
- Chat for conversational experiences such as question answering and virtual assistants
- Embeddings for semantic search, recommendation, classification, and clustering
- Rerank for ordering documents by relevance to a query
- OpenAI-compatible APIs for integrating with existing tools and SDKs
You can use OCI Generative AI models in several ways. You can call pretrained hosted models through the Console, API, or CLI. You can also import, fine-tune, and host custom models on dedicated AI clusters. This gives you a path from experimentation to production with enterprise controls and deployment flexibility.
Enterprise AI Agents
Use Enterprise AI Agents when you want to build production-grade agentic applications that combine models with tools, memory, retrieval, and orchestration.
OCI Generative AI provides two main approaches for building agents:
- Build agents with the OCI Responses API
- Deploy hosted agentic applications in OCI Generative AI
You can also combine these approaches in a hybrid architecture.
The OCI Responses API is the primary API for agentic workflows. It is OpenAI-compatible and supports model interaction, orchestration, reasoning, conversation state, and tool use. Supported tools include File Search, Code Interpreter, Function Calling, and MCP Calling. Agents can also use supporting resources such as Files, Vector Stores, Containers, Conversations, Projects, and memory features such as long-term memory and short-term memory compaction.
OCI Generative AI also supports SQL Search (NL2SQL) for agent workflows that need structured enterprise data access. NL2SQL converts natural-language requests into validated SQL by using semantic enrichment and structured data metadata.
For teams that want OCI-managed hosting for custom runtimes, OCI Generative AI provides Applications and Deployments for hosted agentic applications. This model supports container-based deployment, managed infrastructure, networking, storage integration, and identity configuration.
Enterprise AI Governance
Use Enterprise AI Governance when you need to secure and control how generative AI resources are accessed, deployed, and used.
Enterprise AI governance in OCI Generative AI combines infrastructure, identity, network security, and runtime controls to help keep AI systems secure, compliant, and aligned with organizational policy.
Key governance capabilities include:
- IAM Policies for controlling who can access, use, and manage OCI Generative AI resources
- Private Endpoints for keeping model access within a secure network boundary
- API Keys for accessing OCI Generative AI services
- OAuth for agentic tasks that require OCI IAM identity-domain application integration
- Zero Trust Packet Routing (ZPR) for identity-based network enforcement
- Guardrails for applying runtime safety and compliance controls to model inputs and outputs
Together, these capabilities provide end-to-end governance across access control, network security, hosted application configuration, and AI behavior.
How These Areas Work Together
These three areas work together as part of one platform:
- Enterprise AI Models provide the model foundation for inference and generation
- Enterprise AI Agents add orchestration, tools, memory, retrieval, and hosted execution
- Enterprise AI Governance applies the security, access, and compliance controls needed for enterprise use
This combination lets you move from model access, to agentic application development, to production deployment with governance built in.