Walkthrough for Deploying Agents in OCI Generative AI

Learn how to deploy agents in OCI Generative AI.

Use Generative AI Applications to deploy agents as managed containerized applications in OCI Generative AI.

With Generative AI Applications, you build a container image, upload it to Oracle Cloud Infrastructure Registry (OCIR), and deploy that image as a Generative AI Application by using the OCI Console, API, or CLI.
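The build-and-upload part of this workflow can be sketched in Python. This is a minimal illustration, not the documented procedure: it assumes Docker is the build tool, that you have already run `docker login` against the registry, and that the image reference follows the common Registry pattern `<region-key>.ocir.io/<tenancy-namespace>/<repo>:<tag>`. The region key, namespace, repository name, and tag are hypothetical placeholders. The deployment step itself is not shown, because it depends on the Console, API, or CLI operation that you use.

```python
import shlex


def ocir_image_ref(region_key: str, namespace: str, repo: str, tag: str) -> str:
    """Compose an image reference, assuming the pattern
    <region-key>.ocir.io/<tenancy-namespace>/<repo>:<tag>."""
    return f"{region_key}.ocir.io/{namespace}/{repo}:{tag}"


def build_and_push_commands(image_ref: str, context_dir: str = ".") -> list[list[str]]:
    """Return the Docker commands that build the image and push it to the registry.

    The commands are returned rather than executed so that they can be
    reviewed first, or passed to subprocess.run(cmd, check=True).
    """
    return [
        ["docker", "build", "-t", image_ref, context_dir],
        ["docker", "push", image_ref],
    ]


if __name__ == "__main__":
    # All four values below are hypothetical placeholders.
    ref = ocir_image_ref("iad", "mytenancynamespace", "agents/my-agent", "1.0.0")
    for cmd in build_and_push_commands(ref):
        print(shlex.join(cmd))
```

After the image is in the registry, you reference it when you create the Generative AI Application.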

When you deploy an agent, you can configure how the application runs and how clients access it, including:

  • Scaling
  • Storage
  • Networking
  • Authentication

After the deployment is provisioned, OCI Generative AI provides an endpoint (for example, an HTTP URL) that clients call to invoke the deployed agent.
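As a rough sketch of what invoking such an endpoint might look like from a client, the following Python example builds and sends an HTTP POST request by using only the standard library. Everything specific here is an assumption: the endpoint URL is a placeholder, the JSON body shape (`{"input": ...}`) is hypothetical because the request format is defined by your own containerized agent, and the `Authorization` header value depends on the authentication option that you configured for the application.

```python
import json
import urllib.request
from typing import Optional


def build_invoke_request(
    endpoint_url: str,
    prompt: str,
    auth_header: Optional[str] = None,
) -> urllib.request.Request:
    """Build a POST request for a deployed agent endpoint.

    The {"input": ...} payload is a hypothetical schema; use whatever
    request format your agent container actually accepts.
    """
    body = json.dumps({"input": prompt}).encode("utf-8")
    headers = {"Content-Type": "application/json"}
    if auth_header:
        # Assumption: the endpoint accepts credentials in an Authorization header.
        headers["Authorization"] = auth_header
    return urllib.request.Request(
        endpoint_url, data=body, headers=headers, method="POST"
    )


def invoke(request: urllib.request.Request, timeout: float = 30.0) -> str:
    """Send the request and return the response body as text."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8")
```

To call a real deployment, pass the endpoint URL shown for your application to `build_invoke_request`, then pass the result to `invoke`.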

Deploy an agent this way when you have a containerized agent application and want OCI to manage its runtime, deployment configuration, and endpoint provisioning.

For more information, see the topics about Generative AI Applications and deploying containerized agent applications.