Hosted Applications and Deployments
OCI Generative AI applications define a managed runtime for Generative AI workloads. In an application, you set scaling, storage, networking, and authentication settings that are used by hosted deployments associated with the application.
Applications define a managed runtime for Generative AI workloads. Set scaling, storage, networking, and authentication for how the application runs and how clients access it.
After you create an application, you create one or more deployments by selecting a container image. The active deployment serves requests for the application endpoint.
When you deploy an application, the service provisions an endpoint that clients (or other agents) can call.