Creating an Endpoint
Create a public or a private endpoint for a custom or pretrained model on a hosting dedicated AI cluster in OCI Generative AI.
Important
To add a model to a private endpoint, first create a private endpoint and then perform the following task.
To add a model to a private endpoint, first create a private endpoint and then perform the following task.
On the Endpoints list page, select Create endpoint. If you need help finding the list page, see Listing Endpoints.
Endpoint Information
Hosting configuration
Networking resources
Select one of the following options:- Public endpoint
- Private endpoint: If you select this option, then select the compartment for the private endpoint, and then the private endpoint that you want to use.
Guardrails
Use the endpoint create command and required parameters to create an endpoint:
oci generative-ai endpoint create --model-id <model-OCID> --compartment-id <compartment-OCID> --dedicated-ai-cluster-id <hosting-dedicated-AI-cluster-OCID> [OPTIONS]
For a complete list of parameters and values for CLI commands, see the CLI Command Reference.
Note
For pretrained models, instead of an OCID, you can use the model name exactly as listed in the Console's playground. You can also find this OCI model name, in the model's detail page in Pretrained Foundational Models in Generative AI.Run the CreateEndpoint operation to create an endpoint.
Note
For pretrained models, instead of an OCID, you can use the model name exactly as listed in the Console's playground. You can also find this OCI model name, in the model's detail page in Pretrained Foundational Models in Generative AI.