Managing Private Endpoints

To use an OCI Generative AI custom model for inference, you must first create an endpoint for that model. You can also create endpoints for the pretrained models that are available in the playground.

A private endpoint is a private IP address within a VCN that you can use to access a specific service in OCI. The service sets up the private endpoint in a subnet of your choice within the VCN. Think of the private endpoint as another VNIC in the VCN. You control access to it with security rules, as you would for any other VNIC. The service creates this VNIC and maintains its availability for you. You only need to manage the subnet and its security rules. For more information, see About Private Endpoints.
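The access model described above can be illustrated with a small, self-contained sketch. It does not call any OCI API. The `IngressRule` class and `is_allowed` function are hypothetical, simplified stand-ins for a security rule and its evaluation, and the CIDR blocks, IP addresses, and port 443 are assumed example values. The sketch only shows the idea that the endpoint's private IP sits inside the subnet you chose and that security rules decide which sources can reach it.

```python
import ipaddress
from dataclasses import dataclass


@dataclass(frozen=True)
class IngressRule:
    """Hypothetical, simplified stand-in for a TCP ingress security rule."""
    source_cidr: str
    port_min: int
    port_max: int


def is_allowed(rules, client_ip, port):
    """Return True if any rule admits TCP traffic from client_ip to the given port."""
    ip = ipaddress.ip_address(client_ip)
    return any(
        ip in ipaddress.ip_network(rule.source_cidr)
        and rule.port_min <= port <= rule.port_max
        for rule in rules
    )


# Assumed example values, not taken from any real tenancy.
subnet = ipaddress.ip_network("10.0.1.0/24")     # private subnet chosen for the endpoint
endpoint_ip = ipaddress.ip_address("10.0.1.25")  # private IP of the endpoint's VNIC
rules = [IngressRule("10.0.0.0/16", 443, 443)]   # allow HTTPS only from inside the VCN

print(endpoint_ip in subnet)                  # the endpoint lives in the chosen subnet
print(is_allowed(rules, "10.0.2.7", 443))     # client inside the VCN
print(is_allowed(rules, "203.0.113.9", 443))  # client outside the VCN
```

With these assumed values, the client inside the VCN is admitted and the client outside it is not, which mirrors how you would restrict access to any other VNIC in the subnet.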

Important

To access a Generative AI model through a private endpoint, host the model on a dedicated AI cluster and attach the cluster's endpoint to a Generative AI private endpoint.

Tip

Before creating a private endpoint, complete the Prerequisites for Private Endpoints and set up a virtual cloud network (VCN) with a private subnet.

You can perform the following tasks to create and manage endpoints for custom models:

Learn about private endpoints in OCI.