1 About Oracle Private AI Services Container

The Private AI Services Container is a lightweight, containerized web service that allows you to offload expensive AI computation, such as vector embedding generation and vector index creation, outside of the database. This container can run in your data center or on compute nodes in the public cloud.

Offloading these computations can free up database compute resources that can then be used for other tasks, such as similarity search. There are currently two services provided by the Private AI Services Container:

Vector Embedding Service

The vector embedding service allows you to securely generate vector embeddings outside the database while still storing them directly in your Oracle AI Database. You can use the same embedding models that you would use inside the database, allowing you to seamlessly integrate the use of embeddings generated in the container with those generated in the database for similarity search operations.

To implement the embedding service, the container can be called from the database using the UTL_TO_EMBEDDING and UTL_TO_EMBEDDINGS procedures of the DBMS_VECTOR PL/SQL package. The container can also be called by REST clients such as curl, or clients that use the OpenAI SDK. Internet access is not needed to use the container.

For information about the implementation of the vector embedding service, see Use the Vector Embedding Service.

Vector Index Service

The vector index service gives you the ability to create Hierarchical Navigable Small World (HNSW) vector indexes faster by offloading the creation to an NVIDIA GPU. The high memory bandwidth and parallel processing of a GPU can produce significant time savings.

The vector index service is implemented using parameters of the CREATE VECTOR INDEX DDL to define the REST endpoint and the API KEY for the Private AI Services Container.

For more information about the implementation of the vector index service, see Use the Vector Index Service.

Note:

User data is not stored and is only processed transiently. All requests to the container are stateless and the data is not stored.

Note:

Support may be available for ancillary use of this program in conjunction with a supported Oracle product, to the extent described in that product's documentation. If support is provided it will be in accordance with Oracle’s technical support policies which may be found at https://www.oracle.com/support/policies/.

For licensing information related to Oracle AI Private Services Container, see Licensing Information User Manual for Oracle Private AI Services Container.