1 About Oracle Private AI Services Container
The Private AI Services Container is a lightweight, containerized web service that allows you to offload expensive AI computation, such as vector embedding generation and vector index creation, outside of the database. This container can run in your data center or on compute nodes in the public cloud.
Offloading these computations can free up database compute resources that can then be used for other tasks, such as similarity search. There are currently two services provided by the Private AI Services Container:
Vector Embedding Service
The vector embedding service allows you to securely generate vector embeddings outside the database while still storing them directly in your Oracle AI Database. You can use the same embedding models that you would use inside the database, allowing you to seamlessly integrate the use of embeddings generated in the container with those generated in the database for similarity search operations.
To implement the embedding service, the container can be called from the database using the UTL_TO_EMBEDDING and UTL_TO_EMBEDDINGS functions of the DBMS_VECTOR PL/SQL package. The container can also be called by REST clients such as curl, or by clients that use the OpenAI SDK. Internet access is not needed to use the container.
For information about the implementation of the vector embedding service, see Use the Vector Embedding Service.
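As a rough illustration of the REST client path, the following Python sketch builds an embeddings request in the style used by OpenAI-compatible APIs and parses a response of the same style. It is a minimal sketch under stated assumptions, not the container's documented interface: the `/v1/embeddings` path, the bearer-token header, the host and port, the model name, and the helper names `build_embedding_request` and `parse_embeddings` are all hypothetical. Consult Use the Vector Embedding Service for the actual endpoint and parameters.

```python
import json
import urllib.request


def build_embedding_request(base_url, api_key, model, texts):
    """Build (but do not send) an OpenAI-style embeddings request.

    Assumptions: the "/v1/embeddings" path and the Bearer authorization
    header follow the common OpenAI-compatible convention; they are not
    taken from the container's documentation.
    """
    payload = {"model": model, "input": list(texts)}
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/embeddings",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": "Bearer " + api_key,
        },
        method="POST",
    )


def parse_embeddings(response_body):
    """Return the vectors from an OpenAI-style response, ordered by index."""
    doc = json.loads(response_body)
    items = sorted(doc["data"], key=lambda item: item["index"])
    return [item["embedding"] for item in items]


if __name__ == "__main__":
    # Hypothetical host, key, and model name, for illustration only.
    req = build_embedding_request(
        "http://localhost:8080", "my-api-key", "my-embedding-model", ["hello world"]
    )
    print(req.get_method(), req.full_url)
    # To actually send the request against a running container:
    # with urllib.request.urlopen(req) as resp:
    #     vectors = parse_embeddings(resp.read())
```

Keeping request construction separate from sending makes the payload easy to inspect before it is pointed at a real container, and the same payload shape is what an OpenAI SDK client would typically produce.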
Vector Index Service
The vector index service gives you the ability to create Hierarchical Navigable Small World (HNSW) vector indexes faster by offloading the creation to an NVIDIA GPU. The high memory bandwidth and parallel processing of a GPU can produce significant time savings.
The vector index service is implemented using parameters of the CREATE VECTOR INDEX DDL to define the REST endpoint and the API key for the Private AI Services Container.
For more information about the implementation of the vector index service, see Use the Vector Index Service.
Note:
User data is not stored and is only processed transiently. All requests to the container are stateless.
Note:
Support may be available for ancillary use of this program in conjunction with a supported Oracle product, to the extent described in that product's documentation. If support is provided it will be in accordance with Oracle’s technical support policies which may be found at https://www.oracle.com/support/policies/.
For licensing information related to Oracle Private AI Services Container, see Licensing Information User Manual for Oracle Private AI Services Container.