Oracle AI Vector Search is designed for Artificial Intelligence (AI) workloads. It allows you to query data based on semantics and similarity, such as image similarity, rather than on keywords.

This diagram shows the steps you take to manage vector embeddings with Oracle AI Vector Search.

  1. You must generate vector embeddings from your unstructured data, which can reside within or outside Oracle Database. You can perform this step either outside or within Oracle Database. To perform it inside Oracle Database, you must first import a vector embedding model using the ONNX standard.
    See Generate Vector Embeddings.
  2. After you have generated the vector embeddings, you can store them along with the corresponding unstructured data and business relational data. If the vector embeddings are stored outside Oracle Database, you can use SQL*Loader or Data Pump to load them into a relational table inside Oracle Database. You can also access vector embeddings stored outside the database through external tables.
    See Store Vector Embeddings.
  3. Just as you create indexes on regular table columns, you can create vector indexes on vector embeddings, and you can create hybrid vector indexes (a combination of an Oracle Text index and a vector index) on your unstructured data. Vector indexes are beneficial for running similarity searches over huge vector spaces.
    See Create Vector Indexes and Hybrid Vector Indexes.
  4. You can then use Oracle AI Vector Search native SQL operations to combine similarity searches with traditional relational key searches. You can also run hybrid searches, an advanced information retrieval technique that combines similarity and keyword searches to achieve highly relevant results. In addition, SQL and PL/SQL provide powerful utilities to transform unstructured data, such as documents, into chunks before generating vector embeddings on each chunk.
    See Query Data with Similarity Searches and Hybrid Searches and Supported Clients and Languages.

  5. You can use vector utility PL/SQL APIs to prompt large language models (LLMs) with textual prompts and images through LLM-powered interfaces. LLMs inherently cannot access or incorporate new information after their training cut-off, so providing your LLM with up-to-date facts from your company minimizes the probability that it will make up answers (hallucinate). Retrieval-augmented generation (RAG) is an approach developed to address these limitations of LLMs. RAG combines the strengths of pre-trained language models, including reranking models, with the ability to retrieve information from a dataset or database in real time during the generation of responses. Oracle AI Vector Search enables RAG and LLM integration using popular frameworks such as LangChain, Ollama, and LlamaIndex.
    See Work with LLM-Powered APIs and Retrieval Augmentation Generation.
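Steps 1 through 4 above can be sketched end to end in plain Python, with no Oracle Database connection. The `embed` function below is a hypothetical, hash-based stand-in for a real embedding model (such as one imported in ONNX format), and the in-memory list stands in for a relational table with a vector column; only the overall chunk, embed, store, and search flow mirrors the steps described above.

```python
import hashlib
import math

DIM = 64  # toy embedding dimension; real models use hundreds or thousands


def embed(text: str) -> list[float]:
    """Hypothetical stand-in for a real embedding model: hashes each
    word into one of DIM buckets, then L2-normalizes the counts."""
    vec = [0.0] * DIM
    for word in text.lower().split():
        idx = int(hashlib.md5(word.encode()).hexdigest(), 16) % DIM
        vec[idx] += 1.0
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]


def cosine_distance(a: list[float], b: list[float]) -> float:
    # Both vectors are unit length, so cosine distance is 1 - dot product.
    return 1.0 - sum(x * y for x, y in zip(a, b))


# Steps 1-2: generate embeddings and store (chunk, embedding) pairs.
documents = [
    "Oracle Database stores relational and unstructured data.",
    "Vector indexes accelerate similarity search over embeddings.",
    "Invoices are due within thirty days of receipt.",
]
table = [(doc, embed(doc)) for doc in documents]


# Step 4: similarity search — rank stored chunks by distance to the query.
def similarity_search(query: str, k: int = 2) -> list[str]:
    q = embed(query)
    ranked = sorted(table, key=lambda row: cosine_distance(q, row[1]))
    return [doc for doc, _ in ranked[:k]]


# The vector-index sentence should rank first for this query.
print(similarity_search("similarity search with vector indexes"))
```

In Oracle Database itself, the stored embeddings would live in a `VECTOR` column, and the ranking step would be a SQL query ordered by a vector distance function, optionally accelerated by a vector index.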
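The RAG pattern in step 5 can likewise be sketched in plain Python: retrieve the stored chunks most relevant to a question, then splice them into the prompt sent to an LLM. The `retrieve` function and prompt template below are illustrative inventions, not Oracle APIs; a real pipeline would use vector similarity search for retrieval, and the final call to the model (for example, through LangChain, Ollama, or LlamaIndex) is omitted.

```python
def retrieve(question: str, corpus: list[str], k: int = 2) -> list[str]:
    """Toy retriever: ranks corpus chunks by word overlap with the
    question. A real RAG pipeline would use vector similarity search."""
    q_words = set(question.lower().split())
    scored = sorted(corpus, key=lambda c: -len(q_words & set(c.lower().split())))
    return scored[:k]


def build_prompt(question: str, context: list[str]) -> str:
    """Augment the question with retrieved, up-to-date facts so the
    LLM answers from the supplied context instead of hallucinating."""
    facts = "\n".join(f"- {c}" for c in context)
    return (
        "Answer using only the facts below.\n"
        f"Facts:\n{facts}\n"
        f"Question: {question}\n"
    )


# Up-to-date company facts the LLM could not have seen during training.
corpus = [
    "The refund window was extended to 90 days in March.",
    "Support tickets are answered within one business day.",
    "The warehouse relocated to Austin last quarter.",
]
question = "How long is the refund window?"
prompt = build_prompt(question, retrieve(question, corpus, k=1))
# At this point, `prompt` would be sent to an LLM via a framework
# such as LangChain, Ollama, or LlamaIndex.
print(prompt)
```

Because the retrieved facts are fetched at question time, the generated answer can reflect data that postdates the model's training cut-off, which is the central benefit of RAG described above.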