REBUILD_INDEX

Use the DBMS_VECTOR.REBUILD_INDEX procedure to rebuild a vector index.

Purpose

Rebuilds a vector index, such as a Hierarchical Navigable Small World (HNSW) vector index or an Inverted File Flat (IVF) vector index. If only idx_name is provided, the index is rebuilt from the DDL returned by get_ddl. If all the parameters are provided, the index is dropped and then recreated with a call to dbms_vector.create_index().
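
As a minimal sketch of the name-only form described above, the following anonymous block passes just idx_name and leaves every other parameter at its default. The index name v_hnsw_01 is borrowed from the Examples section and is assumed to exist in the current schema.

```sql
-- Sketch: rebuild an existing vector index from its current definition.
-- Assumes an index named v_hnsw_01 already exists in the current schema.
BEGIN
  DBMS_VECTOR.REBUILD_INDEX(idx_name => 'v_hnsw_01');
END;
/
```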

Syntax

DBMS_VECTOR.REBUILD_INDEX (
    idx_name                   IN VARCHAR2,
    table_name                 IN VARCHAR2 DEFAULT NULL,
    idx_vector_col             IN VARCHAR2 DEFAULT NULL, 
    idx_include_cols           IN VARCHAR2 DEFAULT NULL,
    idx_partitioning_scheme    IN VARCHAR2 DEFAULT NULL,
    idx_organization           IN VARCHAR2 DEFAULT NULL,
    idx_distance_metric        IN VARCHAR2 DEFAULT 'COSINE',
    idx_accuracy               IN NUMBER   DEFAULT 90,
    idx_parameters             IN CLOB     DEFAULT NULL,
    idx_parallel_creation      IN NUMBER   DEFAULT 1,
    idx_rebuild_mode           IN VARCHAR2 DEFAULT NULL,
    idx_quantization_type      IN VARCHAR2 DEFAULT NULL,
    idx_compression_ratio      IN NUMBER   DEFAULT NULL,
    idx_distribute_parameters  IN CLOB     DEFAULT NULL,
    idx_duplicate_parameters   IN CLOB     DEFAULT NULL,
    idx_online_build           IN BOOLEAN  DEFAULT NULL,
    idx_owner                  IN VARCHAR2 DEFAULT NULL
);
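
Because most of the seventeen parameters are optional, PL/SQL named notation can be easier to read than counting positions. The following sketch reuses the index, table, and column names from the Examples section (illustrative names, not required ones) and leaves every unlisted parameter at its default.

```sql
-- Sketch: the HNSW rebuild from the Examples section, written with named notation.
-- v_hnsw_01, vpt01, and EMBEDDING are illustrative names.
BEGIN
  DBMS_VECTOR.REBUILD_INDEX(
    idx_name            => 'v_hnsw_01',
    table_name          => 'vpt01',
    idx_vector_col      => 'EMBEDDING',
    idx_organization    => 'INMEMORY NEIGHBOR GRAPH',
    idx_distance_metric => 'EUCLIDEAN',
    idx_accuracy        => 95,
    idx_parameters      => '{"type" : "HNSW", "neighbors" : 3, "efConstruction" : 4}'
  );
END;
/
```

Named notation also makes it harder to place a value such as idx_rebuild_mode in the wrong position, since it follows idx_parameters and idx_parallel_creation in the signature.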

Parameters

Parameter	Description
`idx_name`	Name of the index to rebuild.
`table_name`	Table on which to create the index.
`idx_vector_col`	Vector column on which to create the index.
`idx_include_cols`	A comma-separated list of column names to be covered by the index.
`idx_partitioning_scheme`	Partitioning scheme for IVF indexes: `GLOBAL` or `LOCAL`. IVF indexes support both global and local indexes on partitioned tables. By default, these indexes are globally partitioned by centroid. You can choose to create a local IVF index, which provides a one-to-one relationship between the base table partitions or subpartitions and the index partitions. For detailed information on these partitioning schemes, see Inverted File Flat Vector Indexes Partitioning Schemes.
`idx_organization`	Index organization: `NEIGHBOR PARTITIONS` (for IVF indexes) or `INMEMORY NEIGHBOR GRAPH` (for HNSW indexes). For detailed information on these organization types, see Manage the Different Categories of Vector Indexes.
`idx_distance_metric`	Distance metric or mathematical function used to compute the distance between vectors: `COSINE` (default), `MANHATTAN`, `HAMMING`, `JACCARD`, `DOT`, `EUCLIDEAN`, `L2_SQUARED`, or `EUCLIDEAN_SQUARED`. For detailed information on each of these metrics, see Vector Distance Functions and Operators.
`idx_accuracy`	Target accuracy, as a percentage, at which approximate search queries should be performed. The default is 90. As explained in Understand Approximate Similarity Search Using Vector Indexes, you can specify a non-default target accuracy either as a percentage value or through internal parameter values, depending on the index type you are using. For an HNSW approximate search: the target accuracy percentage influences the number of candidates considered while probing the index; the algorithm calculates that number automatically. A value of 100 tends to produce results similar to those of an exact search, but it does not force one: the optimizer may still use the index if doing so is faster given the predicates in the query. Instead of a target accuracy percentage, you can specify the `EFSEARCH` parameter to set the maximum number of candidates considered while probing the index. The higher that number, the higher the accuracy. For detailed information, see Understand Hierarchical Navigable Small World Indexes. For an IVF approximate search: the target accuracy percentage influences the number of partitions probed by the search; the algorithm calculates that number automatically. A value of 100 tends to produce results similar to those of an exact search, but it does not force one: the optimizer may still use the index if doing so is faster given the predicates in the query. Instead of a target accuracy percentage, you can specify the `NEIGHBOR PARTITION PROBES` parameter to set the maximum number of partitions probed by the search. The higher that number, the higher the accuracy. For detailed information, see Understand Inverted File Flat Vector Indexes.
`idx_parameters`	Type of vector index and associated parameters, specified in JSON format. For HNSW indexes: `type`: type of vector index to create, that is, `HNSW`; `neighbors`: maximum number of connections permitted per vector in the HNSW graph; `efConstruction`: maximum number of closest vector candidates considered at each step of the search during insertion; `offload_credential_name`: identifier of the credential used to authenticate with the offload service of the Private AI Services Container (typically, this should match the `offload_credential_name` used when the vector index was created, prefixed by a schema if the credential schema is not the same as the currently active user schema); `offload_url`: the Private AI Services Container service endpoint for index creation. For example: `{ "type" : "HNSW", "neighbors" : 3, "efConstruction" : 4, "offload_credential_name" : "privateai", "offload_url" : "https://<myprivateaiservicehostname>/v1/index" }`. For detailed information on these parameters, see Hierarchical Navigable Small World Index Syntax and Parameters. For information about the Private AI Services Container vector index service, see Oracle Private AI Services Container User's Guide. For IVF indexes: `type`: type of vector index to create, that is, `IVF`; `partitions`: number of neighbor partitions (clusters) into which to divide the vector space. For example: `{ "type" : "IVF", "partitions" : 5 }`. For detailed information on these parameters, see Inverted File Flat Index Syntax and Parameters.
`idx_parallel_creation`	Number of parallel threads used for index construction.
`idx_rebuild_mode`	Enables a full graph refresh when the value `FULL` is specified for the parameter. This parameter currently accepts only one value (`FULL`). By default, `idx_rebuild_mode` is set to `NULL`.
`idx_quantization_type`	Optional replacement quantization algorithm to apply to the rebuilt index. The supported values are `NULL`, `'NONE'`, and `'SCALAR'`.
`idx_compression_ratio`	Optional replacement compression ratio for a quantized vector index. This is provided either as a numeric value or `NULL` and is only valid when quantization is enabled.
`idx_distribute_parameters`	Optional replacement distribution settings for a distributed HNSW vector index. The value is provided as `NULL` or as a JSON `CLOB` that includes `distribute_method` and optionally a `service_name`. The supported values for `distribute_method` are `'ROWID RANGE'`, `'SIMILARITY'`, `'PARTITION'`, `'SUBPARTITION'`, `'DISTRIBUTE'`, and `'AUTO'`. This parameter is only supported for HNSW indexes.
`idx_duplicate_parameters`	Optional replacement duplication settings for a distributed HNSW vector index. The value is expected to be either `NULL` or a JSON `CLOB` that includes `duplicate_method` and optionally a `service_name`. The value provided for `duplicate_method` must be `'ALL'`. This parameter is only supported for HNSW indexes.
`idx_online_build`	Optional replacement online build setting to use when recreating the vector index. The values accepted are `TRUE`, `FALSE`, and `NULL`. The default value is `NULL`.
`idx_owner`	Specify the owning schema of the index to be rebuilt. The index will be rebuilt in the same schema. Use the default value, `NULL`, if the index is present in the current schema.
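
The idx_accuracy description distinguishes between a percentage target and explicit probe parameters. As a hedged sketch of how those two forms might look in an approximate search query (this is query syntax, not part of REBUILD_INDEX), the statements below assume that the vpt01 table from the Examples section has an id column and that :query_vector is a bind variable holding the search vector. Both are assumptions made for illustration, and the exact clause should be checked against the SQL reference for your release.

```sql
-- Percentage form: request a target accuracy of 95 percent.
SELECT id
FROM   vpt01
ORDER  BY VECTOR_DISTANCE(embedding, :query_vector, EUCLIDEAN)
FETCH  APPROX FIRST 10 ROWS ONLY WITH TARGET ACCURACY 95;

-- Parameter form for an HNSW index: cap the number of candidates considered.
-- The value 500 is arbitrary.
SELECT id
FROM   vpt01
ORDER  BY VECTOR_DISTANCE(embedding, :query_vector, EUCLIDEAN)
FETCH  APPROX FIRST 10 ROWS ONLY WITH TARGET ACCURACY PARAMETERS (EFSEARCH 500);
```

For an IVF index, the parameter form would name `NEIGHBOR PARTITION PROBES` instead of `EFSEARCH`.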

Examples

Specify neighbors and efConstruction for HNSW indexes:

dbms_vector.rebuild_index(
    'v_hnsw_01', 
    'vpt01', 
    'EMBEDDING', 
     NULL, 
     NULL, 
    'INMEMORY NEIGHBOR GRAPH', 
    'EUCLIDEAN', 
     95, 
    '{"type" : "HNSW", "neighbors" : 3, "efConstruction" : 4}');

Specify the number of partitions for IVF indexes:

dbms_vector.rebuild_index(
    'V_IVF_01', 
    'vpt01', 
    'EMBEDDING', 
     NULL,
     NULL, 
    'NEIGHBOR PARTITIONS', 
    'EUCLIDEAN', 
     95, 
    '{"type" : "IVF", "partitions" : 5}');

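To enable a full graph rebuild, pass FULL as idx_rebuild_mode, which is the eleventh positional argument, after idx_parameters and idx_parallel_creation:

dbms_vector.rebuild_index(
    'v_hnsw_01', 
    'vpt01', 
    'EMBEDDING', 
     NULL, 
     NULL, 
    'INMEMORY NEIGHBOR GRAPH', 
    'EUCLIDEAN', 
     95, 
    '{"type" : "HNSW", "neighbors" : 3, "efConstruction" : 4}',
     1,
    'FULL');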
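
Because the all-parameters form drops the index before recreating it, it can be worth confirming afterwards that the index is present again. A minimal sketch, assuming the index lives in the current schema and was created with an unquoted (uppercase) name, using the standard USER_INDEXES dictionary view:

```sql
-- Sketch: confirm the rebuilt index exists and inspect its type and status.
SELECT index_name, index_type, status
FROM   user_indexes
WHERE  index_name = UPPER('v_hnsw_01');
```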