CLUSTER statement to create a cluster. A cluster is a schema object that contains data from one or more tables, all of which have one or more columns in common. Oracle Database stores together all the rows from all the tables that share the same cluster key.
For information on existing clusters, query the
DBA_CLUSTERS data dictionary views.
Oracle Database Concepts for general information on clusters
Oracle Database Application Developer's Guide - Fundamentals for information on performance considerations of clusters
Oracle Database Performance Tuning Guide for suggestions on when to use clusters
Oracle Database Reference for information on the data dictionary views
To create a cluster in your own schema, you must have
CLUSTER system privilege. To create a cluster in another user's schema, you must have
CLUSTER system privilege. Also, the owner of the schema to contain the cluster must have either space quota on the tablespace containing the cluster or the
TABLESPACE system privilege.
Oracle Database does not automatically create an index for a cluster when the cluster is initially created. Data manipulation language (DML) statements cannot be issued against cluster tables in an indexed cluster until you create a cluster index with a
Specify the schema to contain the cluster. If you omit
schema, Oracle Database creates the cluster in your current schema.
Specify is the name of the cluster to be created.
After you create a cluster, you add tables to it. A cluster can contain a maximum of 32 tables. After you create a cluster and add tables to it, the cluster is transparent. You can access clustered tables with SQL statements just as you can access nonclustered tables.
See Also:CREATE TABLE for information on adding tables to a cluster, "Creating a Cluster: Example", and "Adding Tables to a Cluster: Example"
Specify one or more names of columns in the cluster key. You can specify up to 16 cluster key columns. These columns must correspond in both datatype and size to columns in each of the clustered tables, although they need not correspond in name.
You cannot specify integrity constraints as part of the definition of a cluster key column. Instead, you can associate integrity constraints with the tables that belong to the cluster.
See Also:"Cluster Keys: Example"
Specify the datatype of each cluster key column.
You cannot specify a cluster key column of datatype
REF, nested table, varray,
BFILE, or user-defined object type.
You cannot use the
IS clause if any column datatype is not
NUMBER with scale 0.
You can specify a column of type
ROWID, but Oracle Database does not guarantee that the values in such columns are valid rowids.
See Also:"Datatypes" for information on datatypes
SORT keyword is valid only if you are creating a hash cluster. This clause instructs Oracle Database to sort the rows of the cluster on this column before applying the hash function. Doing so may improve response time during subsequent operations on the clustered data. See "HASHKEYS Clause" for information on creating a hash cluster.
physical_attributes_clause lets you specify the storage characteristics of the cluster. Each table in the cluster uses these storage characteristics as well. If you do not specify values for these parameters, Oracle Database uses the following defaults:
INITRANS: 2 or the default value of the tablespace to contain the cluster, whichever is greater
Specify the amount of space in bytes reserved to store all rows with the same cluster key value or the same hash value. This space determines the maximum number of cluster or hash values stored in a data block. If
SIZE is not a divisor of the data block size, then Oracle Database uses the next largest divisor. If
SIZE is larger than the data block size, then the database uses the operating system block size, reserving at least one data block for each cluster or hash value.
The database also considers the length of the cluster key when determining how much space to reserve for the rows having a cluster key value. Larger cluster keys require larger sizes. To see the actual size, query the
KEY_SIZE column of the
USER_CLUSTERS data dictionary view. (This value does not apply to hash clusters, because hash values are not actually stored in the cluster.)
If you omit this parameter, then the database reserves one data block for each cluster key value or hash value.
Specify the tablespace in which the cluster is to be created.
INDEX to create an indexed cluster. In an indexed cluster, Oracle Database stores together rows having the same cluster key value. Each distinct cluster key value is stored only once in each data block, regardless of the number of tables and rows in which it occurs. If you specify neither
HASHKEYS, then Oracle Database creates an indexed cluster by default.
After you create an indexed cluster, you must create an index on the cluster key before you can issue any data manipulation language (DML) statements against a table in the cluster. This index is called the cluster index.
You cannot create a cluster index for a hash cluster, and you need not create an index on a hash cluster key.
See Also:CREATE INDEX for information on creating a cluster index and Oracle Database Concepts for general information in indexed clusters
HASHKEYS clause to create a hash cluster and specify the number of hash values for the hash cluster. In a hash cluster, Oracle Database stores together rows that have the same hash key value. The hash value for a row is the value returned by the hash function of the cluster.
Oracle Database rounds up the
HASHKEYS value to the nearest prime number to obtain the actual number of hash values. The minimum value for this parameter is 2. If you omit both the
INDEX clause and the
HASHKEYS parameter, the database creates an indexed cluster by default.
When you create a hash cluster, the database immediately allocates space for the cluster based on the values of the
See Also:Oracle Database Concepts for more information on how Oracle Database allocates space for clusters and "Hash Clusters: Examples"
TABLE indicates that the cluster is a type of hash cluster containing only one table. This clause can provide faster access to rows than would result if the table were not part of a cluster.
See Also:"Single-Table Hash Clusters: Example"
Must evaluate to a positive value
Must contain at least one column, with referenced columns of any datatype as long as the entire expression evaluates to a number of scale 0. For example:
Cannot reference user-defined PL/SQL functions
Cannot reference the pseudocolumns
Cannot reference the user-related functions
USER or the datetime functions
Cannot evaluate to a constant
Cannot be a scalar subquery expression
Cannot contain columns qualified with a schema or object name (other than the cluster name)
If you omit the
IS clause, then Oracle Database uses an internal hash function for the hash cluster.
For information on existing hash functions, query the
DBA_CLUSTER_HASH_EXPRESSIONS data dictionary tables.
The cluster key of a hash column can have one or more columns of any datatype. Hash clusters with composite cluster keys or cluster keys made up of noninteger columns must use the internal hash function.
See Also:Oracle Database Reference for information on the data dictionary views
parallel_clause lets you parallelize the creation of the cluster.
For complete information on this clause, please refer to parallel_clause in the documentation on
Restriction on Parallelizing Cluster Creation If the tables in
cluster contain any columns of LOB or user-defined object type, this statement as well as subsequent
DELETE operations on
cluster are executed serially without notification.
This clause lets you specify whether
cluster will use row-level dependency tracking. With this feature, each row in the tables that make up the cluster has a system change number (SCN) that represents a time greater than or equal to the commit time of the last transaction that modified the row. You cannot change this setting after
cluster is created.
ROWDEPENDENCIES if you want to enable row-level dependency tracking. This setting is useful primarily to allow for parallel propagation in replication environments. It increases the size of each row by 6 bytes.
See Also:Oracle Database Advanced Replication for information about the use of row-level dependency tracking in replication environments
CACHE if you want the blocks retrieved for this cluster to be placed at the most recently used end of the least recently used (LRU) list in the buffer cache when a full table scan is performed. This clause is useful for small lookup tables.
NOCACHE if you want the blocks retrieved for this cluster to be placed at the least recently used end of the LRU list in the buffer cache when a full table scan is performed. This is the default behavior.
NOCACHE has no effect on clusters for which you specify
KEEP in the
CREATE CLUSTER personnel (department NUMBER(4)) SIZE 512 STORAGE (initial 100K next 50K);
CREATE INDEX idx_personnel ON CLUSTER personnel;
After creating the cluster index, you can add tables to the index and perform DML operations on those tables.
CREATE TABLE dept_10 CLUSTER personnel (department_id) AS SELECT * FROM employees WHERE department_id = 10; CREATE TABLE dept_20 CLUSTER personnel (department_id) AS SELECT * FROM employees WHERE department_id = 20;
Hash Clusters: Examples The following statement creates a hash cluster named
language with the cluster key column
cust_language, a maximum of 10 hash key values, each of which is allocated 512 bytes, and storage parameter values:
CREATE CLUSTER language (cust_language VARCHAR2(3)) SIZE 512 HASHKEYS 10 STORAGE (INITIAL 100k next 50k);
Because the preceding statement omits the
IS clause, Oracle Database uses the internal hash function for the cluster.
The following statement creates a hash cluster named
address with the cluster key made up of the columns
country_id, and uses a SQL expression containing these columns for the hash function:
CREATE CLUSTER address (postal_code NUMBER, country_id CHAR(2)) HASHKEYS 20 HASH IS MOD(postal_code + country_id, 101);
Single-Table Hash Clusters: Example The following statement creates a single-table hash cluster named
cust_orders with the cluster key
customer_id and a maximum of 100 hash key values, each of which is allocated 512 bytes:
CREATE CLUSTER cust_orders (customer_id NUMBER(6)) SIZE 512 SINGLE TABLE HASHKEYS 100;