CSO_DOC_META_INFO

This table stores metadata for each document.

Details

  • Schema: FUSION

  • Object owner: CSO

  • Object type: TABLE

  • Tablespace: TRANSACTION_TABLES

Primary Key

Name Columns
CSO_DOC_META_INFO_PK COLLECTION_ID, DOC_ID
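
As a rough illustration of the composite primary key, the sketch below looks up a single row by (COLLECTION_ID, DOC_ID) and shows that a second row with the same key pair is rejected. It uses Python's built-in sqlite3 module with an in-memory stand-in table, not the actual Oracle schema. The reduced column set, the SQLite column types, the sample rows, and the helper names `find_document` and `insert_duplicate_fails` are assumptions made for the example.

```python
import sqlite3

# In-memory stand-in for CSO_DOC_META_INFO (assumption: SQLite types replace
# the Oracle NUMBER/VARCHAR2 types; only a few columns are reproduced).
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE CSO_DOC_META_INFO (
        COLLECTION_ID INTEGER NOT NULL,
        DOC_ID        INTEGER NOT NULL,
        CS_TYPE_ID    INTEGER NOT NULL,
        FETCH_URL     TEXT    NOT NULL,
        UUID          TEXT    NOT NULL,
        CONSTRAINT CSO_DOC_META_INFO_PK PRIMARY KEY (COLLECTION_ID, DOC_ID)
    )
    """
)

# Hypothetical sample rows: the key is the pair, so the same DOC_ID value
# can be stored under two different COLLECTION_ID values.
conn.executemany(
    "INSERT INTO CSO_DOC_META_INFO VALUES (?, ?, ?, ?, ?)",
    [
        (1, 100, 1, "http://example.com/a", "uuid-a"),
        (2, 100, 1, "http://example.com/b", "uuid-b"),
    ],
)


def find_document(connection, collection_id, doc_id):
    """Return (FETCH_URL, UUID) for one primary-key value, or None."""
    cur = connection.execute(
        "SELECT FETCH_URL, UUID FROM CSO_DOC_META_INFO "
        "WHERE COLLECTION_ID = ? AND DOC_ID = ?",
        (collection_id, doc_id),
    )
    return cur.fetchone()


def insert_duplicate_fails(connection):
    """Return True if re-inserting an existing key pair is rejected."""
    try:
        connection.execute(
            "INSERT INTO CSO_DOC_META_INFO "
            "VALUES (1, 100, 1, 'http://example.com/c', 'uuid-c')"
        )
    except sqlite3.IntegrityError:
        return True
    return False
```

Both key columns are needed to identify a row; querying by DOC_ID alone could match documents in several collections.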

Columns

Name Datatype Length Not-null Comments
CHECKSUM VARCHAR2 32 The checksum of the raw content of the document (used for comparison and modification detection).
COLLECTION_ID NUMBER 38 Yes A foreign key reference to COLLECTION_ID in CSO_COLLECTION table.
CRAWL_EVENT_ID VARCHAR2 256 The event ID for the crawl of the document.
CRAWL_RULE_INFO VARCHAR2 512 This column is not currently being used and may go away.
CREATION_TIME TIMESTAMP 7 This is the datetime when the document was stored.
CS_TYPE_ID NUMBER 38 Yes A foreign key reference to CS_TYPE_ID in CSO_CS_TYPE table.
DIRECT_IQXML NUMBER 38 A flag indicating whether the crawler generated IQXML directly.
DISPLAY_URL VARCHAR2 1024 This is the URL displayed in search results.
DOC_ENCODING VARCHAR2 32 This is the document encoding for this record.
DOC_ID NUMBER 38 Yes This is the document ID for this record.
DOC_LANGUAGE VARCHAR2 32 This is the document locale for this record.
DOC_LEVEL NUMBER 38 The depth of the document in the fetch process, relative to the starting point of an HTTP crawl.
DOC_SIZE_K NUMBER 38 The size of the raw document in kilobytes.
DOC_TYPE_ID NUMBER 38 A foreign key reference to DOC_TYPE_ID in CSO_DOC_TYPE table.
ETAG VARCHAR2 255 This column is not currently being used and may go away.
FETCH_URL VARCHAR2 1024 Yes This is the URL used to crawl this record.
INDEXING_ALLOWED NUMBER 38 A flag indicating whether the crawling process has determined the document can be indexed.
IS_DELETED NUMBER 38 A flag indicating whether a document has been deleted.
LAST_MODIFIED_TIME TIMESTAMP 7 The last datetime when the document was modified.
PARENT_ID NUMBER 38 This has been deprecated from the schema.
PREP_EVENT_ID VARCHAR2 256 The event ID for the pre-processing of the document.
RANKING NUMBER 38 This has been deprecated from the schema.
RECENCY NUMBER 38 This is the recency of the document.
RELEVANCY NUMBER 38 This is the relevancy of the document.
STARTPT_ID NUMBER 38 This is the ID of the starting point.
STATUS_CODE NUMBER 38 This is the status code of the document.
UUID VARCHAR2 64 Yes This is the UUID of the document.
VALID_END_TIME TIMESTAMP 7 This has been deprecated from the schema.
VALID_START_TIME TIMESTAMP 7 This has been deprecated from the schema.
VIRTUAL_COLLECTION_ID NUMBER 38 This is the ID of the virtual collection of the document.
VIRTUAL_DOC_ID NUMBER 38 This is the ID of the document in the virtual collection.
ORIGINAL_DATA_SIZE NUMBER 38 Filtering by this column eliminates extra database calls when determining whether the virtual collection (VC) is full.
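
The flag columns above can be combined in a filter. The following is a minimal sketch, again using Python's sqlite3 module with an in-memory stand-in table instead of the Oracle schema. The table description does not state the flag encoding, so the sketch assumes that 1 means "set" and 0 means "not set", and that a NULL IS_DELETED value means the document has not been deleted. The sample rows and the helper name `indexable_documents` are hypothetical.

```python
import sqlite3

# In-memory stand-in with a subset of the documented columns (assumption:
# SQLite INTEGER/TEXT replace the Oracle NUMBER/VARCHAR2 types).
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE CSO_DOC_META_INFO (
        COLLECTION_ID    INTEGER NOT NULL,
        DOC_ID           INTEGER NOT NULL,
        DISPLAY_URL      TEXT,
        INDEXING_ALLOWED INTEGER,
        IS_DELETED       INTEGER,
        DOC_SIZE_K       INTEGER,
        PRIMARY KEY (COLLECTION_ID, DOC_ID)
    )
    """
)

# Hypothetical sample data covering each flag combination of interest.
conn.executemany(
    "INSERT INTO CSO_DOC_META_INFO VALUES (?, ?, ?, ?, ?, ?)",
    [
        (1, 1, "http://example.com/live", 1, 0, 12),
        (1, 2, "http://example.com/gone", 1, 1, 8),
        (1, 3, "http://example.com/blocked", 0, 0, 40),
        (1, 4, "http://example.com/unknown", 1, None, 5),
    ],
)


def indexable_documents(connection, collection_id):
    """Return (DOC_ID, DISPLAY_URL, DOC_SIZE_K) for rows that are flagged
    as indexable and not flagged as deleted, ordered by DOC_ID."""
    rows = connection.execute(
        """
        SELECT DOC_ID, DISPLAY_URL, DOC_SIZE_K
        FROM CSO_DOC_META_INFO
        WHERE COLLECTION_ID = ?
          AND INDEXING_ALLOWED = 1           -- assumed encoding: 1 = allowed
          AND COALESCE(IS_DELETED, 0) = 0    -- assumed: NULL means not deleted
        ORDER BY DOC_ID
        """,
        (collection_id,),
    )
    return rows.fetchall()
```

Because the flag columns are nullable, any real query should decide explicitly how NULL is treated; the COALESCE call above is one possible choice, not a documented rule.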

Foreign Keys

Table Foreign Table Foreign Key Column
CSO_DOC_META_INFO cso_collection COLLECTION_ID
CSO_DOC_META_INFO cso_doc_type DOC_TYPE_ID
CSO_DOC_META_INFO cso_cs_type CS_TYPE_ID
cso_doc_super_title cso_doc_meta_info COLLECTION_ID, DOC_ID
cso_doc_attribute cso_doc_meta_info COLLECTION_ID, DOC_ID
cso_doc_data cso_doc_meta_info COLLECTION_ID, DOC_ID
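
The child-table relationships listed above can be sketched as a join on the composite key. The example below uses Python's sqlite3 module with in-memory stand-in tables, not the Oracle schema. Only the key columns of CSO_DOC_DATA come from the foreign key listing; the `PAYLOAD` column, the sample rows, and the helper names are hypothetical and exist only to make the example runnable.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite enforces foreign keys only when this pragma is enabled.
conn.execute("PRAGMA foreign_keys = ON")
conn.executescript(
    """
    CREATE TABLE CSO_DOC_META_INFO (
        COLLECTION_ID INTEGER NOT NULL,
        DOC_ID        INTEGER NOT NULL,
        FETCH_URL     TEXT    NOT NULL,
        PRIMARY KEY (COLLECTION_ID, DOC_ID)
    );
    -- PAYLOAD is a hypothetical column; the real CSO_DOC_DATA layout is
    -- not described in this section.
    CREATE TABLE CSO_DOC_DATA (
        COLLECTION_ID INTEGER NOT NULL,
        DOC_ID        INTEGER NOT NULL,
        PAYLOAD       TEXT,
        FOREIGN KEY (COLLECTION_ID, DOC_ID)
            REFERENCES CSO_DOC_META_INFO (COLLECTION_ID, DOC_ID)
    );
    """
)

# Hypothetical parent and child rows.
conn.execute(
    "INSERT INTO CSO_DOC_META_INFO VALUES (1, 100, 'http://example.com/a')"
)
conn.execute("INSERT INTO CSO_DOC_DATA VALUES (1, 100, 'raw content')")


def documents_with_data(connection):
    """Join each metadata row to its data rows on (COLLECTION_ID, DOC_ID)."""
    return connection.execute(
        """
        SELECT m.FETCH_URL, d.PAYLOAD
        FROM CSO_DOC_META_INFO m
        JOIN CSO_DOC_DATA d
          ON d.COLLECTION_ID = m.COLLECTION_ID
         AND d.DOC_ID = m.DOC_ID
        ORDER BY m.COLLECTION_ID, m.DOC_ID
        """
    ).fetchall()


def orphan_insert_rejected(connection):
    """Return True if a child row without a matching parent is rejected."""
    try:
        connection.execute("INSERT INTO CSO_DOC_DATA VALUES (9, 9, 'orphan')")
    except sqlite3.IntegrityError:
        return True
    return False
```

The join condition uses both key columns, mirroring the two-column foreign keys in the listing.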

Indexes

Index Uniqueness Columns
CSO_DOC_META_INFO_U1 Unique COLLECTION_ID, DOC_ID
CSO_DOC_META_INFO_N2 Non Unique DOC_TYPE_ID
CSO_DOC_META_INFO_N3 Non Unique CS_TYPE_ID
CSO_DOC_META_INFO_N4 Non Unique VIRTUAL_COLLECTION_ID, DOC_TYPE_ID
CSO_DOC_META_INFO_N5 Non Unique VIRTUAL_COLLECTION_ID, VIRTUAL_DOC_ID
CSO_DOC_META_INFO_N6 Non Unique FETCH_URL
CSO_DOC_META_INFO_N7 Non Unique COLLECTION_ID, PARENT_ID
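
To show how an index such as CSO_DOC_META_INFO_N6 can serve a lookup by FETCH_URL, the sketch below recreates two of the listed indexes on an in-memory SQLite stand-in table and inspects the query plan. This reflects the SQLite planner only; Oracle's optimizer makes its own decisions and may behave differently. The reduced column set and the helper names `plan_for_fetch_url_lookup` and `duplicate_key_rejected` are assumptions made for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    -- Stand-in table with a subset of columns (assumption: SQLite types).
    CREATE TABLE CSO_DOC_META_INFO (
        COLLECTION_ID INTEGER NOT NULL,
        DOC_ID        INTEGER NOT NULL,
        FETCH_URL     TEXT    NOT NULL
    );
    CREATE UNIQUE INDEX CSO_DOC_META_INFO_U1
        ON CSO_DOC_META_INFO (COLLECTION_ID, DOC_ID);
    CREATE INDEX CSO_DOC_META_INFO_N6
        ON CSO_DOC_META_INFO (FETCH_URL);
    """
)


def plan_for_fetch_url_lookup(connection):
    """Return SQLite's query-plan text for an equality lookup on FETCH_URL."""
    rows = connection.execute(
        "EXPLAIN QUERY PLAN "
        "SELECT COLLECTION_ID, DOC_ID FROM CSO_DOC_META_INFO "
        "WHERE FETCH_URL = ?",
        ("http://example.com/a",),
    ).fetchall()
    # The last column of each plan row holds the human-readable detail.
    return " ".join(row[-1] for row in rows)


def duplicate_key_rejected(connection):
    """Return True if the unique index blocks a repeated key pair."""
    connection.execute(
        "INSERT INTO CSO_DOC_META_INFO VALUES (1, 100, 'http://example.com/a')"
    )
    try:
        connection.execute(
            "INSERT INTO CSO_DOC_META_INFO "
            "VALUES (1, 100, 'http://example.com/b')"
        )
    except sqlite3.IntegrityError:
        return True
    return False
```

Printing the result of `plan_for_fetch_url_lookup(conn)` shows which index SQLite selected; the exact wording of the plan text varies between SQLite versions.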