CSO_CRAWL_EXT_URL_DATA

This table stores information related to urls crawled for a crawl config.

Details

  • Schema: FUSION

  • Object owner: CSO

  • Object type: TABLE

  • Tablespace: Default

Primary Key

Name Columns

CSO_CRAWL_EXT_URL_DATA_PK

CSO_CRAWL_EXT_URL_DATA_ID

Columns

Name Datatype Length Precision Not-null Comments
CSO_CRAWL_EXT_URL_DATA_ID NUMBER Yes This is the primary key of the table.
CRAWL_CONFIG_ID NUMBER Yes This column refers to crawl config for which current url was downloaded.
CSO_INDEX_QUEUE_ITEM_ID NUMBER Yes This column specifies index queue item for which this url was downloaded.
FETCH_URL VARCHAR2 4000 Yes This column specifies url that is downloaded.
FETCH_URL_ID VARCHAR2 128 Yes This column specifies a unique hash for url string.
PARENT_URL_ID NUMBER Yes This column specifies parent of current url.
DEPTH_LEVEL NUMBER Yes This column specifies depth at which this url was downloaded.
CONTENT_CHECKSUM VARCHAR2 128 This column specifies checksum for the downloaded url content.
MIMETYPE VARCHAR2 254 This column specifies mimetype for the current url.
DETECTED_DOC_LANGUAGE VARCHAR2 32 This column specifies detected language for url content.
DOC_LANGUAGE VARCHAR2 32 This column specifies language in which article was actually created for url content.
DOC_SIZE_K NUMBER This column specifies size of the downloaded content.
CHANGE_FREQUENCY NUMBER This column specifies frequency at which url content is expected to change.
URL_PRIORITY NUMBER This column specifies url priority for processing.
URL_LAST_MODIFIED TIMESTAMP This column specifies when url content was last modified.
CONTENT_ID VARCHAR2 32 This column specifies content id of the article created for the url.
RAW_EXTERNAL_ID VARCHAR2 32 This column specifies ucm id for the uploaded raw url content.
STATUS_CODE NUMBER This column specifies processing status of the url.
LOCK_UNTIL NUMBER Yes This column specifies time measured in epoch time until which current url cannot be picked again for processing.
NEXT_FETCH_DATE NUMBER Yes This column specifies time measured by epoch time, when this url can be processed again.
CREATED_BY VARCHAR2 64 Yes Who column: indicates the user who created the row.
LAST_UPDATED_BY VARCHAR2 64 Yes Who column: indicates the user who last updated the row.
LAST_UPDATE_DATE TIMESTAMP Yes Who column: indicates the date and time of the last update of the row.
LAST_UPDATE_LOGIN VARCHAR2 32 Who column: indicates the session login associated to the user who last updated the row.
OBJECT_VERSION_NUMBER NUMBER 9 Yes Used to implement optimistic locking. This number is incremented every time that the row is updated. The number is compared at the start and end of a transaction to detect whether another session has updated the row since it was queried.
CREATION_DATE TIMESTAMP Yes Who column: indicates the date and time of the creation of the row.

Foreign Keys

Table Foreign Table Foreign Key Column
CSO_CRAWL_EXT_URL_DATA cso_crawl_config CRAWL_CONFIG_ID
CSO_CRAWL_EXT_URL_DATA cso_index_queue_item CSO_INDEX_QUEUE_ITEM_ID

Indexes

Index Uniqueness Tablespace Columns
CSO_CRAWL_EXT_URL_DATA_PK Unique Default CSO_CRAWL_EXT_URL_DATA_ID