CSO_CRAWL_EXT_URL_DATA
This table stores information related to urls crawled for a crawl config.
Details
-
Schema: FUSION
-
Object owner: CSO
-
Object type: TABLE
-
Tablespace: Default
Primary Key
Name | Columns |
---|---|
CSO_CRAWL_EXT_URL_DATA_PK |
CSO_CRAWL_EXT_URL_DATA_ID |
Columns
Name | Datatype | Length | Precision | Not-null | Comments |
---|---|---|---|---|---|
CSO_CRAWL_EXT_URL_DATA_ID | NUMBER | Yes | This is the primary key of the table. | ||
CRAWL_CONFIG_ID | NUMBER | Yes | This column refers to crawl config for which current url was downloaded. | ||
CSO_INDEX_QUEUE_ITEM_ID | NUMBER | Yes | This column specifies index queue item for which this url was downloaded. | ||
FETCH_URL | VARCHAR2 | 4000 | Yes | This column specifies url that is downloaded. | |
FETCH_URL_ID | VARCHAR2 | 128 | Yes | This column specifies a unique hash for url string. | |
PARENT_URL_ID | NUMBER | Yes | This column specifies parent of current url. | ||
DEPTH_LEVEL | NUMBER | Yes | This column specifies depth at which this url was downloaded. | ||
CONTENT_CHECKSUM | VARCHAR2 | 128 | This column specifies checksum for the downloaded url content. | ||
MIMETYPE | VARCHAR2 | 254 | This column specifies mimetype for the current url. | ||
DETECTED_DOC_LANGUAGE | VARCHAR2 | 32 | This column specifies detected language for url content. | ||
DOC_LANGUAGE | VARCHAR2 | 32 | This column specifies language in which article was actually created for url content. | ||
DOC_SIZE_K | NUMBER | This column specifies size of the downloaded content. | |||
CHANGE_FREQUENCY | NUMBER | This column specifies frequency at which url content is expected to change. | |||
URL_PRIORITY | NUMBER | This column specifies url priority for processing. | |||
URL_LAST_MODIFIED | TIMESTAMP | This column specifies when url content was last modified. | |||
CONTENT_ID | VARCHAR2 | 32 | This column specifies content id of the article created for the url. | ||
RAW_EXTERNAL_ID | VARCHAR2 | 32 | This column specifies ucm id for the uploaded raw url content. | ||
STATUS_CODE | NUMBER | This column specifies processing status of the url. | |||
LOCK_UNTIL | NUMBER | Yes | This column specifies time measured in epoch time until which current url cannot be picked again for processing. | ||
NEXT_FETCH_DATE | NUMBER | Yes | This column specifies time measured by epoch time, when this url can be processed again. | ||
CREATED_BY | VARCHAR2 | 64 | Yes | Who column: indicates the user who created the row. | |
LAST_UPDATED_BY | VARCHAR2 | 64 | Yes | Who column: indicates the user who last updated the row. | |
LAST_UPDATE_DATE | TIMESTAMP | Yes | Who column: indicates the date and time of the last update of the row. | ||
LAST_UPDATE_LOGIN | VARCHAR2 | 32 | Who column: indicates the session login associated to the user who last updated the row. | ||
OBJECT_VERSION_NUMBER | NUMBER | 9 | Yes | Used to implement optimistic locking. This number is incremented every time that the row is updated. The number is compared at the start and end of a transaction to detect whether another session has updated the row since it was queried. | |
CREATION_DATE | TIMESTAMP | Yes | Who column: indicates the date and time of the creation of the row. |
Foreign Keys
Table | Foreign Table | Foreign Key Column |
---|---|---|
CSO_CRAWL_EXT_URL_DATA | cso_crawl_config | CRAWL_CONFIG_ID |
CSO_CRAWL_EXT_URL_DATA | cso_index_queue_item | CSO_INDEX_QUEUE_ITEM_ID |
Indexes
Index | Uniqueness | Tablespace | Columns |
---|---|---|---|
CSO_CRAWL_EXT_URL_DATA_PK | Unique | Default | CSO_CRAWL_EXT_URL_DATA_ID |