If the log level of a crawl is set to INFO, DEBUG, or TRACE, the crawl statistics are written to the log as INFO entries when the crawl finishes, as in this example (timestamps and log levels are omitted for ease of reading):
Crawl Mode = FULL_CRAWL (MetricsReport)
Crawl Stop Cause = Completed (MetricsReport)
Directories Filtered from Archives = 0 (MetricsReport)
Directories Filtered = 0 (MetricsReport)
Total Records Output = 423 (MetricsReport)
Files Filtered from Archives = 124 (MetricsReport)
Directories Crawled Not from Archives = 55 (MetricsReport)
Documents Unsuccessfully Converted = 9 (MetricsReport)
Files Crawled from Archives = 65 (MetricsReport)
Files Crawled Not from Archives = 285 (MetricsReport)
Delete Records Output = 0 (MetricsReport)
Files Filtered Not from Archives = 51 (MetricsReport)
Directories Crawled = 73 (MetricsReport)
Directories Filtered Not from Archives = 0 (MetricsReport)
Documents Converted = 333 (MetricsReport)
Files Crawled = 350 (MetricsReport)
Documents Converted After Retry = 0 (MetricsReport)
New or Updated Records Output = 423 (MetricsReport)
Directories Crawled from Archives = 18 (MetricsReport)
Files Filtered = 175 (MetricsReport)
Crawl Seconds = 71 (MetricsReport)
Start Time = 5/23/08 9:23:59 AM EDT (MetricsReport)
End Time = 5/23/08 9:25:10 AM EDT (MetricsReport)
Note that for incremental crawls, the Delete Records Output statistic is also included and indicates how many files were deleted since the previous crawl. An Endeca record is created for each deleted file, with its Endeca.Action property set to DELETE.
The Crawl Stop Cause statistic has one of the following values:
If a crawl fails, the Crawl Failure Reason statistic provides a message from the CAS Server explaining the failure.
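When the statistics need to be checked by a script rather than read by eye, the report entries can be collected into a dictionary. The following Python sketch is illustrative only and is not part of CAS: it assumes each entry has the form shown in the example above (name, equals sign, value, then the "(MetricsReport)" marker) with any timestamp and log-level prefix already removed. The function name parse_metrics and the sample text are hypothetical.

```python
import re

# Assumed entry layout, taken from the example in this section:
#   <statistic name> = <value> (MetricsReport)
METRIC_LINE = re.compile(
    r"^(?P<name>[^=]+?)\s*=\s*(?P<value>.*?)\s*\(MetricsReport\)\s*$"
)


def parse_metrics(log_text):
    """Return a dict mapping each statistic name to its value.

    Purely numeric values are converted to int; everything else
    (crawl mode, stop cause, timestamps) is kept as a string.
    Lines that do not look like MetricsReport entries are skipped.
    """
    metrics = {}
    for line in log_text.splitlines():
        match = METRIC_LINE.match(line.strip())
        if match is None:
            continue
        name = match.group("name").strip()
        value = match.group("value")
        metrics[name] = int(value) if value.isdigit() else value
    return metrics


# A few entries copied from the example report above.
sample = """\
Crawl Mode = FULL_CRAWL (MetricsReport)
Crawl Stop Cause = Completed (MetricsReport)
Documents Unsuccessfully Converted = 9 (MetricsReport)
Files Crawled = 350 (MetricsReport)
Start Time = 5/23/08 9:23:59 AM EDT (MetricsReport)
"""

stats = parse_metrics(sample)
if stats.get("Crawl Stop Cause") != "Completed":
    print("Crawl did not complete:", stats.get("Crawl Failure Reason"))
print("Files crawled:", stats["Files Crawled"])
```

If the real log lines carry a timestamp or log-level prefix, the pattern would need to be adjusted to skip it; the exact prefix layout depends on the logging configuration and is not assumed here.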
If the log is too verbose, which makes errors harder to find, you can change the log level of the crawl. The default log level is INFO.
The CAS logging configuration file is cas-service-log4j.properties and is located in the <install path>\CAS\workspace\conf directory. You can also change the log level on a per-crawl basis using the CAS Console, the CAS API, or the CAS command-line utilities.