You can globally exclude file formats by adding their file extensions to an exclusion line in the crawl-urlfilter.txt file.

The default crawl-urlfilter.txt configuration excludes these file types:

Except for HTML, text-based, and JavaScript files, text conversion on all other file types is performed by the CAS Document Conversion Module (if you have installed and enabled the module). As a rule of thumb, therefore, you should exclude any file format that is not supported by the module. For a list of the supported file formats, see the CAS Developer's Guide.


Copyright © Legal Notices