The CAS Web Crawling component is deprecated in this release, and will be removed in a future release.
Note
This is only the crawling aspect of the solution, that is, the retrieval of web pages and subsequent text extraction.
We recommend that you replace any existing implementation of the web crawling component with a third-party web crawler, for example, the latest version of Apache Nutch, and have the emitted records fed to CAS for indexing as usual.