Modify a crawl when you want to reconfigure its output settings. The available settings depend on the output type of the crawl.
For example, if you modify a crawl that writes to an output file, you could specify a different output directory, a different file format, and so on. If you modify a crawl that writes to MDEX-compatible output, you could change the output directory, the Dimension Value Id Manager name, and so on. If you modify a crawl that writes to a Record Store instance, you could specify a different host machine, enable SSL, and so on.
To modify a crawl:
Start a command prompt and navigate to the directory that contains the CAS Server Command-line utility (<install path>\CAS\<version>\bin\cas-cmd).
Run the getCrawl task of cas-cmd. Specify the -f option with the path of the file to which the crawl configuration is written, and specify the -id option with the ID of the crawl. Optionally, specify the -d option to write default values for the configuration properties.
For example, in a default CAS installation on Windows, this command gets the configuration of a crawl named itldocset and writes it to C:\tmp\crawlConfig.xml:
C:\Endeca\CAS\<version>\bin>cas-cmd getCrawl -d -f C:\tmp\crawlConfig.xml -id itldocset
Modify the XML crawl configuration file as necessary. The configuration settings vary depending on the crawl output type. See the following samples:
Sample configuration for writing output to a Record Store instance
Sample configuration for writing output to an MDEX-compatible format
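A hand-edited file can easily end up with an unclosed or mismatched tag, so it may be worth confirming that the file still parses before passing it back to cas-cmd. The following is a minimal sketch in Python and is not part of CAS. The helper name is_well_formed_xml is hypothetical, and the check covers XML syntax only; it does not verify that the settings are valid for a given output type.

```python
import xml.etree.ElementTree as ET


def is_well_formed_xml(path):
    """Return True if the file at `path` parses as XML, False otherwise.

    Hypothetical helper: it checks syntax only and knows nothing about
    the CAS crawl configuration schema.
    """
    try:
        ET.parse(path)
    except ET.ParseError:
        # Raised for unclosed tags, mismatched tags, stray characters, etc.
        return False
    return True
```

For the example in this procedure, the call would be is_well_formed_xml(r"C:\tmp\crawlConfig.xml"). A missing file raises the usual FileNotFoundError rather than returning False, so a bad path is not mistaken for bad XML.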
Run the updateCrawls task of cas-cmd and specify the -f option with the path of the XML crawl configuration file that you modified in the previous step.
For example, in a default CAS installation on Windows, this command updates the crawl named itldocset:
C:\Endeca\CAS\<version>\bin>cas-cmd updateCrawls -f C:\tmp\crawlConfig.xml
If the task succeeds, the console displays a message similar to the following:
Updated crawl itldocset
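The getCrawl and updateCrawls invocations in this procedure can also be scripted. The following Python sketch is an illustration and is not part of CAS. The function names are hypothetical, and it uses only the task names and the -d, -f, and -id options shown in this procedure. The success check assumes that the console output contains "Updated crawl" followed by the crawl ID; the procedure describes that message only as similar to the actual output.

```python
import subprocess


def get_crawl_command(cas_cmd, crawl_id, config_path, write_defaults=True):
    """Build the argument list for the getCrawl task."""
    args = [cas_cmd, "getCrawl"]
    if write_defaults:
        args.append("-d")  # write default values for configuration properties
    args += ["-f", config_path, "-id", crawl_id]
    return args


def update_crawls_command(cas_cmd, config_path):
    """Build the argument list for the updateCrawls task."""
    return [cas_cmd, "updateCrawls", "-f", config_path]


def update_succeeded(console_output, crawl_id):
    """Assumed success check: look for 'Updated crawl <id>' in the output."""
    return f"Updated crawl {crawl_id}" in console_output


def run(args):
    """Run a command and return its combined console output as text."""
    result = subprocess.run(args, capture_output=True, text=True)
    return result.stdout + result.stderr
```

A typical sequence would be to call run(get_crawl_command(...)), edit the file, call run(update_crawls_command(...)), and pass the returned text to update_succeeded. The cas_cmd argument is whatever path starts the utility on your system. Building the commands as argument lists rather than single strings avoids quoting problems when a path contains spaces.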

