Modifying a crawl may be useful if you want to re-configure any output settings. The settings vary depending on the output type of the crawl.
For example, if you modify a crawl that writes to an output file, you could re-configure a different output directory, different file format, and so on. If you modify a crawl that writes to MDEX-compatible output, you could re-configure the output directory, the Dimension Value Id Manager name, and so on. If you modify a crawl that writes to a Record Store instance, you could re-configure a different host machine, or use SSL, and so on.
To modify a crawl:
Start a command prompt, navigate to
<install path>
\CAS\<version>
\bincas-cmd
).Run the
getCrawl
task of thecas-cmd
and specify the-f
option with an argument that specifies a path for the crawl configuration file and also specify the-id
option with the ID of the crawl. Optionally, you may want to specify the-d
option to write default values for the configuration properties.For example, in a default CAS installation on Windows, this command identifies a crawl named
itldocset
and gets its configuration and writes it toC:\tmp\crawlConfig.xml
.C:\Endeca\CAS\<version>\bin>cas-cmd getCrawl -d -f C:\tmp\crawlConfig.xml -id itldocset
Modify the XML crawl configuration file as necessary. The configuration settings vary depending on the crawl output type. See the following samples:
Sample configuration for writing output to a Record Store instance
Sample configuration for writing output to an MDEX compatible format
Run the
updateCrawls
task of thecas-cmd
and specify the-f
option with an argument that specifies the name for the XML crawl configuration file you modified in the previous steps.For example, in a default CAS installation on Windows, this command creates a data source named
itldocset
.C:\Endeca\CAS\<version>\bin>cas-cmd updateCrawls -f C:\tmp\crawlConfig.xml
If the task succeeds, the console displays a message similar to the following:
Updated crawl itldocset