When a crawl is run, you specify its workspace directory either explicitly (via a path in the -w flag) or implicitly (via the output.file.directory property in the configuration file). Note that the -w flag overrides the setting of the output.file.directory property if the values are different.

By default, the workspace directory has these subdirectories:

If you are running simultaneous crawls, each crawl must have its own workspace directory.


Copyright © Legal Notices