When a crawl is run, you specify its workspace directory either explicitly (via a path in the -w
flag)
or implicitly (via the output.file.directory
property in the configuration file). Note that the -w
flag overrides the setting of the output.file.directory
property
if the values are different.
By default, the workspace directory has these subdirectories:
If you are running simultaneous crawls, each crawl must have its own workspace directory.