You add a File System data source by specifying folders and files to crawl (seeds), optionally specifying filters that include or exclude files, and specifying options for Endeca records that result from crawling the data source.
If you want to crawl network drives, map the drive on the machine running CAS before specifying it as a seed.
To add a new File System data source:
Select File System from the list and click Add.
The Data Source tab displays.
In Name, specify a unique name for the data source to distinguish it from others in the CAS Console.
You can create a data source name with alphanumeric characters, underscores, dashes, and periods. All other characters are invalid for a name.
Under Seeds, specify at least one seed to crawl.
For crawling file systems, a seed is an absolute path to a folder you want to crawl. Seeds may be local folders or network drives. Note that for Windows, you should specify network drives by universal naming convention (UNC) syntax rather than by using the letter of a mapped drive. For UNIX, you can specify mounted or local drives using standard file path syntax.
Examples of local folders on Windows:
Examples of syntax for network drives:
Click Save or select the Filters or Advanced Settings tab to continue configuring the data source.
The data source displays Acquisition Steps where you can add manipulators, revise the data source configuration if necessary, or start acquiring data from the data source.
At this point, you can add manipulators, acquire data from the data source, and monitor its status.