After you create and configure a data source, you can acquire data from it and, if desired, you can monitor its progress or stop acquiring from the data source.
When you acquire from a data source, the CAS Server automatically determines which acquisition mode is necessary. By default, the CAS Server attempts incremental acquisition, and it switches to full acquisition if any of the following conditions are true:
A data source has not been acquired before, which means no crawl history exists.
A Record Store instance that stores record output does not contain at least one record generation. This applies to the default case in which the CAS Server is configured to output to a Record Store instance rather than a file on disk.
Seeds have been removed from the data source configuration (adding seeds does not require full acquisition).
Filters have been added, modified, or removed in the data source configuration.
Repository properties have changed, such as the
username
property setting for CMS data sources.
In all other cases, the CAS Server acquires incrementally. However, you may force full acquisition of a data source. For more information on the difference between full and incremental acquisition, see the CAS Developer's Guide.
Also, a data source many contain one or more manipulators as part of its configuration. In this release, manipulators do not display in CAS Console. You can view, add, and configure manipulators using the CAS Server Command-line Utility. When you start an acquisition, any manipulators included with the data source also run and perform record manipulation.
To acquire from a data source:
The message in the Status column reads "Acquiring" for this data source. A timestamp appears for the Start Time, and Duration shows the time elapsed since the CAS Server started acquiring from this data source.