The sample Change Tracking data source illustrates an implementation of the IncrementalDataSourceRuntime interface. This interface provides support to check whether a full acquisition is required from the Change Tracking data source. If a full acquisition is not required, then the data source provides an implementation of runIncrementalAcquisition() to acquire only the changed records.
After you install the extensions into the Integrator Acquisition System, you can configure and then run the sample Change Tracking data source.
To run the sample Change Tracking data source:
For example, copy fileSystemCrawl.xml and save it as sampleChgTracking.xml within <install path>\IAS\<version>\sample\crawlConfigFiles.
Option | Description |
---|---|
crawlId | Specify a unique name to distinguish the crawl from others in IAS. For example, sampleChgTracking. |
moduleId | Specify the module ID for a Change Tracking data source. This value must be set to com.endeca.ias.extension.sample.datasource.incremental.ChangeTrackingDataSource. |
Option | Description |
---|---|
path | Specify the path to the documents: <install path>\IAS\<version>\sample\ias-extensions\data\change-tracking-db.xml. |
... <sourceConfig> <moduleId> <id>com.endeca.ias.extension.sample.datasource.incremental.ChangeTrackingDataSource</id> </moduleId> <moduleProperties> <moduleProperty> <key>path</key> <value>C:\Oracle\Endeca\IAS\3.1.0\sample\ias-extensions\data\change-tracking-db.xml</value> </moduleProperty> </moduleProperties> <excludeFilters /> <includeFilters /> </sourceConfig> ...
Option | Description |
---|---|
moduleId | Specify the output type for a crawl. Specify an id of Record Store. |
host | Specify the fully qualified name of the host running the Record Store instance. The default value is localhost. |
port | Specify the port of the Endeca IAS Service running the Record Store instance. The default value is 8401. |
contextPath | If you installed IAS into WebLogic
Server, and you modified the default WebLogic context path, then specify the
revised context path without including a forward slash. In WebLogic Server
installations, the default value of
contextPath is
ias-server. If you installed IAS into
Jetty, you can remove
contextPath or specify an empty value.
For example: <moduleProperty> <key>contextPath</key> <value>ias-server</value> </moduleProperty> |
... <outputConfig> <moduleId> <id>Record Store</id> </moduleId> <moduleProperties> <moduleProperty> <key>host</key> <value>mymachine.endeca.com</value> </moduleProperty> <moduleProperty> <key>port</key> <value>8401</value> </moduleProperty> </moduleProperties> </outputConfig> ...
C:\Oracle\Endeca\IAS\3.1.0\bin>ias-cmd.bat createCrawls -f C:\Oracle\Endeca\IAS\ 3.1.0\sample\crawlConfigFiles\sampleChgTracking.xml Created crawl sampleChgTracking
C:\Oracle\Endeca\IAS\3.1.0\bin>ias-cmd.bat startCrawl -id sampleChgTracking
C:\Oracle\Endeca\IAS\3.1.0\bin>recordstore-cmd.bat read-baseline -a sampleChgTracking [Endeca.Id=1, Endeca.Action=UPSERT, Endeca.SourceId=sampleChgTracking, DATA=base line data...] [Endeca.Id=3, Endeca.Action=UPSERT, Endeca.SourceId=sampleChgTracking, DATA=some incremental data...] [Endeca.Id=5, Endeca.Action=UPSERT, Endeca.SourceId=sampleChgTracking, DATA=some incremental data...]
<changeHistory> <key>5</key> <changeType>UPDATE</changeType> <time>2010-02-02T19:19:43.471-05:00</time> </changeHistory>
Acquiring data from this file results in an incremental update to record 5.
<row> <key>7</key> <data>some incremental data...</data> </row>
<changeHistory> <key>7</key> <changeType>CREATE</changeType> <time>2010-02-02T19:19:43.471-05:00</time> </changeHistory>
<changeHistory> <key>8</key> <changeType>DELETE</changeType> <time>2010-02-02T19:19:43.471-05:00</time> </changeHistory>
C:\Oracle\Endeca\IAS\3.1.0\bin>ias-cmd.bat startCrawl -id sampleChgTracking
Oracle Endeca Information Discovery Integrator: Integrator Acquisition System Extension API Guide · Version 3.1.0 · October 2013
Copyright © 2003, 2013, Oracle and/or its affiliates. All rights reserved.