After you have added your dimensions and Guided Search properties to your project, you may want to include other pipeline components to perform additional tasks. The following table describes the components you can add:
Component |
Description |
For More Info |
---|---|---|
Record assemblers |
Join data from one or more secondary data sources to the current record. |
"Adding a record assembler" in this guide and in the Oracle Developer Studio Help. |
Record caches |
Store a temporary copy of record data that has been read in by a record adapter. Record caches are generally used in conjunction with record assemblers and are set up to contain data from secondary data sources. |
"Adding a record cache" in this guide and in the Oracle Developer Studio Help. |
Java manipulators |
A Java manipulator is your own code in Java that you can use to perform data manipulation on properties and records. Java manipulators provide you with the most generic way of changing records in the Forge pipeline. A Java manipulator contains a class that is based on the Java API Adapter interface in the Content Adapter Development Kit (CADK). |
For information about how to write your own Java manipulator and for a sample code, see the Content Adapter Development Kit (CADK) Guide. |
Perl manipulators |
Allow you to write custom Perl code that changes the data associated with a Guided Search record. Perl manipulators are useful for such tasks as manually adding or removing source properties, changing the value of a source property, retrieving records based on a particular key, and so on. |
See "Using Perl Manipulators to Change Source Properties" in the Oracle Developer Studio Help. For details on Perl code syntax, see the Forge API Guide for Perl. |
Spiders |
Crawl document hierarchies on a file system or over HTTP. From a root URL, a spider spools URLs of documents to crawl. |
"Creating a spider" in this guide. |
Record manipulators |
Provide support, such as URL extraction, for a content acquisition system, such as a crawler implementation. |
See "Record Manipulators and Expressions" in the Oracle Developer Studio Help. |
Update adapters |
Provide support for partial (rapid) updates. |
See the Partial Updates Guide. |
Related links