VOID expressions return no value; they are used to perform other work. The VOID PARSE_DOC expression obtains metadata and extracts text from a document, then adds both to the record as property values.

PARSE_DOC extracts text directly from text/plain and text/html files; other file types are passed to the Document Conversion Module converters for parsing. See "Implementing the Endeca Crawler" in the Forge Guide for a description of each generated property that PARSE_DOC adds to the record.

The following list describes the optional expression nodes that can modify PARSE_DOC:

See the EXPRESSION element for DTD and attribute information.
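
As a rough illustration, a bare PARSE_DOC call with none of the optional expression nodes might look like the fragment below. This is a sketch only: the TYPE and NAME attributes are assumed from the general form of the EXPRESSION element, and the fragment is assumed to sit inside a record manipulator in the pipeline. Check the EXPRESSION element DTD before relying on it.

```xml
<!-- Minimal sketch (assumed attribute layout): invoke PARSE_DOC on the
     current record with default behavior and no optional nodes. -->
<EXPRESSION TYPE="VOID" NAME="PARSE_DOC"/>
```

Because the expression is VOID, it produces no return value; its effect is visible only in the properties added to the record.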

