VOID
expressions return no value but are used to
perform other work. The
VOID STRATIFY
expression identifies a Stratify
Classification Server that classifies Guided Search records.
For each record that passes through the record manipulator, the
STRATIFY
expression requests that the Stratify
Classification Server classify a document indicated by
Endeca.Document.Body
. Forge sends the document as an
attachment to a Stratify Classification Server. The Stratify Classification
Server examines the document including the document’s structure and classifies
it according to the classification model you developed in Stratify Taxonomy
Manager. You indicate the classification model in the
HIERARCHY_ID
expression node. The Classification
Server then sends back property values containing a Stratify topic name, a
unique ID, and a confidence rating of the classification. Forge appends these
values to the record for the document.
The following expression nodes are required in
VOID STRATIFY
:
STRATIFY_HOST
- Specifies the machine name or IP address of the Stratify Classification Server.STRATIFY_PORT
- Specifies the port on which the Stratify Classification Server listens for requests from Forge.HIERARCHY_ID
- Specifies the identifier of a Stratify classification model. To determine theVALUE
ofHIERARCHY_ID
: First, navigate to the working directory of the Stratify Classification Server that contains your classification model and taxonomy files. This directory is typically located at<Stratify Install Directory>\ClassificationServer\ClassificationServer\ClassificationServerWorkDir\Taxonomy-N
, where N is the number of the directory that contains the classification model you want to use with your Guided Search project. (Your environment may have multiple\Taxonomy-N
directories each containing different classification model and taxonomy files.) Second, note the number at the end of the of\Taxonomy-N
directory. This number is the value ofHIERARCHY_ID
. For example, if the classification model you want to use is stored in...\Taxonomy-2
, thenHIERARCHY_ID
should haveVALUE="2"
.IDENTIFIER_PROP_NAME
- Specifies the unique ID for the Guided Search record being processed. The default isEndeca.Identifier
.BODY_PROP_NAME
- Specifies the property that the Stratify Classification Server examines to classify the document. The default property isEndeca.Document.Body
. You can provide eitherEndeca.Document.Body
orEndeca.Document.Text
. However, specifyingEndeca.Document.Body
provides better classification because Forge can send the document to Stratify Classification Server as an attachment, and Stratify Classification Server can use the attachment to determine structural information of the document that aids in classification. If you specifyEndeca.Document.Text
, Forge sends the converted text of the document without any of its structural information.
See the
EXPRESSION
element for DTD and attribute information.
This example connects to the indicated Stratify Classification Server
and requests that it classify the document indicated by
Endeca.Document.Body
using against hierarchy ID 1.
<EXPRESSION NAME="STRATIFY" TYPE="VOID" > <EXPRNODE NAME="STRATIFY_HOST" VALUE="10.0.0.999"/> <EXPRNODE NAME="STRATIFY_PORT" VALUE="7021"/> <EXPRNODE NAME="HIERARCHY_ID" VALUE="1"/> <EXPRNODE NAME="IDENTIFIER_PROP_NAME" VALUE="Endeca.Identifier"/> <EXPRNODE NAME="BODY_PROP_NAME" VALUE="Endeca.Document.Body"/>