Two pass-through parameters set the language ID of input records on a global and per-record basis.

Use the LANG pass-through parameter to specify the global language ID of the input records, and the LANG_PROP_NAME pass-through parameter to specify a per-record language ID. The language ID is not case sensitive for either parameter. For example, you can specify either EN or en for English (American).

Note that the LANG_PROP_NAME value takes precedence. If it is not present, the value of LANG is used as the language of the record.
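
The precedence rule can be sketched in Python. This is only an illustration of the lookup order, not the term extractor's actual implementation: the function name effective_language, the dict-based record, and the property name Language are hypothetical, and lowercasing is simply one way to model the case-insensitive language IDs.

```python
def effective_language(record, lang, lang_prop_name=None):
    """Return the language ID that applies to a single record.

    Hypothetical helper: `record` is a plain dict mapping property
    names to values; `lang` and `lang_prop_name` stand in for the
    LANG and LANG_PROP_NAME pass-through values.
    """
    # The per-record property wins when it is configured and present.
    if lang_prop_name is not None:
        value = record.get(lang_prop_name)
        if value:
            return value.lower()
    # Otherwise fall back to the global LANG value.
    return lang.lower()


print(effective_language({"Language": "FR"}, "en", "Language"))   # fr
print(effective_language({"Title": "Hello"}, "en", "Language"))   # en
```

Normalizing to lowercase means EN and en resolve to the same ID.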

Both pass-through parameters can be specified in the CAS manipulator.

The LANG_PROP_NAME pass-through specifies the name of the record property that contains the language ID for that record. If you do not specify this pass-through, the language ID for each record defaults to the value of the LANG pass-through. For example, if the value for LANG is en-GB, then the term extractor assumes that all the records are in English (UK).

If you do specify the LANG_PROP_NAME pass-through, the term extractor will evaluate each record as follows:

If you have documents in multiple languages, the LANG_PROP_NAME pass-through is useful to ensure that only records in the desired language (the LANG setting) are processed by the term extractor.
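
As a rough sketch of this filtering effect, the Python below selects from a mixed-language set only the records whose language ID matches LANG. It is an illustration under stated assumptions, not the product's code: the function records_to_process, the dict-based records, and the property names Id and Language are hypothetical, and a record without the language property is assumed to fall back to LANG.

```python
def records_to_process(records, lang, lang_prop_name):
    """Keep only records whose language ID matches LANG.

    Hypothetical helper: each record is a plain dict; the comparison
    is case-insensitive.
    """
    target = lang.lower()
    selected = []
    for record in records:
        # Assumption: a record without the property is treated as LANG.
        record_lang = str(record.get(lang_prop_name) or lang).lower()
        if record_lang == target:
            selected.append(record)
    return selected


records = [
    {"Id": 1, "Language": "EN"},   # matches LANG, case-insensitively
    {"Id": 2, "Language": "fr"},   # different language, left out
    {"Id": 3},                     # no property, falls back to LANG
]
kept = records_to_process(records, "en", "Language")
print([r["Id"] for r in kept])  # [1, 3]
```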

