The Text Enrichment component provides the ability to extract and assess free-form text data.
The Text Enrichment component uses the Salience Engine from Lexalytics. Depending on your license, the Salience Engine may also provide the ability to assess the sentiment of the input text. Sentiment can be evaluated for the whole input (or document), the sentiment towards specific entities, or the sentiment towards specific themes.
Text Enrichment feature | Resulting information in the output record |
---|---|
Sentiment Analysis | An overall sentiment score for the current
document, for specific entities, or for specific themes. This functionality is
available by special license.
This feature can be enabled and disabled. |
Named Entities | A list of named entities in the current
document. You can specify which types of entities to extract. Supported entity
types include:
The output record includes one column per type. Each column can contain multiple values. If Sentiment Analysis is enabled, the entities are added to different groups based on their sentiment scores. You must specify the ranges for the entity sentiment scores. The output record includes one column per range and each column can contain multiple values. This feature can be enabled or disabled. |
Themes | A list of themes in the document. All
meta-themes are added to the output record in a field you specify.
For any theme that is not a meta-theme, if the theme score is
higher than a user-specified threshold, then:
This feature can be enabled or disabled. |
Quotations | A list of quotes in the document, with an
attribution to the speaker. You can specify the maximum length of quotes and
the name of the field/property in the output record.
This feature can be enabled or disabled, |
Document Summary | A shortened version of the input content that
best represents the whole content in a limited number of words.
This feature is always enabled. It cannot be disabled. |
Although both sources are aimed at a developer audience, they can provide useful information for Integrator ETL users who are implementing the Text Enrichment feature.