[/map {"- map/map "}) [/map/title {"- topic/title "}) EnhancingText_unit (title] [/map/topicref {"- map/topicref "}) [/map/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicmeta/navtitle {"- topic/navtitle "}) Enhancing Text (navtitle][/map/topicref/topicmeta/linktext {"- map/linktext "}) Enhancing Text (linktext][/map/topicref/topicmeta/shortdesc {"- map/shortdesc "}) Integrator ETL provides two options for enhancing text before loading it to an Endeca Server data domain: text enrichment and text tagging.You can also detect the language of input text fields. (shortdesc] (topicmeta][/map/topicref/topicref {"- map/topicref "}) [/map/topicref/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicref/topicmeta/navtitle {"- topic/navtitle "}) Choosing Text Enrichment or Text Tagging (navtitle][/map/topicref/topicref/topicmeta/linktext {"- map/linktext "}) Choosing Text Enrichment or Text Tagging (linktext][/map/topicref/topicref/topicmeta/shortdesc {"- map/shortdesc "}) Use these guidelines to choose between text enrichment and text tagging. (shortdesc] (topicmeta] (topicref][/map/topicref/topicref {"- map/topicref "}) [/map/topicref/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicref/topicmeta/navtitle {"- topic/navtitle "}) Using Text Tagger components (navtitle][/map/topicref/topicref/topicmeta/linktext {"- map/linktext "}) Using Text Tagger components (linktext][/map/topicref/topicref/topicmeta/shortdesc {"- map/shortdesc "}) Use Text Tagger components to enhance your data with tags. (shortdesc] (topicmeta][/map/topicref/topicref/topicref {"- map/topicref "}) [/map/topicref/topicref/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicref/topicref/topicmeta/navtitle {"- topic/navtitle "}) Overwriting and appending target values (navtitle][/map/topicref/topicref/topicref/topicmeta/linktext {"- map/linktext "}) Overwriting and appending target values (linktext][/map/topicref/topicref/topicref/topicmeta/shortdesc {"- map/shortdesc "}) The Overwrite Target Field configuration property of the components determines how tags are written to the output field. (shortdesc] (topicmeta] (topicref][/map/topicref/topicref/topicref {"- map/topicref "}) [/map/topicref/topicref/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicref/topicref/topicmeta/navtitle {"- topic/navtitle "}) Text Tagger input and output metadata (navtitle][/map/topicref/topicref/topicref/topicmeta/linktext {"- map/linktext "}) Text Tagger input and output metadata (linktext][/map/topicref/topicref/topicref/topicmeta/shortdesc {"- map/shortdesc "}) Output metadata fields for the Text Tagger components need not be in the same order as input metadata fields so long as the name of the field in the output metadata is the same as the name of the field on the input metadata. (shortdesc] (topicmeta] (topicref][/map/topicref/topicref/topicref {"- map/topicref "}) [/map/topicref/topicref/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicref/topicref/topicmeta/navtitle {"- topic/navtitle "}) Using the Text Tagger Whitelist component (navtitle][/map/topicref/topicref/topicref/topicmeta/linktext {"- map/linktext "}) Using the Text Tagger Whitelist component (linktext][/map/topicref/topicref/topicref/topicmeta/shortdesc {"- map/shortdesc "}) The Text Tagger Whitelist component adds tags based on a white list of terms to match. (shortdesc] (topicmeta][/map/topicref/topicref/topicref/topicref {"- map/topicref "}) [/map/topicref/topicref/topicref/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicref/topicref/topicref/topicmeta/navtitle {"- topic/navtitle "}) Text Tagger Whitelist input tags-rules (navtitle][/map/topicref/topicref/topicref/topicref/topicmeta/linktext {"- map/linktext "}) Text Tagger Whitelist input tags-rules (linktext][/map/topicref/topicref/topicref/topicref/topicmeta/shortdesc {"- map/shortdesc "}) The input for the Text Tagger Whitelist component uses a fixed metadata schema and a specific ordering. (shortdesc] (topicmeta] (topicref][/map/topicref/topicref/topicref/topicref {"- map/topicref "}) [/map/topicref/topicref/topicref/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicref/topicref/topicref/topicmeta/navtitle {"- topic/navtitle "}) Adding the Text Tagger Whitelist component to a graph (navtitle][/map/topicref/topicref/topicref/topicref/topicmeta/linktext {"- map/linktext "}) Adding the Text Tagger Whitelist component to a graph (linktext][/map/topicref/topicref/topicref/topicref/topicmeta/shortdesc {"- map/shortdesc "}) This topic describes the requirements for adding the Text Tagger Whitelist component to a graph. (shortdesc] (topicmeta][/map/topicref/topicref/topicref/topicref/topicref {"- map/topicref "}) [/map/topicref/topicref/topicref/topicref/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicref/topicref/topicref/topicref/topicmeta/navtitle {"- topic/navtitle "}) Search term lengths (navtitle][/map/topicref/topicref/topicref/topicref/topicref/topicmeta/linktext {"- map/linktext "}) Search term lengths (linktext][/map/topicref/topicref/topicref/topicref/topicref/topicmeta/shortdesc {"- map/shortdesc "}) The Search Term Maximum Characters Length field of the Text Tagger Whitelist component specifies the maximum length, in characters, of values in the SearchTerm property. (shortdesc] (topicmeta] (topicref] (topicref][/map/topicref/topicref/topicref/topicref {"- map/topicref "}) [/map/topicref/topicref/topicref/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicref/topicref/topicref/topicmeta/navtitle {"- topic/navtitle "}) Text Tagger Whitelist edges (navtitle][/map/topicref/topicref/topicref/topicref/topicmeta/linktext {"- map/linktext "}) Text Tagger Whitelist edges (linktext][/map/topicref/topicref/topicref/topicref/topicmeta/shortdesc {"- map/shortdesc "}) The Text Tagger Whitelist component can use basic edges unless the source or target components require a different edge. (shortdesc] (topicmeta] (topicref] (topicref][/map/topicref/topicref/topicref {"- map/topicref "}) [/map/topicref/topicref/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicref/topicref/topicmeta/navtitle {"- topic/navtitle "}) Using the Text Tagger Regex component (navtitle][/map/topicref/topicref/topicref/topicmeta/linktext {"- map/linktext "}) Using the Text Tagger Regex component (linktext][/map/topicref/topicref/topicref/topicmeta/shortdesc {"- map/shortdesc "}) The Text Tagger Regex component uses regular expressions (regex) both to search for matches and to render output. (shortdesc] (topicmeta][/map/topicref/topicref/topicref/topicref {"- map/topicref "}) [/map/topicref/topicref/topicref/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicref/topicref/topicref/topicmeta/navtitle {"- topic/navtitle "}) Text Tagger Regex input patterns file (navtitle][/map/topicref/topicref/topicref/topicref/topicmeta/linktext {"- map/linktext "}) Text Tagger Regex input patterns file (linktext][/map/topicref/topicref/topicref/topicref/topicmeta/shortdesc {"- map/shortdesc "}) The input for the Text Tagger Regex component uses a fixed metadata schema and a specific ordering. (shortdesc] (topicmeta] (topicref][/map/topicref/topicref/topicref/topicref {"- map/topicref "}) [/map/topicref/topicref/topicref/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicref/topicref/topicref/topicmeta/navtitle {"- topic/navtitle "}) Adding Text Tagger Regex to a graph (navtitle][/map/topicref/topicref/topicref/topicref/topicmeta/linktext {"- map/linktext "}) Adding Text Tagger Regex to a graph (linktext][/map/topicref/topicref/topicref/topicref/topicmeta/shortdesc {"- map/shortdesc "}) The Text Tagger Regex component requires two inputs: the source data to search for matches and the input patterns to search and render. (shortdesc] (topicmeta] (topicref][/map/topicref/topicref/topicref/topicref {"- map/topicref "}) [/map/topicref/topicref/topicref/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicref/topicref/topicref/topicmeta/navtitle {"- topic/navtitle "}) Text Tagger Regex edges (navtitle][/map/topicref/topicref/topicref/topicref/topicmeta/linktext {"- map/linktext "}) Text Tagger Regex edges (linktext][/map/topicref/topicref/topicref/topicref/topicmeta/shortdesc {"- map/shortdesc "}) The Text Tagger Regex component uses a basic input edge unless the source component requires a different edge. (shortdesc] (topicmeta] (topicref] (topicref] (topicref][/map/topicref/topicref {"- map/topicref "}) [/map/topicref/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicref/topicmeta/navtitle {"- topic/navtitle "}) Using Text Enrichment (navtitle][/map/topicref/topicref/topicmeta/linktext {"- map/linktext "}) Using Text Enrichment (linktext][/map/topicref/topicref/topicmeta/shortdesc {"- map/shortdesc "}) The Text Enrichment component provides the ability to extract and assess free-form text data. (shortdesc] (topicmeta][/map/topicref/topicref/topicref {"- map/topicref "}) [/map/topicref/topicref/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicref/topicref/topicmeta/navtitle {"- topic/navtitle "}) Text Enrichment prerequisites (navtitle][/map/topicref/topicref/topicref/topicmeta/linktext {"- map/linktext "}) Text Enrichment prerequisites (linktext][/map/topicref/topicref/topicref/topicmeta/shortdesc {"- map/shortdesc "}) The Text Enrichment component requires the Salience Engine and a properties file, in addition to the input source text to process.If you want to use the query topics feature, you also need a query topics properties file; if you want to use normalized themes, you need a normalization.dat file. (shortdesc] (topicmeta] (topicref][/map/topicref/topicref/topicref {"- map/topicref "}) [/map/topicref/topicref/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicref/topicref/topicmeta/navtitle {"- topic/navtitle "}) Text Enrichment properties file (navtitle][/map/topicref/topicref/topicref/topicmeta/linktext {"- map/linktext "}) Text Enrichment properties file (linktext][/map/topicref/topicref/topicref/topicmeta/shortdesc {"- map/shortdesc "}) The Text Enrichment Properties file defines the configuration of the Salience Engine for the Text Enrichment component instance.All instances of the component can use the same properties file, or you can use different properties files to support different instances of the component. (shortdesc] (topicmeta] (topicref][/map/topicref/topicref/topicref {"- map/topicref "}) [/map/topicref/topicref/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicref/topicref/topicmeta/navtitle {"- topic/navtitle "}) Query topics definition file (navtitle][/map/topicref/topicref/topicref/topicmeta/linktext {"- map/linktext "}) Query topics definition file (linktext][/map/topicref/topicref/topicref/topicmeta/shortdesc {"- map/shortdesc "}) The query topics definition file defines the topics you want to use to tag your text, and the queries used to evaluate whether to add the topic to an Endeca record. (shortdesc] (topicmeta] (topicref][/map/topicref/topicref/topicref {"- map/topicref "}) [/map/topicref/topicref/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicref/topicref/topicmeta/navtitle {"- topic/navtitle "}) Adding the Text Enrichment component to a graph (navtitle][/map/topicref/topicref/topicref/topicmeta/linktext {"- map/linktext "}) Adding the Text Enrichment component to a graph (linktext][/map/topicref/topicref/topicref/topicmeta/shortdesc {"- map/shortdesc "}) Before adding a Text Enrichment component to a graph, be sure you have created a Text Enrichment properties file.Also be sure you know the location of the Salience data directory. (shortdesc] (topicmeta] (topicref][/map/topicref/topicref/topicref {"- map/topicref "}) [/map/topicref/topicref/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicref/topicref/topicmeta/navtitle {"- topic/navtitle "}) Text Enrichment component edges (navtitle][/map/topicref/topicref/topicref/topicmeta/linktext {"- map/linktext "}) Text Enrichment component edges (linktext][/map/topicref/topicref/topicref/topicmeta/shortdesc {"- map/shortdesc "}) The Text Enrichment component only requires a basic edge for input.The output edge, however, must include all the fields from the input, plus all fields added by the Text Enrichment component. (shortdesc] (topicmeta][/map/topicref/topicref/topicref/topicref {"- map/topicref "}) [/map/topicref/topicref/topicref/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicref/topicref/topicref/topicmeta/navtitle {"- topic/navtitle "}) Creating metadata for an example Text Enrichment edge (navtitle][/map/topicref/topicref/topicref/topicref/topicmeta/linktext {"- map/linktext "}) Creating metadata for an example Text Enrichment edge (linktext][/map/topicref/topicref/topicref/topicref/topicmeta/shortdesc {"- map/shortdesc "}) This example assumes you have already created an edge to join the Text Enrichment component to the next component in the graph. (shortdesc] (topicmeta] (topicref] (topicref][/map/topicref/topicref/topicref {"- map/topicref "}) [/map/topicref/topicref/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicref/topicref/topicmeta/navtitle {"- topic/navtitle "}) Processing text formatted in all caps (navtitle][/map/topicref/topicref/topicref/topicmeta/linktext {"- map/linktext "}) Processing text formatted in all caps (linktext][/map/topicref/topicref/topicref/topicmeta/shortdesc {"- map/shortdesc "}) To ensure correct processing of text formatted in all caps, use the setFlattenAllUpperCase property. (shortdesc] (topicmeta] (topicref][/map/topicref/topicref/topicref {"- map/topicref "}) [/map/topicref/topicref/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicref/topicref/topicmeta/navtitle {"- topic/navtitle "}) Normalizing themes (navtitle][/map/topicref/topicref/topicref/topicmeta/linktext {"- map/linktext "}) Normalizing themes (linktext][/map/topicref/topicref/topicref/topicmeta/shortdesc {"- map/shortdesc "}) You can normalize a number of discovered themes into a single reported theme. (shortdesc] (topicmeta] (topicref][/map/topicref/topicref/topicref {"- map/topicref "}) [/map/topicref/topicref/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicref/topicref/topicmeta/navtitle {"- topic/navtitle "}) Adding foreign language processing to Text Enrichment (navtitle][/map/topicref/topicref/topicref/topicmeta/linktext {"- map/linktext "}) Adding foreign language processing to Text Enrichment (linktext][/map/topicref/topicref/topicref/topicmeta/shortdesc {"- map/shortdesc "}) The Salience Engine supports text enrichment in French, German, Portuguese, and Spanish. (shortdesc] (topicmeta][/map/topicref/topicref/topicref/topicref {"- map/topicref "}) [/map/topicref/topicref/topicref/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicref/topicref/topicref/topicmeta/navtitle {"- topic/navtitle "}) Processing multiple languages (navtitle][/map/topicref/topicref/topicref/topicref/topicmeta/linktext {"- map/linktext "}) Processing multiple languages (linktext][/map/topicref/topicref/topicref/topicref/topicmeta/shortdesc {"- map/shortdesc "}) When processing text from multiple languages, add a separate instance of the Text Enrichment component for each language. (shortdesc] (topicmeta] (topicref] (topicref][/map/topicref/topicref/topicref {"- map/topicref "}) [/map/topicref/topicref/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicref/topicref/topicmeta/navtitle {"- topic/navtitle "}) Adding Twitter processing to Text Enrichment (navtitle][/map/topicref/topicref/topicref/topicmeta/linktext {"- map/linktext "}) Adding Twitter processing to Text Enrichment (linktext][/map/topicref/topicref/topicref/topicmeta/shortdesc {"- map/shortdesc "}) The Salience Engine supports enrichment of content derived from Twitter. (shortdesc] (topicmeta] (topicref] (topicref][/map/topicref/topicref {"- map/topicref "}) [/map/topicref/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicref/topicmeta/navtitle {"- topic/navtitle "}) Detecting text language (navtitle][/map/topicref/topicref/topicmeta/linktext {"- map/linktext "}) Detecting text language (linktext][/map/topicref/topicref/topicmeta/shortdesc {"- map/shortdesc "}) Integrator ETL provides the ability to automatically detect the language of input text. (shortdesc] (topicmeta][/map/topicref/topicref/topicref {"- map/topicref "}) [/map/topicref/topicref/topicref/topicmeta {"- map/topicmeta "}) [/map/topicref/topicref/topicref/topicmeta/navtitle {"- topic/navtitle "}) Language Detector edges (navtitle][/map/topicref/topicref/topicref/topicmeta/linktext {"- map/linktext "}) Language Detector edges (linktext][/map/topicref/topicref/topicref/topicmeta/shortdesc {"- map/shortdesc "}) The Language Detector only requires a basic edge for input.The output edge, however, must include the output language field. (shortdesc] (topicmeta] (topicref] (topicref] (topicref] (map]