You can optionally add an auxiliary dictionary to supplement the primary OLT dictionary for any supported language. This may be necessary if searches for terms that exist in your data are not producing the expected results.
The auxiliary dictionary is a UTF-8 encoded file that is line oriented and tab delimited. Each line in the file represents an entry to supplement the primary dictionary.
Entries to an auxiliary dictionary are of the following form:
COMMAND value1 value2 ...
Specify
STEM
or
COMPOUND
for
COMMAND>
.
Using the STEM command
Each line beginning with
STEM
includes a term that represents the uninflected
stem (or, lemma) of a word, and one or more attributes that identify the
part(s) of speech (POS) of the word. The POS attributes must be separated from
each other by commas (with no spaces). The command name STEM and the new term
must be separated from each other and from the POS attbributes by tabs.
STEM new_term1 POS,POS,POS,... STEM new_term2 POS,POS,POS,... STEM new_term3 POS,POS,POS,...
The POS attributes enable Guided Search to identify the possible inflectional endings of the new term in its given language.
You can specify the part of speech (POS) attributes by their full names or by abbreviations of their names (listed here in parentheses):
nounProper (propN)
- A proper name of a person, a place, and so on, that is typically capitalized, such as Supidito, Tendulkar, Zachary.verb (V)
- Any verb in its dictionary form, such as deconstruct, upsell, or skateadjective (Adj)
- Modifiers of nouns, typically can be compared (green, greener, greenest), such as fast, trenchant, pendulousadverb (Adv)
- Any general modifier of a sentence that may modify an adjective or verb or may stand alone, such as slowly, yet, perhapspreposition (Prep)
- A word that forms a prepositional phrase with a noun, such as off, beside, from. Also used for postpositions in languages that have postpositions of similar function.punct (Punct)
- Any non-letter symbol that is treated as a unit by itself, such as %, $, ]pronoun (Pro)
- Any pronominal form, including personal pronouns (I, they), demonstrative pronouns (those, this), relative pronouns (who, which, wherever)interrog (Wh)
- An interrogative word, such as who, why, when, where, howdeterminer (Det)
- Words that carry grammatical information about a noun group, for example definite/indefinite, such as the, a, anparticle (Part)
- Small, invariant words that convey grammatical information; also used for interjections.conjunction (Conj)
- Conjunctions that introduce subordinate clauses, such as although, because, while; and conjunctions that introduce coordinate clauses, such as and, or, yetnumCardinal (Card)
- Cardinal numbers, like thirteen, 100, fivenumOrdinal (Ord)
- Ordinal numbers, like thirteenth, 100th, fifth
For example, the following German auxiliary dictionary shows three
entries. Each entry is marked with the attribute
N
to indicate it is a noun:
STEM aalglatt N STEM ausrüster N STEM verdränger N