The following table summarizes the user-configurable parameters described in the above process:
Parameter | Description |
---|---|
hthresh | Specifies the threshold number of hits at or above which spelling suggestions will not be generated. That is, at or above this threshold, the spelling feature is disabled, allowing correctly spelled queries to return only actual (non-spell-corrected) results. Results that don’t match all query terms don’t count toward the hthresh threshold. For example, if you have 1000 results which are all partial matches (they match only a proper subset of the query terms) and |
nsug | Specifies the maximum number of alternate spelling queries to generate for a single user search query. |
sthresh | Specifies the threshold spelling similarity score for words considered by the spelling correction engine. Scores are based on a scale where 100 points correspond approximately to an edit distance of 1. The cost associated with correcting a query term is higher if the term corresponds to an actual word in the data. That is, correcting modem to model is considered a more significant edit than correcting modek to model, if modem occurs as a word in the data but modek does not. The threshold applies to the entire query; for multi-word queries, the edit scores associated with correcting multiple words are added together, and the sum cannot exceed the threshold. For details about the interaction of the |
glom | Specifies that cross-property matches are considered valid when scoring replacement queries. By default, hits that result from applying some query terms to one text field on a record and other terms to a different text field are not counted. In some cases, these results are desirable and should be considered when computing spelling suggestions. |
nobrk | Specifies that the word-break analysis portion of the spelling correction process described above is disabled. |

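
The two threshold checks in the table can be sketched as follows. This is only an illustration of the rules as stated above, not the engine's implementation: the function names and the numeric values in the usage note are hypothetical, and the scores are assumed to be already computed on the scale where 100 points is roughly one edit.

```python
def suggestions_enabled(full_match_hits: int, hthresh: int) -> bool:
    """Return True when spelling suggestions should still be generated.

    Only hits that match all query terms are counted; partial matches
    do not count toward hthresh. Suggestions stop at or above hthresh.
    """
    return full_match_hits < hthresh


def within_sthresh(word_edit_scores: list[int], sthresh: int) -> bool:
    """Return True when a candidate correction is allowed by sthresh.

    The threshold applies to the whole query, so the per-word edit
    scores are added together and the sum cannot exceed sthresh.
    """
    return sum(word_edit_scores) <= sthresh
```

For example, with a hypothetical sthresh of 125, `within_sthresh([100], 125)` holds for a single one-edit correction, while `within_sthresh([100, 100], 125)` does not, because the two corrections together score 200.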
Each of these parameters can be specified independently for each of the spelling correction features:
- For record and dimension search auto-correct, the --spl_ prefix is used (for example, --spl_nsug). The flag --spl by itself enables auto-suggest spelling corrections for record search and dimension search.
- For explicit suggestions, the --dym_ prefix is used (for example, --dym_nsug). The flag --dym by itself enables Did You Mean explicit query spelling suggestions for record search queries.
- For parameters that apply to all of the above, the --spell_ prefix is used.
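
The naming convention above (feature prefix plus parameter name) can be sketched as a small helper. The feature keys (`autocorrect`, `did_you_mean`, `all`) and the function name are hypothetical labels chosen for this example. The helper only composes names and does not check whether the engine accepts a given prefix and parameter combination.

```python
# Prefixes taken from the text above; the dictionary keys are
# illustrative labels, not names used by the product.
PREFIXES = {
    "autocorrect": "--spl_",    # record and dimension search auto-correct
    "did_you_mean": "--dym_",   # explicit "Did You Mean" suggestions
    "all": "--spell_",          # parameters that apply to all features
}

# Parameter names from the table above.
PARAMETERS = {"hthresh", "nsug", "sthresh", "glom", "nobrk"}


def flag_name(feature: str, parameter: str) -> str:
    """Compose a flag name from a feature prefix and a parameter name."""
    if feature not in PREFIXES:
        raise ValueError(f"unknown feature: {feature!r}")
    if parameter not in PARAMETERS:
        raise ValueError(f"unknown parameter: {parameter!r}")
    return PREFIXES[feature] + parameter
```

With this helper, `flag_name("autocorrect", "nsug")` yields `--spl_nsug` and `flag_name("did_you_mean", "nsug")` yields `--dym_nsug`, matching the examples in the list.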
For additional configuration of the word-break analysis feature (beyond disabling it with --spell_nobrk), use the following --wb_ flags:

- --wb_noibrk disables the insertion of breaks in word-break analysis.
- --wb_norbrk disables the removal of breaks in word-break analysis.
- --wb_maxbrks specifies the maximum number of word breaks to be added to or removed from a query. The default is one.
- --wb_minbrklen specifies the minimum length of a new term created by word-break analysis. The default is two.
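
To illustrate how the maximum-breaks and minimum-length limits constrain break insertion, here is a rough sketch that enumerates the ways to split a single term. It is not the engine's algorithm. The function name is hypothetical, the sketch assumes the minimum length applies to every piece produced by a split, and it covers break insertion only, not break removal.

```python
def split_candidates(term: str, max_breaks: int = 1, min_len: int = 2) -> list[list[str]]:
    """Enumerate splits of `term` using 1..max_breaks inserted breaks.

    Every piece must be at least `min_len` characters long. The defaults
    mirror the documented defaults of one break and a length of two.
    """
    results: list[list[str]] = []

    def recurse(rest: str, pieces: list[str], breaks_left: int) -> None:
        # Try each break position that leaves both sides long enough.
        for i in range(min_len, len(rest) - min_len + 1):
            head, tail = rest[:i], rest[i:]
            results.append(pieces + [head, tail])
            if breaks_left > 1:
                # Spend another break on the remainder of the term.
                recurse(tail, pieces + [head], breaks_left - 1)

    if max_breaks >= 1:
        recurse(term, [], max_breaks)
    return results
```

Under these assumptions, a five-letter term such as `abcde` has two candidates with the defaults (`ab` + `cde` and `abc` + `de`), and a three-letter term has none, because any split would create a piece shorter than two characters.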