Oracle Commerce Guided Search - Managing Text in Different Languages

Use this Spelling Mode . . .	. . . for this language or type of language
aspell (the default)	English and similar languages for which sound-alike corrections can be made, using phonetic rules. aspell does not perform corrections to non-alphabetic/non-ASCII terms such as café, 1234, or A&M.
espell	Non-English words or terms that are not words, such as part numbers; performs non-phonetic (edit-distance-based) corrections.
aspell_OR_espell	Languages that include both ASCII and non-ASCII characters and phrases. Aspell corrects ASCII words and Espell corrects other words.
aspell_AND_espell	Both modules suggest corrections and the user selects the best selection from the union of results.
disable	Chinese or other languages that use non-alphabetic scripts to which the concept of spelling does not apply.

For example, to select the espell mode, use the following command:

dgidx --spellmode espell

You can discover which spelling mode works best for an alphabetic language other than English by testing the following spelling modes with data in that language: espell, aspell, aspell_OR_espell, and aspell_AND_espell.

Note

In some cases, you may find it easier to create a separate Oracle Commerce Guided Search application for each language that you are targeting, rather than configuring a single application to manage all languages. For information about the advantages and disadvantages of each approach, see How Many MDEX Engines Do I Need?.

Specifying a Correction Mode in a Configuration File

Follow these steps to specify a correction mode in a configuration file. If such a configuration file exists, it overrides any parameter specified in the dgidx –-spellmode option.

Using any standard text editor, create a file that contains the following text:

<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE SPELL_CONFIG SYSTEM "spell_config.dtd.">
<SPELL_CONFIG>
	<SPELL_ENGINE>
		<DICT_PER_LANGUAGE>
			<ESPELL/>
		</DICT_PER_LANGUAGE>
	</SPELL_ENGINE>
</SPELL_CONFIG>

Save the file as <app name>_prefix.spell_config.xml.

For more information about the structure of a spell_config.xml file, refer to the Platform Services XML Reference. See also the spell_config.dtd in the MDEX Engine conf/dtd directory.
Store the file in the directory where you store your project's other XML instance configuration files.
Run a baseline update and restart the MDEX Engine with the new configuration file.

Stop Words in an Internationalized Application

A stop word is a commonly used word, such as "the", that a search engine has been programmed to ignore. Each MDEX Engine has only one stop word list. As a result, each stop word will be used for all records processed by the MDEX Engine, whatever their languages.

Thus, if you are using a single MDEX Engine for more than one language, provide a separate version of each stop word for each of the languages that your application supports.

Before you specify a stop word in one language, make sure that it does not appear with the same spelling but a different meaning in the other languages that your application supports. English and French in particular share many such "false cognates." For example, the French word for tea, "thé", can be mistaken for the English word "the", which is commonly designated as a stop word.

Merchandising Keyword Triggers

Keyword redirects send a user's search to a Web page (that is, to a URL).

Like dynamic business rules, keyword redirects use trigger and target values. The user's search is redirected if it contains a keyword (the trigger), and you have provided a rule that redirects any search containing that keyword to a particular URL (the target). These features are applied after navigation filtering.

If your application supports multiple languages and you intend to use a given keyword trigger in each language, you must create a separate rule for the keyword trigger in each language.

For example, if the word "pants" (English) triggers a rule, and the same rule should apply to queries in French and Spanish, then two other rules must be created: one triggered by "pantalones" (Spanish) and one triggered by "pantalon" (French).

For detailed information about how create keyword triggers, refer to the MDEX Engine Developer's Guide.

Guided Search Internationalization Guide