Categories of characters in indexed text

The Oracle Endeca Server treats characters in indexed text based on three categories.

The categories are:

During data processing, each word in the source text (that is, searchable attributes for record search, attribute values for value search) is indexed based on the alternatives for handling characters from the three categories, which is described in subsequent topics.