Oracle Commerce Guided Search - Configuring Search Interface Options

Field	Description
Name	A unique name for this search interface. Note: A search interface cannot share a name with a dimension value or a property.
Members	The dimensions and Endeca properties that make up this search interface.
Ranking Strategy	The ranking modules associated with this search interface.

Creating search interfaces

Create new search interfaces in the Search Interface editor, which you open from the Search Interfaces view.

To create a new search interface:

On the Project tab, double-click Search Interfaces to open the Search Interfaces view.
In the Search Interfaces view, click New.
The Search Interface editor appears.
In the Name box, type the name of the new search interface.
From the Allow Cross-field Matches list, choose Always, Never, or On Failure. The Allow Cross-field Matches option specifies when the MDEX Engine should try to match search queries across dimension or property boundaries, but within the members of the search interface. There are three possible values:
- Always—the MDEX Engine always looks for matches across dimension or property boundaries, in addition to matches within a dimension or property. This is the default value.
- Never—the MDEX Engine does not look across dimension or property boundaries for matches.
- On Failure—the MDEX Engine only tries to match queries across dimension or property boundaries if it fails to find any match within a single dimension or property.
In the All (Searchable) Members list, select a member and click Add to add it to the Selected Members list. Repeat as many times as necessary to add additional members to the search interface.
Only Endeca properties and dimensions that have their Enable record search option checked appear in this list.
(Optional) If you want to associate a relevance ranking strategy with the search interface, click Relevance Ranking Modules.
(Optional) If you want to make more detailed adjustments to the search interface, click Options and configure Customize Partial Match Settings, which specifies whether partial matches for search terms are supported for this search interface.

Implementing search features requires additional work outside of Developer Studio. Please refer to the Endeca Basic Development Guide for details.

Modifying search interfaces

Select a search interface from the Search Interfaces view to change it in the editor.

To edit a search interface:

In the Search Interfaces view, double-click the name of the search interface you want to modify to open it in the Search Interface editor. (You can open only one search interface at a time.)
Make the necessary changes to the search interface.
Click OK to return to the Search Interfaces view.

Deleting search interfaces

Remove search interfaces from the Search Interfaces view.

To delete a search interface:

In the Search Interfaces view, select the search interface you want to remove and click Delete.
When the confirmation message appears, click Yes.

Search Interface editor

You use the Search Interface editor to create a new search interface or modify the attributes of an existing one.

The Search Interface editor contains the following fields:

Option	Description
Name	A unique name for this search interface. Search interface names are case sensitive.
Allow cross-field matches	Specifies when the MDEX Engine should try to match search queries across dimension or property boundaries, but within the members of the search interface. There are three possible values: Always—the MDEX Engine always looks for matches across dimension or property boundaries, in addition to matches within a dimension or property. This is the default value. Never—the MDEX Engine does not look across boundaries for matches. On Failure—the MDEX Engine only tries to match queries across dimension or property boundaries if it fails to find any match within a single dimension or property.
All (searchable) members	A list of all dimensions and Endeca properties in the project that have the "Enable record search" field checked.
Selected members	A list of the searchable dimensions and properties that have been added to this search interface.
Relevance Ranking Modules	Clicking this button opens the Relevance Ranking Modules editor, where you can add, remove, and order the relevance ranking modules that compose the ranking strategy associated with this search interface.
Options	Clicking this button opens the Search Interface Options editor, where you can make more detailed adjustments to this search interface.
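
The naming rules for these fields can be pictured with a small data model. The following Python sketch is illustrative only: the SearchInterface class, the validate_name helper, and the lowercase mode strings are hypothetical names and are not part of Developer Studio or any Endeca API. It assumes that "case sensitive" means two names that differ only in case are treated as different names.

```python
from dataclasses import dataclass, field


@dataclass
class SearchInterface:
    # Hypothetical model of the editor fields described above.
    name: str
    members: list = field(default_factory=list)          # searchable dimensions and properties
    ranking_modules: list = field(default_factory=list)  # ordered ranking strategy
    cross_field: str = "always"                           # "always", "never", or "on_failure"


def validate_name(name, existing_interfaces, reserved_names):
    """Return True if `name` could be used for a new search interface.

    reserved_names: names of dimension values and properties (hypothetical input).
    Comparison is case sensitive (assumption: names differing only in case are distinct).
    """
    if not name:
        return False
    if name in reserved_names:
        return False
    return all(si.name != name for si in existing_interfaces)
```

For example, with an existing interface named AllFields, validate_name("AllFields", ...) is rejected while validate_name("allfields", ...) is accepted under the stated assumption.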

Allowing cross-field matches

The Allow Cross-field Matches option, in the Search Interface editor, specifies when the MDEX Engine should try to match search queries across dimension or property boundaries, but within the members of the search interface.

There are three possible values:

Always (default)
The MDEX Engine always looks for matches across dimension or property boundaries, in addition to matches within a dimension or property. This is the default.
Never
The MDEX Engine does not look across dimension or property boundaries for matches.
On Failure
The MDEX Engine only tries to match queries across dimension or property boundaries if it fails to find any matches within a single dimension or property.
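
The difference between the three values can be sketched with a toy matcher. This is a simplified model, not the MDEX Engine's implementation: records are plain dictionaries of member name to text, matching is whole-word and case-insensitive, and the function names and mode strings are hypothetical. It also assumes that On Failure is decided per query over the whole result set.

```python
def tokens(text):
    """Lowercased set of whitespace-separated words."""
    return set(text.lower().split())


def single_field_match(record, terms):
    """True if all query terms occur within one member of the record."""
    return any(terms <= tokens(value) for value in record.values())


def cross_field_match(record, terms):
    """True if all query terms occur somewhere across the record's members."""
    all_tokens = set().union(*(tokens(value) for value in record.values()))
    return terms <= all_tokens


def search(records, query, mode):
    terms = tokens(query)
    single = [r for r in records if single_field_match(r, terms)]
    if mode == "never":
        return single
    # Every single-field match is also a cross-field match in this model.
    cross = [r for r in records if cross_field_match(r, terms)]
    if mode == "always":
        return cross
    if mode == "on_failure":
        return single if single else cross
    raise ValueError(f"unknown mode: {mode}")
```

With one record holding Brand "Acme" and Title "Red Wine", and another holding Brand "Acme Red" and Title "Wine", the query "acme red" returns both records under Always but only the second under Never and On Failure.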

Customizing partial matching

This section describes customizations available in the Customize Partial Match Settings feature of Developer Studio.

The Customize Partial Match Settings feature specifies whether partial matches for search terms are supported for this search interface, and how many words can be matched or omitted.

Match at Least sets the minimum number of words that must be matched. The default is 2.
Omit at Most sets the maximum number of words that can be omitted. The default is 2.
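
As a rough illustration of how the two limits interact, the sketch below treats a record as a partial match only when both limits hold. This combination rule is an assumption made for the example, as is treating a record that matches every term as always acceptable; the function name and parameters are hypothetical.

```python
def is_partial_match(query_terms, record_terms, match_at_least=2, omit_at_most=2):
    """Decide whether a record qualifies under the two partial-match limits.

    Assumption: both limits must be satisfied at the same time.
    """
    query = set(query_terms)
    matched = len(query & set(record_terms))
    omitted = len(query) - matched
    if omitted == 0:
        return True  # every term matched, so this is a full match
    return matched >= match_at_least and omitted <= omit_at_most
```

With the defaults, a four-word query accepts a record that matches two of the words, but a five-word query rejects a record that matches only two, because three words would be omitted.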

To customize partial matching in a search interface:

In the Search Interface Options editor, check Customize Partial Match Settings.
The Match at Least ... Words and Omit at Most ... Words text boxes are each populated with 2 (the suggested value).
In the Match at Least ... Words text box, modify the minimum number of words that must match in order to consider a match.
You cannot enter 0 for this value.
In the Omit at Most ... Words text box, modify the maximum number of words that can be omitted in order to consider a match.

Implementing search features requires additional work outside of Developer Studio. Please refer to the Endeca Advanced Development Guide for details.

Using Snippeting

About snippeting

The snippeting feature (also referred to as keyword in context) provides the ability to return an excerpt from a record—a snippet—to an application user who performs a record search query. A snippet contains the search terms that the user provides along with a portion of the term’s surrounding content to provide context. A Web application displays these snippets on the record list page of a query’s results. With the added context, a user can more quickly choose the individual records they are interested in.

You enable snippeting on individual members (fields) in a search interface that typically have many lines of content. For example, fields such as Description, Abstract, DocumentBody, and so on are good candidates to provide snippeting results.

For example, if a user searches for intense in a wine catalog, the record list for this query has many records that match intense. A snippet for each matching record displays on the record list page.

Snippet format and size

A snippet consists of search terms, surrounding context words, and ellipses.

A snippet can contain any number of search terms bracketed by <endeca_term></endeca_term> tags. The tags call out search terms and allow you to more easily reformat the terms for display in your Web application.

The snippet size is the total number of search terms and surrounding context words. You can configure the total number of words in a snippet as described in Enabling snippeting. In order to adhere to the size setting for a snippet, it is possible that the MDEX Engine may omit some search terms and context words from a snippet. This situation becomes more likely if an application user provides a large number of search terms and the maximum snippet size is comparatively small.

A snippet consists of one or more segments. The segments are delimited by ellipses in between them. Ellipses (...) indicate that there is text omitted from the snippet occurring before or after the ellipses.

For example, here is a snippet made up of two segments with a maximum size set at 20 words. The snippet resulted from a search for the search terms Scotland and British which are enclosed within <endeca_term> tags.

...in Edinburgh <endeca_term>Scotland</endeca_term>, and has been employed by Ford for 25 years ... He first joined Ford's <endeca_term>British</endeca_term> operation. Mazda motor ...
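
A Web application typically converts the <endeca_term> tags into its own markup before display. The following Python sketch shows one way to do that; it is not part of any Endeca API. The highlight function name and the choice of <strong> as the display markup are assumptions. The snippet text is escaped first so that any other markup in the record content is not rendered.

```python
import html


def highlight(snippet):
    """Convert <endeca_term> tags to <strong> and escape everything else."""
    escaped = html.escape(snippet)
    return (escaped
            .replace("&lt;endeca_term&gt;", "<strong>")
            .replace("&lt;/endeca_term&gt;", "</strong>"))
```

Escaping before replacing keeps the search-term tags as the only markup that survives into the page.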

Snippet property names

The MDEX Engine dynamically creates new snippet properties by appending .Snippet to the original name of the search interface members (fields) that you enabled for snippeting.

For example, if you enable snippeting for properties named Description and Reviews, the MDEX Engine creates new properties named Description.Snippet and Reviews.Snippet and returns these properties with the result set for a user's record search.
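
The naming convention makes it straightforward for application code to look up a snippet and fall back to the full value when none was returned. In this sketch, a record's properties are modeled as a plain dictionary, and both function names are hypothetical; only the .Snippet suffix comes from the description above.

```python
SNIPPET_SUFFIX = ".Snippet"


def snippet_property_name(member_name):
    """Name of the dynamically generated snippet property for a member."""
    return member_name + SNIPPET_SUFFIX


def text_for_display(record_properties, member_name):
    """Prefer the snippet when present; otherwise use the original value."""
    return record_properties.get(
        snippet_property_name(member_name),
        record_properties.get(member_name, ""),
    )
```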

Snippets are dynamically generated properties

The snippet property appears with a record only on a record list page.

It is important to emphasize that the MDEX Engine dynamically generates snippet properties. This means the snippet properties, unlike other Endeca properties, are not created, configured, or mapped using Developer Studio. A dynamically generated snippet property is not tagged to an Endeca record.

Enabling snippeting

You enable the snippeting feature in the Member Options dialog box, which is accessed from the Search Interface editor. Each member of a search interface is enabled and configured separately. In other words, snippeting results are enabled and configured for each member of a search interface and not for all members of a single search interface.

A search interface member is a dimension or property that has been enabled for search and that has been added to the Selected members pane of the Search Interface editor. You can enable and configure any number of individual search interface members. Each member that you enable produces its own snippet.

Enabling a member in one search interface does not affect that member if it appears in other search interfaces. For example, enabling the Description property for Search Interface A does not affect the Description property in Search Interface B.

To configure a search interface member for snippeting results:

Open your project file in Developer Studio and double-click Search Interfaces.
Either create a new search interface or select an existing one from the Search Interfaces view and click Edit.
From the Selected members pane of the Search Interface editor, click a member that you want to configure for snippeting.
Click Edit.
The Member Options dialog box displays.
From the Member Options dialog box, check Enable snippeting.
Specify the maximum snippet size (number of words) a snippet can contain.
Click OK.
Repeat steps 3-7 if you want to configure additional search interface members.

Troubleshooting snippets

You can increase the maximum size of snippets to include more context words.

If you are not seeing enough context words in your snippet, open the Member Options editor and increase the value for Maximum Snippet Size. The default value is 25 words.

Using Relevance Ranking Modules

About relevance ranking

You use relevance ranking to control the order in which record search results are displayed to the end-user. Typically, relevance ranking is used to ensure that the most important search results are displayed earliest to the user, since users are generally unlikely to page or scan through large result sets.

The importance of a particular record search result is generally an application-specific concept. Thus, the relevance ranking feature provides a flexible, configurable set of ranking modules. These modules are then grouped into strategies that can be used in combination to produce a wide range of relevance ranking effects. Each search interface has its own ranking strategy.

Relevance ranking contains a rich set of features that should be used advisedly. Misuse of relevance ranking strategies can cause unexpected results and degraded performance.

Creating ranking modules

Ranking modules are selected from a stock list of modules. The Static and Phrase modules both take parameters.

You may have multiple instances of the Static module; however, you should have only one instance of the Phrase module for each search interface.

To assign one or more ranking modules to a search interface:

In the Search Interface editor, click Relevance Ranking Modules.
The Relevance Ranking Modules editor appears.
In the All Modules list, select a relevance ranking module and click Add.
The module is moved to the Selected Modules list.
Note
Selecting a module causes a brief description to appear in the frame in the lower left corner of the editor.
If you selected the Static module, edit the Static module parameters.
If you selected the Phrase module, edit the Phrase module parameters.
(Optional) Repeat step 2 to add additional modules to the search interface.
(Optional) Use the up and down arrows to adjust the relative rank of the modules in your ranking strategy.
Click OK to return to the Search Interface editor.

Modifying ranking modules

Edit ranking modules from the Relevance Ranking Modules editor, in the Search Interface editor.

To edit a ranking module in a search interface:

In the Search Interfaces view, select the search interface you want to modify and click Edit to open it in the Search Interface editor.
In the Search Interface editor, click Relevance Ranking Modules.
The Relevance Ranking Modules editor appears.
Make the necessary changes.
Click OK to return to the Search Interface editor.

Deleting ranking modules

Delete ranking modules from the Relevance Ranking Modules editor.

To remove a ranking module from a search interface:

In the Search Interfaces view, select the search interface you want to modify and click Edit to open it in the Search Interface editor.
In the Search Interface editor, click Relevance Ranking Modules.
The Relevance Ranking Modules editor appears.
In the Selected Modules list, select the module you want to delete and click Remove.
Click OK to return to the Search Interface editor.

Changing the order of ranking modules

Select and move modules as needed from the Relevance Ranking Modules editor.

By default, ranking modules are evaluated in the order in which you created them.

To change the order of ranking modules:

In the Search Interfaces view, select the search interface you want to modify and click Edit to open it in the Search Interface editor.
In the Search Interface editor, click Relevance Ranking Modules.
The Relevance Ranking Modules editor appears.
In the Selected Modules list, select a module that you want to move and click either the up arrow or the down arrow until the module is in the correct order.
(Optional) Repeat step 3 as needed.
Click OK to return to the Search Interface editor.
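
Why the order matters can be sketched as a multi-key sort. The example assumes that the first module decides the primary ordering and that each later module only breaks ties left by the modules before it, which is consistent with the tie-breaker role described for modules such as Glom. The rank function and the dictionary-based results are hypothetical.

```python
def rank(results, modules):
    """Order results by a list of scoring functions; higher scores come first.

    Assumption: a later module only breaks ties left by earlier modules.
    """
    return sorted(results, key=lambda r: tuple(m(r) for m in modules), reverse=True)


# Two toy scoring functions standing in for ranking modules.
def by_terms(result):
    return result["nterms"]


def by_frequency(result):
    return result["freq"]
```

Given results a (nterms 2, freq 1), b (nterms 2, freq 5), and c (nterms 3, freq 0), the strategy [by_terms, by_frequency] orders them c, b, a, while [by_frequency, by_terms] orders them b, a, c.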

Editing Static module parameters

Edit Static module parameters from the Relevance Ranking Modules editor.

The Static relevance ranking module, which indicates that a constant score be applied to a given result, is one of two modules that take parameters. You can apply it to a specific searchable dimension or property, and specify whether the records will be sorted in ascending or descending order. You can have multiple Static modules, as long as they have different configurations.

To rank the members of a dimension or property statically:

Open the Relevance Ranking Modules editor.
In the All Modules list, select Static and click Add. The Edit Static Relevance Rank Module editor appears.
In the New Property or Dimension list, select the property or searchable dimension to which you want to apply the static ranking module.
Check Sort Records in Descending Order if you want the resulting records sorted in that order. If you want the records sorted in ascending order (the default), make sure the checkbox is cleared.
Click OK to return to the Relevance Ranking Modules editor.

Editing Phrase module parameters

Edit Phrase module parameters from the Relevance Ranking Modules editor.

You can use only one Phrase module in any given search interface, but you can set all of your options in it.

The Phrase relevance ranking module states that results containing the user's query as an exact phrase, or a subset of the exact phrase, should be considered more relevant than matches simply containing the user's search terms scattered throughout the text. Phrase is one of two modules that take parameters.

To edit Phrase module parameters:

Open the Relevance Ranking Modules editor.
In the All Modules list, select Phrase and click Add. The Edit Phrase Relevance Rank Module editor appears.
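Set the options: Rank based on length of subphrases; Use approximate subphrase/phrase matching; and Apply spell correction, thesaurus, and stemming. See "Phrase" for detailed descriptions, interaction information, and examples of how to use these options.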
Click OK to return to the Relevance Ranking Modules editor.

Recommended relevance ranking strategies

Relevance ranking contains a rich set of features that should be used advisedly.

Misuse of relevance ranking strategies can cause unexpected results and degraded performance. See the Endeca Advanced Development Guide for detailed information on relevance ranking and recommended strategies.

Ranking Modules in Detail

Exact

The Exact module provides a finer grained (but more computationally expensive) alternative to the Phrase module.

The Exact module groups results into three strata based on how well they match the query string:

The highest stratum contains results whose complete text matches the user’s query exactly.
The middle stratum contains results that contain the user's query as an exact substring.
The lowest stratum contains other hits (such as normal conjunctive matches).
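
The three strata can be sketched as a small classification function. This is a simplified model rather than the module's actual implementation: it compares lowercased, whitespace-separated words, and the numeric stratum values (2, 1, 0) and function names are assumptions chosen for the example.

```python
def _tokens(text):
    return text.lower().split()


def _contains_sequence(haystack, needle):
    """True if `needle` occurs as a contiguous run of words in `haystack`."""
    n = len(needle)
    return n > 0 and any(
        haystack[i:i + n] == needle for i in range(len(haystack) - n + 1)
    )


def exact_stratum(field_text, query):
    """Return 2 for a complete match, 1 for an exact substring, 0 otherwise."""
    text, q = _tokens(field_text), _tokens(query)
    if text == q:
        return 2
    if _contains_sequence(text, q):
        return 1
    return 0
```

The substring check scans every position of the field text, which hints at why this kind of comparison becomes expensive on large text fields.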

The Exact module is computationally expensive, especially on large text fields. It is intended for use only on small text fields (such as dimension values or small property values like part IDs). This module should not be used with large or offline documents (such as FILE or ENCODED_FILE properties). Use of this module in these cases will result in very poor performance and/or application failures due to request timeouts. The Phrase module does similar but less sophisticated ranking and can be used as a higher performance substitute.

Field

The Field module ranks documents based on the highest-priority search interface field in which they matched.

Only the best field in which a match occurs is considered. The Field module is often used in relevance ranking strategies for catalog applications, because the category or product name is typically a good match. Field assigns a score to each result based on the static rank of the dimension or property member or members of the search interface that caused the document to match the query.

In Developer Studio, static field ranks are assigned based on the order in which members of a search interface are listed in the Search Interfaces view. The first (left-most) member has the highest rank.

By default, matches caused by cross-field matching are assigned a score of zero. The score for cross-field matches can be set explicitly in Developer Studio by moving the <<CROSS_FIELD>> indicator up or down in the Selected Members list of the Search Interface editor. The <<CROSS_FIELD>> indicator is available only for search interfaces that have the Field module and are configured to support cross-field matches.

All non-zero ranks must be non-equal and only their order matters. For example, a search interface might contain both Title and DocumentContent properties, where hits on Title are considered more important than hits on DocumentContent (which in turn are considered more important than <<CROSS_FIELD>> matches). Such a ranking is implemented by assigning the highest rank to Title, the next highest rank to DocumentContent, and setting the <<CROSS_FIELD>> indicator at the bottom of the Selected Members list in the Search Interface editor.

If a document matches on multiple fields, it is ranked based on the best field that it matches.
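
The scoring rule described above can be sketched as follows. The numeric rank values are an assumption (only their order matters), and the function names are hypothetical; the list passed in stands for the Selected Members list, including the <<CROSS_FIELD>> indicator when it is present.

```python
CROSS_FIELD = "<<CROSS_FIELD>>"


def field_ranks(selected_members):
    """Map each entry to a rank; the first entry gets the highest rank."""
    n = len(selected_members)
    return {name: n - i for i, name in enumerate(selected_members)}


def field_score(selected_members, matched_fields, cross_field_match=False):
    """Score a result by the best-ranked field that caused the match."""
    ranks = field_ranks(selected_members)
    if cross_field_match:
        # Zero unless the indicator has been placed in the list.
        return ranks.get(CROSS_FIELD, 0)
    return max((ranks[f] for f in matched_fields), default=0)
```

With the members Title, DocumentContent, and <<CROSS_FIELD>> in that order, a document that matches on both Title and DocumentContent receives the Title rank, and a cross-field match receives the lowest non-zero rank.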

Note

The Field module is only valid for record search operations. This module assigns a score of zero to all results for other types of search requests.

First

Designed primarily for use with unstructured data, the First module ranks documents by how close the query terms are to the beginning of the document.

First groups its results into variably-sized strata. The strata are not the same size, because while the first word is probably more relevant than the tenth word, the 301st is probably not so much more relevant than the 310th word. This module takes advantage of the fact that the closer something is to the beginning of a document, the more likely it is to be relevant.

The First module works as follows:

When the query has a single term, First's behavior is straightforward: it retrieves the first absolute position of the word in the document, then calculates which stratum contains that position. The score for this document is based upon that stratum; earlier strata are better than later strata.
When the query has multiple terms, First behaves as follows:
1. The first absolute position for each of the query terms is determined.
2. The median position of these positions is calculated. This median is treated as the position of this query in the document and can be used with stratification as described in the single word case.
With query expansion (using stemming, spelling correction, or the thesaurus), the First module treats expanded terms as if they occurred in the source query. For example, the phrase glucose intolerence would be corrected to glucose intolerance (with intolerence spell-corrected to intolerance). First then continues as it does in the non-expansion case. The first position of each term is computed and the median of these is taken.
In a partially matched query, where only some of the query terms cause a document to match, First behaves as if the intersection of terms that occur in the document and terms that occur in the original query were the entire query. For example, if the query cat bird dog is partially matched to a document on the terms cat and bird, then the document is scored as if the query were cat bird.
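
The position calculation described above can be sketched in a few lines. The sketch stops at the query position and does not model the strata, because their sizes are not specified here. It assumes zero-based word positions and, for an even number of terms, takes the lower of the two middle positions; both choices, and the function names, are assumptions.

```python
from statistics import median_low


def first_positions(doc_words, terms):
    """First zero-based position of each query term that occurs in the document."""
    positions = {}
    for i, word in enumerate(doc_words):
        if word in terms and word not in positions:
            positions[word] = i
    return positions


def query_position(document, query):
    """Median of the first positions of the matching terms, or None if none match."""
    words = document.lower().split()
    terms = set(query.lower().split())
    found = first_positions(words, terms)
    if not found:
        return None
    return median_low(found.values())
```

Because only terms that actually occur in the document contribute, a partially matched query is scored as if the matching terms were the entire query.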

First's interaction with other features

First works for partial match modes, such as MatchPartial, as well as for MatchAll. For partial matches, First ranks documents based on the median position of the matching terms.

First does not work with Boolean searches, cross-field matching, or wildcard search. It assigns all such matches a score of zero.

Frequency

The Frequency (freq) module provides result scoring based on the frequency (number of occurrences) of the user's query terms in the result text.

Results with more occurrences of the user search terms are considered more relevant.

Frequency values are capped at 1024.
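
A minimal sketch of capped frequency scoring follows. It assumes that occurrences of all query terms are summed into one count before the cap is applied; the function name and the whitespace tokenization are also assumptions.

```python
FREQUENCY_CAP = 1024


def frequency_score(result_text, query):
    """Count occurrences of the query terms in the text, capped at 1024."""
    words = result_text.lower().split()
    terms = set(query.lower().split())
    count = sum(1 for word in words if word in terms)
    return min(count, FREQUENCY_CAP)
```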

Glom

The Glom module ranks single-field matches ahead of cross-field matches.

This module serves as a useful tie-breaker function in combination with the Maximum Field module. It is only useful in conjunction with record search operations.

Interpreted

The Interpreted (interp) ranking module is a general-purpose module that assigns a score to each result based on the query processing techniques used to obtain the match.

Matching techniques considered include partial matching, cross-field matching, spelling correction, thesaurus, and stemming matching (discussed in detail in the Endeca Advanced Development Guide).

Specifically, the interpreted ranking module ranks results as follows:

All non-partial matches are ranked ahead of all partial matches.
Within the above layer, all single-field matches are ranked ahead of all cross-field matches.
Within the above layer, all non-spelling-corrected matches are ranked above all spelling-corrected matches.
Within the above layer, all non-thesaurus matches are ranked above all thesaurus matches.
Within the above layer, all non-stemming matches are ranked above all stemming (word form) matches.
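
The layered ordering above behaves like a sort on a tuple of flags, where each later flag only separates results that are equal on all earlier flags. The Match class and function names in this sketch are hypothetical; the flag order follows the list above.

```python
from dataclasses import dataclass


@dataclass
class Match:
    record_id: str
    partial: bool = False
    cross_field: bool = False
    spell_corrected: bool = False
    thesaurus: bool = False
    stemmed: bool = False


def interp_key(match):
    """False sorts before True, so 'cleaner' matches come first at each layer."""
    return (match.partial, match.cross_field, match.spell_corrected,
            match.thesaurus, match.stemmed)


def rank_interpreted(matches):
    return sorted(matches, key=interp_key)
```

For example, a match obtained only through stemming ranks ahead of a match obtained only through the thesaurus, which in turn ranks ahead of a spelling-corrected match.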

Maximum Field

Unlike Field, which assigns a static score to cross-field matches, Maximum Field selects the score of the highest-ranked field that contributed to the match.

The Maximum Field (maxfield) module behaves identically to the Field module, except in how it scores cross-field matches.

Because Maximum Field defines the score for cross-field matches dynamically, it does not make use of the <<CROSS_FIELD>> indicator set in the Search Interface editor.
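
The dynamic scoring of a cross-field match can be sketched as follows. As with Field, the numeric ranks are an assumption derived from list order, and the function name is hypothetical.

```python
def maxfield_cross_score(selected_members, contributing_fields):
    """Score a cross-field match by its highest-ranked contributing field."""
    n = len(selected_members)
    ranks = {name: n - i for i, name in enumerate(selected_members)}
    return max(ranks[f] for f in contributing_fields)
```

With the members Title, Brand, and Description in that order, a cross-field match that draws on Brand and Description receives the Brand rank, whereas Field would assign it the static cross-field score.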

Nterms

The Nterms module ranks matches according to how many query terms they match.

For example, in a three-word query, results that match all three words will be ranked above results that match only two, which will be ranked above results that match only one.

The Nterms module is only applicable to search modes where results can vary in how many query terms they match. These include MatchAny, MatchPartial, MatchAllAny, and MatchAllPartial. For details on specifying a search mode for a query, see the Endeca Advanced Development Guide.
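
As a minimal sketch, the score can be modeled as the number of distinct query terms found in the result text. The function name and the whole-word, case-insensitive comparison are assumptions.

```python
def nterms_score(result_text, query):
    """Number of distinct query terms that occur in the result text."""
    words = set(result_text.lower().split())
    return sum(1 for term in set(query.lower().split()) if term in words)
```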

Number of Fields

The Number of Fields (numfields) module ranks results based on the number of fields in the associated search interface in which a match occurs.

Note that this module counts whole-field matches rather than cross-field matches. For example, a result that matches two fields matches each field completely, while a cross-field match typically does not match any field completely.
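
The distinction can be sketched by counting only the fields that contain every query term. The record is modeled as a dictionary of field name to text, and the function name is hypothetical.

```python
def numfields_score(record_fields, query):
    """Number of fields that match the whole query on their own."""
    terms = set(query.lower().split())
    return sum(
        1 for text in record_fields.values()
        if terms <= set(text.lower().split())
    )
```

A record whose Title is "red" and whose Description is "wine" scores zero for the query "red wine" in this model, because no single field matches completely.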

Phrase

The Phrase module states that results containing the user's query as an exact phrase, or a subset of the exact phrase, should be considered more relevant than matches simply containing the user's search terms scattered throughout the text.

Note the following points about the Phrase module:

If a query contains only one word, then that word constitutes the entire phrase and all of the matching results will be put into one stratum (score = 1).
Because of the way hyphenated words are positionally indexed, Oracle recommends that you enable subphrase if your results contain hyphenated words.

Configuring the Phrase module

When you add the Phrase module in the Relevance Ranking Modules editor, you are presented with an editor that allows you to set these options.

The Phrase module has a variety of options that you use to customize its behavior:

Rank based on length of subphrases
Use approximate subphrase/phrase matching
Apply spell correction, thesaurus, and stemming

Ranking based on sub-phrases

Subphrasing ranks results based on the length of their subphrase matches. In other words, results that match three terms are considered more relevant than results that match two terms, and so on.

When you configure the Phrase module, you have the option of enabling subphrasing.

A subphrase is defined as a contiguous subset of the query terms the user entered, in the order that he or she entered them. For example, the query "fax cover sheets" contains the subphrases "fax," "cover," "sheets," "fax cover," "cover sheets," and "fax cover sheets," but not "fax sheets."

Content contained inside nested quotation marks in a phrase is treated as one term. For example, consider the following phrase: "the question is 'to be or not to be.' " The quoted text, "to be or not to be," is treated as one query term, so this example consists of four query terms even though it has a total of nine words.

When subphrasing is not enabled, results are ranked into two strata: those that matched the entire phrase and those that didn't.
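
The subphrase definition and the two scoring modes can be sketched as follows. This is a simplified model: it splits on whitespace, does not handle the nested-quotation rule, and uses a score of 1 or 0 for the two strata when subphrasing is off. The function names and that score choice are assumptions.

```python
def subphrases(query):
    """All contiguous, in-order runs of the query terms."""
    terms = query.lower().split()
    return [
        " ".join(terms[i:j])
        for i in range(len(terms))
        for j in range(i + 1, len(terms) + 1)
    ]


def _contains(text_terms, phrase_terms):
    """True if phrase_terms occurs as a contiguous run in text_terms."""
    n = len(phrase_terms)
    return n > 0 and any(
        text_terms[i:i + n] == phrase_terms
        for i in range(len(text_terms) - n + 1)
    )


def phrase_score(text, query, subphrase=True):
    text_terms = text.lower().split()
    terms = query.lower().split()
    if not subphrase:
        # Two strata only: whole phrase matched, or not.
        return 1 if _contains(text_terms, terms) else 0
    best = 0
    for i in range(len(terms)):
        for j in range(i + 1, len(terms) + 1):
            if j - i > best and _contains(text_terms, terms[i:j]):
                best = j - i  # length of the longest matched subphrase so far
    return best
```

For the query "fax cover sheets", subphrases returns six entries and does not include "fax sheets"; a result containing "fax cover page" scores 2 with subphrasing on and 0 with it off.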

About approximate matching

The approximate setting is appropriate in cases where the runtime performance of the standard Phrase module is inadequate because of large result contents and/or high site load.

Approximate matching provides higher-performance matching, as compared to the standard Phrase module, with somewhat less exact results. With approximate matching enabled, the Phrase module looks at a limited number of positions in each result where a phrase match could possibly exist, rather than all the positions. Only this limited number of possible occurrences is considered, regardless of whether there are later occurrences that are better, more relevant matches.

Enabling positional indexing increases the number of occurrences that the Phrase module looks at, thereby increasing the accuracy of the approximate phrase matching results. See Using positional indexing with the Phrase module for more information.

Applying spelling correction, thesaurus, and stemming with the Phrase module

This section describes how the Phrase module behaves when query expansion is enabled.

Applying spelling correction, thesaurus, and stemming adjustments to the original phrase is generically known as query expansion. With query expansion enabled, the Phrase module ranks results that match a phrase's expanded forms in the same stratum as results that match the original phrase. Consider the following example:

A thesaurus entry exists that expands "US" to "United States."
The user queries for "US government."

The query, "US government," is expanded to "United States government" for matching purposes, but the Phrase module gives a score of two to any results matching "United States government" because the original, unexpanded version of the query, "US government," only had two terms.
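
The scoring rule in this example can be sketched as follows: the expanded form is used for matching, but the score is the term count of the original query. The sketch models only whole-phrase matching with a thesaurus held in a plain dictionary; the function names and data shapes are hypothetical.

```python
def _contains(text_terms, phrase_terms):
    """True if phrase_terms occurs as a contiguous run in text_terms."""
    n = len(phrase_terms)
    return n > 0 and any(
        text_terms[i:i + n] == phrase_terms
        for i in range(len(text_terms) - n + 1)
    )


def expand(terms, thesaurus):
    """Replace each term that has a thesaurus entry with its expansion."""
    expanded = []
    for term in terms:
        expanded.extend(thesaurus.get(term, term).split())
    return expanded


def expanded_phrase_score(text, query, thesaurus):
    """Match on the original or expanded phrase; score by original length."""
    text_terms = text.lower().split()
    original = query.lower().split()
    for candidate in (original, expand(original, thesaurus)):
        if _contains(text_terms, candidate):
            return len(original)
    return 0
```

With the entry "us" expanding to "united states", a result containing "united states government" scores 2 for the query "US government", the same as a result containing the original phrase.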

Summary of Phrase module option interactions

The three configuration settings for the Phrase module can be used in a variety of combinations for different effects. The following matrix describes the behavior of each combination. You should only use one Phrase module in any given search interface and set all of your options in it.

Subphrase	Approximate	Expansion	Behavior
Off	Off	Off	Default. Ranks results into two strata: those that match the user's query as a whole phrase, and those that do not.
Off	Off	On	Ranks results into two strata: those that match the original or an expanded version of the query as a whole phrase, and those that do not.
Off	On	Off	Ranks results into two strata: those that match the original query as a whole phrase, and those that do not. Looks only at the first possible phrase match within each record.
Off	On	On	Ranks results into two strata: those that match the original or an expanded version of the query as a whole phrase, and those that do not. Looks only at the first possible phrase match within each record.
On	Off	Off	Ranks results into N strata, where N equals the length of the query and each result's score equals the length of its matched subphrase.
On	Off	On	Ranks results into N strata, where N equals the length of the query and each result's score equals the length of its matched subphrase. Expands subphrases to facilitate matching but ranks based on the length of the original subphrase (before expansion).
On	On	Off	Ranks results into N strata, where N equals the length of the query and each result's score equals the length of its matched subphrase. Looks only at the first possible phrase match within each record.
On	On	On	Ranks results into N strata, where N equals the length of the query and each result's score equals the length of its matched subphrase. Expands the query to facilitate matching but ranks based on the length of the original subphrase (before expansion). Looks only at the first possible phrase match within each record.