Search interfaces allow your end-users to search on multiple properties and/or dimensions simultaneously.

Create new search interfaces within the Search Interfaces editor of the Search Interfaces view.

To create a new search interface:

  1. On the Project tab, double-click Search Interfaces to open the Search Interface view.

  2. In the Search Interface view, click New.

    The Search Interfaces editor appears.

  3. In the Name box, type the name of the new search interface.

  4. From the Allow Cross-field Matches list, choose Always, Never, or On Failure. The Allow Cross-field Matches option specifies when the MDEX Engine should try to match search queries across dimension or property boundaries, but within the members of the search interface. There are three possible values:

  5. In the All (Searchable) Members list, select a member and click Add to add it to the Selected Members list. Repeat as many times as necessary to add additional members to the search interface.

    Only Endeca properties and dimensions that have their Enable record search option checked appear in this list.

  6. ( Optional) If you want to associate a relevance ranking strategy to the search interface, click Relevance Ranking Modules.

  7. ( Optional ) If you want to make more detailed adjustments to the search interface, click Options and configure the Customize partial match settings, which specify if partial matches for search terms should be supported for this search interface.

Implementing search features requires additional work outside of Developer Studio. Please refer to the Endeca Basic Development Guide for details.

You use the Search Interface editor to create a new search interface or modify the attributes of an existing one.

The Search Interface editor contains the following fields:

A snippet consists of search terms, surrounding context words, and ellipses.

A snippet can contain any number of search terms bracketed by <endeca_term></endeca_term> tags. The tags call out search terms and allow you to more easily reformat the terms for display in your Web application.

The snippet size is the total number of search terms and surrounding context words. You can configure the total number of words in a snippet as described in Enabling snippeting. In order to adhere to the size setting for a snippet, it is possible that the MDEX Engine may omit some search terms and context words from a snippet. This situation becomes more likely if an application user provides a large number of search terms and the maximum snippet size is comparatively small.

A snippet consists of one or more segments. The segments are delimited by ellipses in between them. Ellipses (...) indicate that there is text omitted from the snippet occurring before or after the ellipses.

For example, here is a snippet made up of two segments with a maximum size set at 20 words. The snippet resulted from a search for the search terms Scotland and British which are enclosed within <endeca_term> tags.

...in Edinburgh <endeca_term>Scotland</endeca_term>, and has been employed by Ford for 25 years ... He first joined Ford's <endeca_term>British</endeca_term> operation. Mazda motor ...

You enable the snippeting feature in the Member Options dialog box, which is accessed from the Search Interface editor.

You enable the snippeting feature in the Member Options dialog box, which is accessed from the Search Interface editor. Each member of a search interface is enabled and configured separately. In other words, snippeting results are enabled and configured for each member of a search interface and not for all members of a single search interface.

A search interface member is a dimension or property that has been enabled for search and that has been added to the Selected members pane of the Search Interface editor. You can enable and configure any number of individual search interface members. Each member that you enable produces its own snippet.

Enabling a member in one search interface does not affect that member if it appears in other search interfaces. For example, enabling the Description property for Search Interface A does not affect the Description property in Search Interface B.

To configure a search interface member for snippeting results:

Implementing search features requires additional work outside of Developer Studio. Please refer to the Endeca Basic Development Guide for details.

Edit phrase modules parameters from the Relevance Ranking Modules editor.

You can use only one Phrase module in any given search interface, but you can set all of your options in it.

The Phrase relevance ranking module states that results containing the user's query as an exact phrase, or a subset of the exact phrase, should be considered more relevant than matches simply containing the user's search terms scattered throughout the text. Phrase is one of two modules that take parameters.

To edit Phrase module parameters:

The Field module ranks documents based on the search interface field with the highest priority in which it matched.

Only the best field in which a match occurs is considered. The Field module is often used in relevance ranking strategies for catalog applications, because the category or product name is typically a good match. Field assigns a score to each result based on the static rank of the dimension or property member or members of the search interface that caused the document to match the query .

In Developer Studio, static field ranks are assigned based on the order in which members of a search interface are listed in the Search Interfaces view. The first (left-most) member has the highest rank.

By default, matches caused by cross-field matching are assigned a score of zero. The score for cross-field matches can be set explicitly in Developer Studio by moving the <<CROSS_FIELD>> indicator up or down in the Selected Members list of the Search Interface editor. The <<CROSS_FIELD>> indicator is available only for search interfaces that have the Field module and are configured to support cross-field matches.

All non-zero ranks must be non-equal and only their order matters. For example, a search interface might contain both Title and DocumentContent properties, where hits on Title are considered more important than hits on DocumentContent (which in turn are considered more important than <<CROSS_FIELD>> matches). Such a ranking is implemented by assigning the highest rank Title, the next highest rank to DocumentContent, and setting the <<CROSS_FIELD>> indicator at the bottom of the Selected Members list in the Search Interface editor.

If a document matches on multiple fields, it is ranked based on the best field that it matches.

Designed primarily for use with unstructured data, the First module ranks documents by how close the query terms are to the beginning of the document.

First groups its results into variably-sized strata. The strata are not the same size, because while the first word is probably more relevant than the tenth word, the 301st is probably not so much more relevant than the 310th word. This module takes advantage of the fact that the closer something is to the beginning of a document, the more likely it is to be relevant.

The First module works as follows:

The Phrase module states that results containing the user's query as an exact phrase, or a subset of the exact phrase, should be considered more relevant than matches simply containing the user's search terms scattered throughout the text.

Note the following points about the Phrase module:

When you add the Phrase module in the Relevance Ranking Modules editor, you are presented with an editor that allows you to set these options.

The Phrase module has a variety of options that you use to customize its behavior:

Edit phrase modules parameters from the Relevance Ranking Modules editor.

You can use only one Phrase module in any given search interface, but you can set all of your options in it.

The Phrase relevance ranking module states that results containing the user's query as an exact phrase, or a subset of the exact phrase, should be considered more relevant than matches simply containing the user's search terms scattered throughout the text. Phrase is one of two modules that take parameters.

To edit Phrase module parameters:

You should only use one Phrase module in any given search interface and set all of your options in it.

The three configuration settings for the Phrase module can be used in a variety of combinations for different effects. The following matrix describes the behavior of each combination. You should only use one Phrase module in any given search interface and set all of your options in it.

Subphrase

Approximate

Expansion

Behavior

Off

Off

Off

Default. Ranks results into two strata: those that match the user's query as a whole phrase, and those that do not.

Off

Off

On

Ranks results into two strata: those that match the original, or an extended version, of the query as a whole phrase, and those that do not.

Off

On

Off

Ranks results into two strata: those that match the original query as a whole phrase, and those that do not. Look only at the first possible phrase match within each record.

Off

On

On

Ranks results into two strata: those that match the original, or an extended version, of the query as a whole phrase, and those that do not. Look only at the first possible phrase match within each record.

On

Off

Off

Ranks results into N strata where N equals the length of the query and each result's score equals the length of its matched subphrase.

On

Off

On

Ranks results into N strata where N equals the length of the query and each result's score equals the length of its matched subphrase. Extend subphrases to facilitate matching but rank based on the length of the original subphrase (before extension).

On

On

Off

Ranks results into N strata where N equals the length of the query and each result's score equals the length of its matched subphrase. Look only at the first possible phrase match within each record.

On

On

On

Ranks results into N strata where N equals the length of the query and each result's score equals the length of its matched subphrase. Expand the query to facilitate matching but rank based on the length of the original subphrase (before extension). Look only at the first possible phrase match within each record.

The Phrase module translates each wildcard in a query into a generic placeholder for a single term.

For example, the query "sparkling w* wine" becomes "sparkling * wine" during phrase relevance ranking, where "*" indicates a single term. This generic wildcard replacement causes slightly different behavior when subphrasing is and isn't enabled.

When subphrasing is not enabled, all results that match the generic version of the wildcard phrase exactly are still placed into the first stratum. It is important, however, to understand what constitutes a matching result from the Phrase module's point of view.

Consider the search query "sparkling w* wine" with the MatchAny mode enabled. In MatchAny mode, search results only need to contain one of the requested terms to be valid, so a list of search results for this query could contain phrases that look like this:

When phrase relevance ranking is applied to these search results, the Phrase module looks for matches to "sparkling * wine" not "sparkling w* wine." Therefore, there are three results-"sparkling white wine," "sparkling refreshing wine," and "sparkling wet wine"-that are considered phrase matches for the purposes of ranking. These results are placed in the first stratum. The other two results are placed in the second stratum.

When subphrasing is enabled, the behavior becomes a bit more complex. Again, we have to remember that wildcards become generic placeholders and match any single term in a result. This means that any subphrase that is adjacent to a wildcard will, by definition, match at least one additional term (the wildcard). Because of this behavior, subphrases break down differently. The subphrases for "cold sparkling w* wine" break down into the following (note that w* changes to *):

Notice that the subphrases "sparkling," "wine," and "cold sparkling" are not included in this list. Because these subphrases are adjacent to the wildcard, we know that the subphrases will match at least one additional term. Therefore, these subphrases are subsumed by the "sparkling *", "* wine", and "cold sparkling *" subphrases.

Like regular subphrase, stratification is based on the number of terms in the subphrase, and the wildcard placeholders are counted toward the length of the subphrase. To continue the example above, results that contain "cold" get a score of one, results that contain "sparkling *" get a score of two, and so on. Again, this is the case even if the matching result phrases are different, for example, "sparkling white" and "sparkling soda."

Finally, it is important to note that, while the wildcard can be replaced by any term, a term must still exist. In other words, search results that contain the phrase "sparkling wine" are not acceptable matches for the phrase "sparkling * wine" because there is no term to substitute for the wildcard. Conversely, the phrase "sparkling cold white wine" is also not a match because each wildcard can be replaced by one, and only one, term. Even when wildcards are present, results must contain the correct number of terms, in the correct order, for them to be considered phrase matches by the Phrase module.

Designed primarily for use with unstructured data, the Proximity module ranks how close the query terms are to each other in a document by counting the number of intervening words.

Like First, this module groups its results into variable sized strata, because the difference in significance of an interval of one word and one of two words is usually greater than the difference in significance of an interval of 21 words and 22.

Single words and phrases get assigned to the best stratum because there are no intervening words. When the query has multiple terms, Proximity behaves as follows:

Under query expansion (that is, stemming, spelling correction, and the thesaurus), the expanded terms are treated as if they were in the query, so the proximity metric is computed using the locations of the expanded terms in the matching document.

For example, if a user searches for big cats and a document contains the sentence, "Big Bird likes his cat" (stemming takes cats to cat), then the proximity metric is computed just as if the sentence were, "Big Bird likes his cats."

Proximity scores partially matched queries as if the query only contained the matching terms. For example, if a user searches for cat dog fish and a document is partially matched that contains only cat and fish, then the document is scored as if the query cat fish had been entered.

Proximity interacts with other features as follows:


Copyright © Legal Notices