Siebel Data Quality Administration Guide > Siebel Data Quality Matching Server >

SDQ Matching Server Libraries for Multiple Languages


The matching rules for the Siebel Data Quality (SDQ) Matching Server are compiled in a set of dynamic-link libraries (DLLs) for various languages. Because the character and name patterns differ substantially between languages, matching rules typically are tuned specifically for each language or language family.

The SDQ Matching Server includes a set of matching libraries that cover a variety of languages and code pages. By default, the installation uses a generalized international library that is built to support a set of Latin1-based languages (languages predominant in the Americas, Western Europe, Australia, and New Zealand). In addition, the Siebel installation DVDs include reference libraries for other regions and code pages. Table 4 provides the matching libraries and languages supported. For information about code pages, see Global Deployment Guide.

NOTE:  The international library intentionally ignores certain words and abbreviations because those words and abbreviations may have a different meaning in other non-Latin1 languages.

Table 4.  Supported SDQ Matching Server Libraries and Languages
International Library
Other Libraries

DAN - Danish

CHS - Simplified Chinese

DEU - German

CHT - Traditional Chinese

ESN - Spanish

CSY - Czech

ENU - English

ELL - Greek

FIN - Finnish

HEB - Hebrew

FRA - French

JPN - Japanese

ITA - Italian

KOR - Korean

NLD - Dutch

PLK - Polish

PTG - Portuguese

 

PTB - Brazilian Portuguese

 

SVE - Swedish

 

NOTE:  The matching rules for each language or combination of languages are delivered in the form of DLLs. You can retrieve additional DLLs by installing other language packs on the Siebel Server. For more information about DLLs, see SDQ Universal Connector Architecture.

You can achieve better matching if the region-specific library is used. However, the international library is best if the data is not limited to that region, because a dataset can include a heterogeneous mixture of international names. Installing region-specific libraries for Latin-based languages requires that an administrator replace the library file on the Siebel Server with the language-specific version of the file. For example, for Windows ENU, the library is placed in C:\Siebel\SiebSrvr\bin\enu. For UNIX ENU, the library is placed in /export/home/siebel/siebsrvr/lib/enu.

The library file installed on each Siebel Server should be in sync with the data that is processed from that machine. For example, if the Japanese library is installed, a batch component job for key generation or deduplication should be constrained to Japanese data.

If the Siebel Server is running in Japanese, it loads and references the Japanese version of the matching libraries while the other Siebel Servers (running in a different language) load and reference other matching library files on their own server file systems. The match keys table in the database stores keys generated from different libraries on different Siebel Servers, together with indicators for code page and population (matching library). When a match request is executed, the list of possible match candidates is built based on the match keys from the same code page and population.

NOTE:  The SDQ Matching Server does not support the ability to find matches across languages that are not supported by the installed library file. For example, English and French data can be compared using the international library, but Chinese and Spanish data cannot be compared because the matching rules for Chinese require a separate library.

Viewing Settings for SDQ Matching Server Libraries

You can view the settings for the SDQ Matching Server libraries using Siebel Tools.

To view settings for SDQ Matching Server libraries

  1. Start Siebel Tools.
  2. In the Object Explorer, expand the Business Service object.
  3. In the Business Services window, query for DeDuplication.
  4. In the Object Explorer, select Business Service User Prop, and then query for
    SSA Population-Codepage*.

    TIP:   If the Business Service User Prop object is not visible in the Object Explorer, you can enable it in the Development Tools Options dialog box. For information about how to view object types in the Object Explorer, see the topic about showing and hiding object types in Using Siebel Tools.

For more information about matching libraries, see SDQ Matching Server Libraries for Multiple Languages.

Siebel Data Quality Administration Guide