International Language Environments Guide

Summary of the Solaris 8 Locale

All Solaris 8 locale packages are classified into two categories. The first category is for partial locales, which are the enablers of the locales. With partial locales installed on the system, users can run applications on the target locales, while the OS/GUI messages from Solaris are English. All partial locale packages are available on the Solaris OS CDs.

The second category is for full locale packages. These packages include translations of software messages, on-line help files, optional fonts, and language specific features. Full locale packages provide the full set of language features to 9 languages.

Full locale packages are available on the languages CD. Partial locale packages (locale enablers) have to be installed in order for the full locales to be functional.

Localization Content on Solaris 8 CD-ROMs

Partial locales are selected at the beginning of the install procedure on the OS CD-ROM. Full locales are automatically installed from the Language CD-ROM according to the locale selections made at the beginning of the install procedure.

The distribution of locales is shown in the table below.

Table 3-1 Solaris 8 Installation CD-ROMs

Disk 

Contents 

Solaris OS CD-ROM 

Solaris 8 Operating System 

all partial locales 

Language CD-ROM 

message translations for 9 languages 

locale specific utilities 

As mentioned, the locales include partial locales. These are based on core locales for the main language. For example, the fr_CA.ISO8859-1 (French Canadian) is based on the fr_FR.ISO8859-1 (French) locale. These partial locales utilize the messages that are delivered into its parent locale (French for fr_CA). If a locale hasn't been fully localized, then it might contain only English messages.

Localization Functions in Solaris Interfaces

The OS locale layer provides the basic locale database and functions that are plugged into the OS system interface at the application's run time. Applications will access these OS locale modules through standard APIs as described in Chapter 2.

The X11 locale layer provides the interface to X input method and X output method such that the X11 applications can allow local text input and display. Fonts are provided to allow applications to display characters from various languages.

CDE/Motif is built on top of the X11 window system. Hence, it can utilize the X11 locale capability through X11 APIs. Solaris localizations have various locale-specific configurations for CDE applications, in order to make the desktop functional within the target locale.

Message translations and on-line help contents are provided throughout different layers as described in the following diagram.

Figure 3-1 Functions and Structure of Locales in Solaris

Graphic

Script Enabling for Solaris 8

The Solaris 8 base product provides multiple levels of script enabling, such as simple ASCII support, Latin/European support, Asian multibyte support, and Arabic/Hebrew bidirectional support.

The interfaces defined within the X/Open specification are capable of supporting a large set of languages and territories, including the following types of script:

Script 

Description 

Latin Language 

Americas, Eastern/Western Europe, Turkey 

Greek 

Greece 

East Asia 

Japanese, Korean, and Chinese 

Indic 

Thai 

Bidirectional 

Arabic and Hebrew 

Cyrillic 

Russian 

Localization in the Base and Multilingual Solaris Product

The base Solaris 8 product includes all partial locales, (including multibyte locales) which provide the functionality needed to input, display, and print text in their target languages while using English user interfaces.

The multilingual Solaris 8 product is a super set of the base Solaris product. It additionally includes 9 language translations (user interface and documentation) and some additional software such as BCP support, optional fonts, and optional utilities on the Language CD.

The English Unicode locale (en_US.UTF-8) is installed as the default, while other locales are installed when the locale is selected as install locale during the Solaris install process. Since the UTF-8 locales require all the languages fonts, basic fonts supporting all languages are also installed as the default.

The File System Safe Universal Transformation Format, or UTF-8, is an encoding defined by X/Open as a multi-byte representation of Unicode. UTF-8 encompasses almost all of the characters for traditional single-byte and multi-byte locales for European and Asian languages for Solaris locales.

Additional locale support is packaged according to the geographic region which they support. During the Solaris install process, you are prompted to choose which geographic regions require your support. The locale support available after installation has finished depends on the choices made at this stage.

The following tables lists all the locales supported by the Solaris 8 environment. The locale names have been updated from the Solaris 7 environment in keeping with international naming standards.

All of these locales are also present in the base Solaris 8 release.

Table 3-2 Asia

Locale 

User Interface 

Territory 

Codeset 

Language Support 

ja 

Japanese 

Japan 

eucJP 

Japanese (EUC) 

 

 

 

 

JISX0201-1976 

 

 

 

 

JISX0208-1990 

 

 

 

 

JISX0212-1990 

ja_JP.PCK 

Japanese 

Japan 

PCK 

Japanese (PC kanji) 

 

 

 

 

JISX0201-1976 

 

 

 

 

JISX0208-1990 

ja_JP.UTF-8 

Japanese 

Japan 

UTF-8 

Japanese (UTF-8) Unicode 3.0 

ko 

Korean 

Korea 

5601 

Korean (EUC) 

KSC 5601-1987 

ko.UTF-8 

Korean 

Korea 

UTF-8 

Korean (UTF-8) KSC 

Unicode 3.0 

th 

English 

Thailand 

TIS620.2533 

Thai TIS620.2533 

zh 

Simplified Chinese 

PRC 

gb2312 

Simplified Chinese (EUC)  

GB2312-1980 

zh.GBK 

Simplified Chinese 

PRC 

GBK 

Simplified Chinese (GBK) GBK 

zh.UTF-8 

Simplified Chinese 

PRC 

UTF-8 

Simplified Chinese (UTF-8)  

Unicode 3.0 

zh_TW 

Traditional Chinese 

Taiwan 

cns11643 

Traditional Chinese (EUC)  

CNS 11643-1992 

zh_TW.BIG5 

Traditional Chinese 

Taiwan 

 

BIG5 

Traditional Chinese (BIG5) 

BIG5 

zh_TW.UTF-8 

Traditional Chinese 

Taiwan 

UTF-8 

Traditional Chinese (UTF-8)  

Unicde 3.0 

Table 3-3 Australasia

Locale 

User Interface 

Territory 

Codeset 

Language Support 

en_AU.ISO8859-1 

English 

Australia 

ISO8859-1 

English (Australia)  

en_NZ.ISO8859-1 

English 

New Zealand 

ISO8859-1 

English (New Zealand) 

Table 3-4 Central America

Locale 

User Interface 

Territory 

Codeset 

Language Support 

es_CR.ISO8859-1 

Spanish 

Costa Rica 

ISO8859-1 

Spanish (Costa Rica) 

es_GT.ISO8859-1 

Spanish 

Guatemala 

ISO8859-1 

Spanish (Guatemala) 

es_MX.ISO8859-1 

Spanish 

Mexico 

ISO8859-1 

Spanish (Mexico) 

es_NI.ISO8859-1 

Spanish 

Nicaragua 

ISO8859-1 

Spanish (Nicaragua) 

es_PA.ISO8859-1 

Spanish 

Panama 

ISO8859-1 

Spanish (Panama) 

es_SV.ISO8859-1 

Spanish 

El Salvador 

ISO8859-1 

Spanish (El Salvador) 

Table 3-5 Central Europe

Locale 

User Interface 

Territory 

Codeset 

Language Support 

cs_CZ.ISO8859-2 

English 

Czech Republic 

ISO8859-2 

Czech (Czech Republic) 

de_AT.ISO8859-1 

German  

Austria 

ISO8859-1 

German (Austria)  

de_AT.ISO8859-15 

German  

Austria 

ISO8859-15 

German (Austria, ISO8859-15 - Euro) 

de_CH.ISO8859-1 

German  

Switzerland 

ISO8859-1 

German (Switzerland)  

de_DE.UTF-8 

German  

Germany 

UTF-8 

German (Germany, Unicode 3.0) 

de_DE.ISO8859-1 

German  

Germany 

ISO8859-1 

German (Germany) 

de_DE.ISO8859-15 

German  

Germany 

ISO8859-15 

German (Germany, ISO8859-15 - Euro) 

fr_CH.ISO8859-1 

French  

Switzerland 

ISO8859-1 

German (Switzerland) 

hu_HU.ISO8859-2 

English 

Hungary 

ISO8859-2 

Hungarian (Hungary) 

pl_PL.ISO8859-2 

English 

Poland 

ISO8859-2 

Polish (Poland) 

sk_SK.ISO8859-2 

English 

Slovakia 

ISO8859-2 

Slovak (Slovakia) 

Table 3-6 Eastern Europe

Locale 

User Interface 

Territory 

Codeset 

Language Support 

bg_BG.ISO8859-5 

English 

Bulgaria 

ISO8859-5 

Bulgarian (Bulgaria) 

et_EE.ISO8859-15  

English 

Estonia 

ISO8859-15 

Estonian (Estonia) 

hr_HR.ISO8859-2 

English 

Croatia 

ISO8859-2 

Croatian (Croatia) 

lt_LT.ISO8859-13  

English 

Lithuania 

ISO8859-13 

Lithuanian (Lithuania) 

lv_LV.ISO8859-13  

English 

Latvia 

ISO8859-13 

Latvian (Latvia) 

mk_MK.ISO8859-5 

English 

Macedonia 

ISO8859-5 

Macedonian (Macedonia) 

ro_RO.ISO8859-2 

English 

Romania 

ISO8859-2 

Romanian (Romania) 

ru_RU.KOI8-R 

English 

Russia 

KOI8-R 

Russian (Russia, KOI8-R) 

ru_RU.ANSI1251 

English 

Russia 

ansi-1251 

Russian (Russia, ANSI 1251) 

ru_RU.ISO8859-5 

English 

Russia 

ISO8859-5 

Russia (Russia) 

sh_BA.ISO8859-2@bosnia 

English 

Bosnia 

ISO8859-2 

Bosnian (Bosnia) 

sl_SI.ISO8859-2 

English 

Slovenia 

ISO8859-2 

Slovenian (Slovenia) 

sq_AL.ISO8859-2 

English 

Albania 

ISO8859-2 

Albanian (Albania) 

sr_YU.ISO8859-5 

English 

Serbia 

ISO8859-5 

Serbian (Serbia) 

tr_TR.ISO8859-9 

English 

Turkey 

ISO8859-9 

Turkish (Turkey) 

Table 3-7 Middle East

Locale 

User Interface 

Territory 

Codeset 

Language Support 

he_IL.ISO8859-6 

English 

Israel 

ISO8859-6 

Hebrew (Israel) 

Table 3-8 North Africa

Locale 

User Interface 

Territory 

Codeset 

Language Support 

ar_EY.ISO8859-1 

English 

Egypt 

ISO8859-6 

Arabic (Egypt) 

Table 3-9 North America

Locale 

User Interface 

Territory 

Codeset 

Language Support 

en_CA.ISO8859-1 

English 

Canada 

ISO8859-1 

English (Canada) 

en_US.ISO8859-1 

English 

USA 

ISO8859-1 

English (U.S.A.) 

en_US.ISO8859-15 

English 

USA 

ISO8859-15 

English (U.S.A., ISO8859-15 - Euro) 

en_US.UTF-8 

English 

USA 

UTF-8 

English (U.S.A., Unicode 3.0) 

fr_CA.ISO8859-1 

French 

Canada 

ISO8859-1 

French (Canada) 

Table 3-10 North Europe

Locale 

User Interface 

Territory 

Codeset 

Language Support 

da_DK.ISO8859-1 

English 

Denmark 

ISO8859-1 

Danish (Denmark) 

da_DK.ISO8859-15 

English 

Denmark 

ISO8859-15 

Danish (Denmark, ISO8859-15 Euro) 

fi_FI.ISO8859-1 

English 

Finland 

ISO8859-1 

Finnish (Finland) 

fi_FI.ISO8859-15 

English 

Finland 

ISO8859-15 

Finnish (Finland ISO8859-15 Euro) 

is_IS.ISO8859-1 

English 

Iceland 

ISO8859-1 

Icelandic (Iceland) 

no_NO.ISO8859-1@bokmal 

English 

Norway 

ISO8859-1 

Norwegian (Norway -- Bokmal) 

no_NO.ISO8859-1@nyorsk 

English 

Norway 

ISO8859-1 

Norwegian (Norway -- Nynorsk) 

sv_SE.ISO8859-1 

Swedish 

Sweden 

ISO8859-1 

Swedish (Sweden) 

sv_SE.ISO8859-15 

Swedish 

Sweden 

ISO8859-15 

Swedish (Sweden, ISO8859-15 Euro) 

sv_SE..UTF-8 

Swedish 

Sweden 

UTF-8 

Swedish (Sweden, Unicode 3.0) 

Table 3-11 South America

Locale 

User Interface 

Territory 

Codeset 

Language Support 

es_AR.ISO8859-1 

Spanish 

Argentina 

ISO8859-1 

Spanish (Argentina) 

es_BO.ISO8859-1 

Spanish 

Bolivia 

ISO8859-1 

Spanish (Bolivia)  

es_CL.ISO8859-1 

Spanish 

Chilie 

ISO8859-1 

Spanish (Chile) 

es_CO.ISO8859-1 

Spanish 

Colombia 

ISO8859-1 

Spanish (Colombia) 

es_EC.ISO8859-1 

Spanish 

Ecuador 

ISO8859-1 

Spanish (Ecuador)  

es_PE.ISO8859-1 

Spanish 

Peru 

ISO8859-1 

Spanish (Peru) 

es_PY.ISO8859-1 

Spanish 

Paraguay 

ISO8859-1 

Spanish (Paraguay) 

es_UY.ISO8859-1 

Spanish 

Uruguay 

ISO8859-1 

Spanish (Uruguay) 

es_VE.ISO8859-1 

Spanish 

Venezuela 

ISO8859-1 

Spanish (Venezuela) 

pt_BR.ISO8859-1 

English 

Brazil 

ISO8859-1 

Portuguese (Brazil) 

Table 3-12 South Europe

Locale 

User Interface 

Territory 

Codeset 

Language Support 

el_GR.ISO8859-7 

English 

Greece 

ISO8859-7 

Greek (Greece) 

es_ES.ISO8859-1 

Spanish 

Spain 

ISO8859-1 

Spanish (Spain) 

es_ES.ISO8859-15 

Spanish 

Spain 

ISO8859-15 

Spanish (Spain, ISO8859-15 - Euro) 

es_ES.UTF-8 

Spanish 

Spain 

UTF-8 

Spanish (Spain, Unicode 3.0) 

it_IT.ISO8859-1 

Italian 

Italy 

ISO8859-1 

Italian (Italy) 

it_IT.ISO8859-15 

Italian 

Italy 

ISO8859-15 

Italian (Italy, ISO8859-15 - Euro) 

it_IT.UTF-8 

Italian 

Italy 

UTF-8 

Italian (Italy, Unicode 3.0) 

pt_PT.ISO8859-1 

English 

Portugal 

ISO8859-1 

Portuguese (Portugal) 

pt_PT.ISO8859-15 

English 

Portugal 

ISO8859-15 

Portuguese Portugal, ISO8859-15 - Euro) 

Table 3-13 Western Europe

Locale 

User Interface 

Territory 

Codeset 

Language Support 

en_GB.ISO8859-1 

English 

Great Britain 

ISO8859-1 

English (Great Britain) 

en_GB.ISO8859-15  

English 

Great Britain 

ISO8859-15 

English (Great Britain, ISO8859-15 - Euro) 

en_IE.ISO8859-1 

English 

Ireland 

ISO8859-1 

English (Ireland) 

en_IE.ISO8859-15  

English 

Ireland 

ISO8859-15 

English (Ireland, ISO8859-15 - Euro) 

fr_BE.ISO8859-1 

French 

Belgium-Walloon 

ISO8859-1  

French (Belgium-Walloon) 

fr_BE.ISO8859-15 

French 

Belgium-Wallon 

ISO8859-15  

French (Belgium-Walloon, ISO8859-15 - Euro) 

fr_FR.ISO8859-1 

French 

France 

ISO8859-1 

French (France) 

fr_FR.ISO8859-15 

French 

France 

ISO8859-15 

French (France, ISO8859-15 - Euro) 

fr_FR.UTF-8 

French 

France 

UTF-8 

French (France, Unicode 3.0) 

nl_BE.ISO8859-1 

English 

Belgium-Flemish  

ISO8859-1 

Dutch (Belgium-Flemish) 

 

nl_BE.ISO8859-15 

English 

Belgium-Flemish 

ISO8859-15  

Dutch (Belgium-Flemish, ISO8859-15 - Euro) 

nl_NL.ISO8859-1 

English 

Netherlands 

ISO8859-1 

Dutch (Netherlands) 

nl_NL.ISO8859-15 

English 

Netherlands 

ISO8859-15 

Dutch (Netherlands, ISO8859-15 - Euro) 


Note -

Locale naming conventions are as follows:

language[_territory][.codeset]

where language is from ISO639 and territory is from ISO3166.

All Solaris product locales preserve the Portable Character Set characters with US-ASCII code values.

A single locale can have more than one locale name. For example, ja_JP.eucJP is the same as ja. Also, fr_FR.ISO8859-1 is the same as fr.



Note -

5601 signifies the Korean EUC codeset containing KS C 5636 and KS C 5601-1987.

eucJP signifies the Japanese EUC codeset. It contains JIS X0201-1976, JIS X0208-1983, and JIS X0212-1990.

gb2312 signifies Simplified Chinese EUC codeset, which contains GV 1988-80 and GB 2312-80.

PCK is also known as Shift JIS (SJIS).

UTF-8 is the UTF-8 of ISO/IEC 10646-1 containing various approved amendments and Unicode 3.0

GBK signifies GB extensions. This includes all GB 2312-80 characters and all Unified Han characters of ISO/IEC 10646-1, as well as Japanese Hiragana and Katagana characters. It also includes many characters of Chinese, Japanese, and Korean character sets and of ISO/IEC 10646-1.