Oracle® Java Micro Edition Embedded Client Reference Guide, Version 1.0

Chapter 7

Internationalization

Internationalization support is fully integrated in the CVM classes and packages that provide language or culture-dependent functionality.

Some character converters and locales are ROMized in the CVM. Others are supplied as JAR files that are added at run time with the -Xbootclasspath option.

7.1 ROMized Character Converters

The following character converters are supported by default and are ROMized in the CVM:

The character converters in the charsets.jar file, which is included at run time, are listed in the Character Converters table.

The following character converters are compiled in the charsets.jar file.

Character Converters (canonical names for the java.io and java.lang APIs)

Canonical Name    Description
Ascii             American Standard Code for Information Interchange
Cp1250            Windows Eastern European
Cp1251            Windows Cyrillic
Cp1252            Windows Latin-1
Cp1253            Windows Greek
Cp1254            Windows Turkish
Cp1257            Windows Baltic
ISO8859_2         Latin Alphabet No. 2
ISO8859_4         Latin Alphabet No. 4
ISO8859_5         Latin/Cyrillic Alphabet
ISO8859_7         Latin/Greek Alphabet
ISO8859_9         Latin Alphabet No. 5
ISO8859_13        Latin Alphabet No. 7
ISO8859_15        Latin Alphabet No. 9
KOI8_R            KOI8-R, Russian
Cp1255            Windows Hebrew
Cp1256            Windows Arabic
Cp1258            Windows Vietnamese
ISO8859_3         Latin Alphabet No. 3
ISO8859_6         Latin/Arabic Alphabet
ISO8859_8         Latin/Hebrew Alphabet
MS932             Windows Japanese
EUC_JP            JIS X 0201, 0208 and 0212, EUC encoding, Japanese
EUC_JP_LINUX      JIS X 0201, 0208, EUC encoding, Japanese
SJIS              Shift-JIS, Japanese
ISO2022JP         JIS X 0201, 0208, in ISO 2022 form, Japanese
MS936             Windows Simplified Chinese
GB18030           Simplified Chinese, PRC standard
EUC_CN            GB2312, EUC encoding, Simplified Chinese
GBK               GBK, Simplified Chinese
ISCII91           ISCII91 encoding of Indic scripts
MS949             Windows Korean
EUC_KR            KS C 5601, EUC encoding, Korean
ISO2022KR         ISO 2022 KR, Korean
MS950             Windows Traditional Chinese
MS950_HKSCS       Windows Traditional Chinese with Hong Kong extensions
EUC_TW            CNS11643 (Plane 1-3), EUC encoding, Traditional Chinese
Big5              Big5, Traditional Chinese
Big5_HKSCS        Big5 with Hong Kong extensions, Traditional Chinese
TIS620            TIS620, Thai
Big5_Solaris      Big5 with seven additional Hanzi ideograph character mappings for the Solaris zh_TW.BIG5 locale
Cp037             USA, Canada (Bilingual, French), Netherlands, Portugal, Brazil, Australia
Cp273             IBM Austria, Germany
Cp277             IBM Denmark, Norway
Cp278             IBM Finland, Sweden
Cp280             IBM Italy
Cp284             IBM Catalan/Spain, Spanish Latin America
Cp285             IBM United Kingdom, Ireland
Cp297             IBM France
Cp420             IBM Arabic
Cp424             IBM Hebrew
Cp437             MS-DOS United States, Australia, New Zealand, South Africa
Cp500             EBCDIC 500V1
Cp737             PC Greek
Cp775             PC Baltic
Cp838             IBM Thailand extended SBCS
Cp850             MS-DOS Latin-1
Cp852             MS-DOS Latin-2
Cp855             IBM Cyrillic
Cp856             IBM Hebrew
Cp857             IBM Turkish
Cp858             Variant of Cp850 with Euro character
Cp860             MS-DOS Portuguese
Cp861             MS-DOS Icelandic
Cp862             PC Hebrew
Cp863             MS-DOS Canadian French
Cp864             PC Arabic
Cp865             MS-DOS Nordic
Cp866             MS-DOS Russian
Cp868             MS-DOS Pakistan
Cp869             IBM Modern Greek
Cp870             IBM Multilingual Latin-2
Cp871             IBM Iceland
Cp874             IBM Thai
Cp875             IBM Greek
Cp918             IBM Pakistan (Urdu)
Cp921             IBM Latvia, Lithuania (AIX, DOS)
Cp922             IBM Estonia (AIX, DOS)
Cp930             Japanese Katakana-Kanji mixed with 4370 UDC, superset of 5026
Cp933             Korean Mixed with 1880 UDC, superset of 5029
Cp935             Simplified Chinese Host mixed with 1880 UDC, superset of 5031
Cp937             Traditional Chinese Host mixed with 6204 UDC, superset of 5033
Cp939             Japanese Latin Kanji mixed with 4370 UDC, superset of 5035
Cp942             IBM OS/2 Japanese, superset of Cp932
Cp942C            Variant of Cp942
Cp943             IBM OS/2 Japanese, superset of Cp932 and Shift-JIS
Cp943C            Variant of Cp943
Cp948             OS/2 Chinese (Taiwan), superset of 938
Cp949             PC Korean
Cp949C            Variant of Cp949
Cp950             PC Chinese (Hong Kong, Taiwan)
Cp964             AIX Chinese (Taiwan)
Cp970             AIX Korean
Cp1006            IBM AIX Pakistan (Urdu)
Cp1025            IBM Multilingual Cyrillic: Bulgaria, Bosnia, Herzegovina, Macedonia (FYR)
Cp1026            IBM Latin-5, Turkey
Cp1046            IBM Arabic - Windows
Cp1047            Latin-1 character set for EBCDIC hosts
Cp1097            IBM Iran (Farsi)/Persian
Cp1098            IBM Iran (Farsi)/Persian (PC)
Cp1112            IBM Latvia, Lithuania
Cp1122            IBM Estonia
Cp1123            IBM Ukraine
Cp1124            IBM AIX Ukraine
Cp1140            Variant of Cp037 with Euro character
Cp1141            Variant of Cp273 with Euro character
Cp1142            Variant of Cp277 with Euro character
Cp1143            Variant of Cp278 with Euro character
Cp1144            Variant of Cp280 with Euro character
Cp1145            Variant of Cp284 with Euro character
Cp1146            Variant of Cp285 with Euro character
Cp1147            Variant of Cp297 with Euro character
Cp1148            Variant of Cp500 with Euro character
Cp1149            Variant of Cp871 with Euro character
Cp1381            IBM OS/2, DOS People's Republic of China (PRC)
Cp1383            IBM AIX People's Republic of China (PRC)
Cp33722           IBM-eucJP - Japanese (superset of 5050)
ISO2022_CN_CNS    CNS11643 in ISO 2022 CN form, Traditional Chinese (conversion from Unicode only)
ISO2022_CN_GB     GB2312 in ISO 2022 CN form, Simplified Chinese (conversion from Unicode only)
JISAutoDetect     Detects and converts from Shift-JIS, EUC-JP, ISO 2022 JP (conversion to Unicode only)
MS874             Windows Thai
MacArabic         Macintosh Arabic
MacCentralEurope  Macintosh Latin-2
MacCroatian       Macintosh Croatian
MacCyrillic       Macintosh Cyrillic
MacDingbat        Macintosh Dingbat
MacGreek          Macintosh Greek
MacHebrew         Macintosh Hebrew
MacIceland        Macintosh Iceland
MacRoman          Macintosh Roman
MacRomania        Macintosh Romania
MacSymbol         Macintosh Symbol
MacThai           Macintosh Thai
MacTurkish        Macintosh Turkish
MacUkraine        Macintosh Ukraine
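
The canonical names listed above are the strings that the java.lang.String and java.io stream APIs accept as an encoding name. The following is a minimal sketch, not taken from the product documentation, of how an application might check whether a converter is available before relying on it. The class name ConverterCheck, its method names, and the default name Cp1252 are illustrative assumptions.

```java
import java.io.UnsupportedEncodingException;

// Hypothetical helper class; the name and methods are illustrative only.
final class ConverterCheck {

    /** Returns true if the runtime can resolve the given converter name. */
    static boolean isConverterAvailable(String canonicalName) {
        try {
            // Encoding an empty string is enough to force a converter lookup.
            "".getBytes(canonicalName);
            return true;
        } catch (UnsupportedEncodingException e) {
            return false;
        }
    }

    /** Encodes the text with the named converter, then decodes it again. */
    static String roundTrip(String text, String canonicalName)
            throws UnsupportedEncodingException {
        byte[] encoded = text.getBytes(canonicalName);
        return new String(encoded, canonicalName);
    }

    public static void main(String[] args) throws UnsupportedEncodingException {
        // Assumption: Cp1252 is used only as an example name from the table.
        String name = args.length > 0 ? args[0] : "Cp1252";
        if (isConverterAvailable(name)) {
            System.out.println(name + " round trip: " + roundTrip("caf\u00e9", name));
        } else {
            System.out.println(name + " is not available; check the boot class path");
        }
    }
}
```

If a converter from charsets.jar is reported as unavailable, a likely cause is that the JAR file was not added to the boot class path when the CVM was started.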

7.2 ROMized Locales

The US English locale is supported by default and is ROMized in the CVM.

The Locales table lists the other locales, which are provided in the file localedata.jar.

Locales

Locale ID  Country               Language
ar_SA      Saudi Arabia          Arabic
zh_CN      China                 Chinese (Simplified)
zh_TW      Taiwan                Chinese (Traditional)
nl_NL      Netherlands           Dutch
en_AU      Australia             English
en_CA      Canada                English
en_GB      United Kingdom        English
fr_CA      Canada                French
fr_FR      France                French
de_DE      Germany               German
iw_IL      Israel                Hebrew
hi_IN      India                 Hindi
it_IT      Italy                 Italian
ja_JP      Japan                 Japanese
ko_KR      South Korea           Korean
pt_BR      Brazil                Portuguese
es_ES      Spain                 Spanish
sv_SE      Sweden                Swedish
th_TH      Thailand              Thai (Western digits)
th_TH_TH   Thailand              Thai (Thai digits)
sq_AL      Albania               Albanian
ar_DZ      Algeria               Arabic
ar_BH      Bahrain               Arabic
ar_EG      Egypt                 Arabic
ar_IQ      Iraq                  Arabic
ar_JO      Jordan                Arabic
ar_KW      Kuwait                Arabic
ar_LB      Lebanon               Arabic
ar_LY      Libya                 Arabic
ar_MA      Morocco               Arabic
ar_OM      Oman                  Arabic
ar_QA      Qatar                 Arabic
ar_SD      Sudan                 Arabic
ar_SY      Syria                 Arabic
ar_TN      Tunisia               Arabic
ar_AE      United Arab Emirates  Arabic
ar_YE      Yemen                 Arabic
be_BY      Belorussia            Belorussian
bg_BG      Bulgaria              Bulgarian
ca_ES      Spain                 Catalan
zh_HK      Hong Kong             Chinese
hr_HR      Croatia               Croatian
cs_CZ      Czech Republic        Czech
da_DK      Denmark               Danish
nl_BE      Belgium               Dutch
en_IN      India                 English
en_IE      Ireland               English
en_NZ      New Zealand           English
en_ZA      South Africa          English
et_EE      Estonia               Estonian
fi_FI      Finland               Finnish
fr_BE      Belgium               French
fr_LU      Luxembourg            French
fr_CH      Switzerland           French
de_AT      Austria               German
de_LU      Luxembourg            German
de_CH      Switzerland           German
el_GR      Greece                Greek
hu_HU      Hungary               Hungarian
is_IS      Iceland               Icelandic
it_CH      Switzerland           Italian
lv_LV      Latvia                Latvian
lt_LT      Lithuania             Lithuanian
mk_MK      Macedonia             Macedonian
no_NO      Norway                Norwegian (Bokmål)
no_NO_NY   Norway                Norwegian (Nynorsk)
pl_PL      Poland                Polish
pt_PT      Portugal              Portuguese
ro_RO      Romania               Romanian
ru_RU      Russia                Russian
sr_YU      Yugoslavia            Serbian (Cyrillic)
sh_YU      Yugoslavia            Serbo-Croatian
sk_SK      Slovakia              Slovak
sl_SI      Slovenia              Slovenian
es_AR      Argentina             Spanish
es_BO      Bolivia               Spanish
es_CL      Chile                 Spanish
es_CO      Colombia              Spanish
es_CR      Costa Rica            Spanish
es_DO      Dominican Republic    Spanish
es_EC      Ecuador               Spanish
es_SV      El Salvador           Spanish
es_GT      Guatemala             Spanish
es_HN      Honduras              Spanish
es_MX      Mexico                Spanish
es_NI      Nicaragua             Spanish
es_PA      Panama                Spanish
es_PY      Paraguay              Spanish
es_PE      Peru                  Spanish
es_PR      Puerto Rico           Spanish
es_UY      Uruguay               Spanish
es_VE      Venezuela             Spanish
tr_TR      Turkey                Turkish
uk_UA      Ukraine               Ukrainian
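
Because only the US English locale is ROMized, an application that formats text for another locale may want to confirm that the locale data is actually present at run time. The following sketch, which is an illustration rather than part of the product, uses the standard java.text.NumberFormat and java.util.Locale classes. The class name LocaleCheck and its method names are assumptions made for this example.

```java
import java.text.NumberFormat;
import java.util.Locale;

// Hypothetical helper class; the name and methods are illustrative only.
final class LocaleCheck {

    /** Returns true if NumberFormat reports locale data for the given locale. */
    static boolean isLocaleAvailable(Locale wanted) {
        Locale[] available = NumberFormat.getAvailableLocales();
        for (int i = 0; i < available.length; i++) {
            if (available[i].equals(wanted)) {
                return true;
            }
        }
        return false;
    }

    /** Formats a number using the conventions of the given locale. */
    static String formatNumber(double value, Locale locale) {
        return NumberFormat.getNumberInstance(locale).format(value);
    }

    public static void main(String[] args) {
        // Locale.GERMANY corresponds to the de_DE entry in the Locales table.
        Locale[] toCheck = { Locale.US, Locale.GERMANY };
        for (int i = 0; i < toCheck.length; i++) {
            Locale locale = toCheck[i];
            if (isLocaleAvailable(locale)) {
                System.out.println(locale + ": " + formatNumber(1234.5, locale));
            } else {
                System.out.println(locale + ": locale data not found");
            }
        }
    }
}
```

If a requested locale is not available, the Java formatting classes generally fall back to a less specific locale instead of failing, so an explicit check like this one can make a missing locale JAR easier to notice.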

7.3 Using Non-ROMized Locales and Character Converters

To start the CVM with the locales and charsets enabled, use the following command:

 path/bin/cvm -Xbootclasspath/p:/emulator-platform/lib

path is the path to the CVM lib/ directory containing the files charsets.jar and locales.jar.

More details about internationalization can be found at the following link: http://java.sun.com/docs/books/tutorial/i18n/