Support for
the New Chinese GB18030-2000 Character Set
The Solaris platform enables input, display,
and print for the entire GB18030-2000 character set, including
nearly 30,000 characters. Any application that runs on the Solaris platform
can thus benefit from a wider set of Chinese characters. These glyphs are
primarily Han characters, but the encoding also includes glyphs for minority
languages such as Tibetan, Wei, Yi, and Mongolian.
GB18030-2000 support in the Solaris 9 release also includes backward
compatibility to previous Chinese codesets, GBK and GB2312. Conversion to other codesets such as Unicode is also included.
Solaris developers do not need to change their procedures to access this new
encoding support. Standard toolkits can use the new support.
For Java applications that need GB18030-2000 support, review
J2SE at the following Web site:
http://java.sun.com/j2se/1.4
|
New zh_CN.GB18030 Locale Enhanced From zh_CN.GBK
This new zh_CN.GB18030
locale is provided to support the new GB18030 standard encoding. This encoding
is required by a law that the Chinese government has enacted.
|
New Chinese
and Korean Collation Locales
Collation locales provide
different collation options to users, such as stroke count and radical, phonetic,
and dictionary options.
|
Wordbreaker
Modules for Thai
The wordbreaker module is used to
correctly break the Thai text into proper paragraphs, sentences, and words
within Motif.
|
New Asian UTF-8
(Unicode) Locales
The File System Safe Universal Transformation
Format, or UTF-8, is an encoding that is defined by X/Open® as a multibyte representation of Unicode. UTF-8 encompasses
almost all of the characters for traditional single-byte and multibyte locales
for European and Asian languages for Solaris locales.
-
th_TH.UTF-8 locale is the Unicode locale
for Thailand.
-
hi_IN.UTF-8 locale is the Unicode locale
for India.
-
zh_HK.UTF-8 locale is the Unicode locale
for Hong Kong, China.
|
New Thai Input
Method
The new Thai input method supports the Thai
input sequence checking, as defined in TIS 1566-2541 "Thai input/output methods
for computer" standard (or “WTT”) by the Thai Industrial Standards
Institute. The input sequence checking has 3 levels: level 0 (passthrough),
level 1 (basic check), and level 2 (strict).
|
New Chinese
Input Methods
More popular, powerful input methods
(IM) in the Traditional Chinese and Simplified Chinese locales have been added
for new character sets and new locales:
-
New Chuyin IM for Traditional Chinese locales
-
Cantonese IM for all Chinese locales
-
English-Chinese IM for all Chinese locales
|
New Auxiliary
Window for Chinese Input Methods
The Auxiliary window
provides a user interface (UI) for input methods. The UI is “friendly”
and extensible for all Chinese locales. New functions that are supported by
the auxiliary window are as follows:
-
Input method switching
-
Input method properties configuration
-
Look-up tables for GB2312, GBK, GB18030, HKSCS, CNS, Big–5,
and Unicode character sets
-
Code table management tool
-
A visual keyboard
|
New zh_HK.BIG5HK Locale to Support HKSCS for
Hong Kong, China
The new zh_HK.BIG5HK
locale is provided to support the Hong Kong Supplementary Character Set (HKSCS).
The HKSCS is a supplementary character set of the Big-5 and ISO 10646 coding
schemes. HKSCS contains Chinese characters that are needed in Chinese computing
in Hong Kong. However, these characters are not contained in either the Big-5
or ISO 10646 standard character sets.
|
Additional Japanese iconv Modules
Additional Japanese language support includes iconv code
conversions between Solaris Japanese locale codesets and Japanese mainframe
codesets. Japanese locale codesets include eucJP, PCK, and UTF-8. Japanese
mainframe codesets include Fujitsu JEF, Hitachi KEIS, and NEC JIPS.
|