This part introduces the features of the Traditional Chinese Solaris Operating System (Solaris OS).
The Traditional Chinese Solaris Operating System (Solaris OS) is the internationalization and localization of the Solaris Operating System and the Common Desktop Environment (CDE) window system.
This chapter describes the new features and the language support that are available in the release of the current Traditional Chinese release.
The information in this book applies to the Traditional Chinese used in Taiwan and Hong Kong .
Several new features are available in the current Solaris release of the Traditional Chinese Operating System.
UDC (user defined character) support for all of the Traditional Chinese locales, which include zh_TW.EUC, zh_TW), zh_TW.BIG5, and zh_TW.UTF-8 locales.
You can define new characters with the sdtudctool utility. You can then enter the characters with the native codepoint input method. Finally, you can use the mp command with the -u option to print the characters.
HKSCS-2001 support in the Solaris Operating System.
HKSCS-1999 is replaced by the HKSCS-2001 standard in the zh_HK.BIG5HK locale and in all the Chinese UTF-8 locales (zh_CN.UTF-8, zh_HK.UTF-8, and zh_TW.UTF-8). Changes in the HKSCS-2001 standard include:
116 new characters.
ISO 10646 mappings for most of the HKSCS-1999 characters. The mappings that are defined in the Private User Area (PUA) of ISO/IEC 10646-1:1993 have changed to the corresponding code points of Unicode Supplementary Ideograph Plane in Unicode 3.2.
Extended CNS11643 support.
In the Solaris 9 release, the zh_TW.EUC/zh_TW locale supported only the first 3 planes of CNS 11643-1992. In the current Solaris Operating System, all of the CNS 11643-1992 planes are supported.
No fonts are available in this release for the characters in the new supported planes.
Unicode 3.2 support. The zh__TW.UTF—8 and the zh_HK.UTF-8 locales have been updated to support the new 3.2 version of the Unicode Standard. The new version introduces an additional 1,016 new characters and contains various normative and informative changes.
Unicode 3.2 also defines the following UTF-8 byte sequences as invalid.
0xED as the first byte
0xA0 to 0xBF as the second byte
These sequences exclude the surrogate code points between U+D800 and U+DFFF. To comply with the new definition, the UTF-8 iconv modules have been enhanced to detect the newly defined UTF-8 invalid byte sequences.
In the current Solaris release, the mp printing utility replaces the xetops and the xutops utilities.
The xetops and xutops printing utilities are no longer supported in the Solaris Operating System. The utilities were formerly used to convert Traditional Chinese text files to PostScript. The conversion enabled the printing of Traditional Chinese characters to PostScript printers with no resident Asian fonts. The xetops utility was used in the zh_TW.EUC/zh_TW and zh_TW.BIG5 locales. The xutops utility was used in the zh_TW.UTF-8 and zh_TW.HK locales.
The mp printing utility was first released with the Solaris 9 Operating System.
The Solaris system builds inherent internationalization features into every localized product. Localization facilities support the ANSI C recommendations for internationalization and localization that define the locale and related categories.
A locale contains the culturally specific information and conventions of the language for a particular global region. Each process in the Solaris Operating System has the following set of locale attributes:
Locale settings, which provide the locale and setlocale commands you use to list and set attributes before you start a process from the command line.
For example, the Traditional Chinese locales and the English/ASCII locale both have a category that defines the display of time and date according to the cultural format, as well as the actual Traditional Chinese or English/ASCII characters for the time and date.
Code Sets, which support coding conventions for the CNS11643 and BIG5 character sets. These sets enable you to input, display, and print Traditional Chinese text in file names, system messages, and terminal (TTY), email, and data file content.
htt input method server, which handles Traditional Chinese input for the Solaris Operating System. The htt server receives your keyboard input and converts it to Traditional Chinese characters that are used in Traditional Chinese applications.
The Traditional Chinese Solaris Operating System provides simultaneous support for the locales in the following table . The locales look the same to the end user, but the internal character encoding is different.
Table 1–1 Traditional Chinese Locales
Locale |
Description |
---|---|
zh_TW.EUC (zh_TW) |
Traditional Chinese EUC locale (CNS11643-1992) |
zh_TW.BIG5 |
Traditional Chinese BIG5 |
zh_TW.UTF-8 |
Traditional Chinese UTF-8 (Unicode 3.2) |
zh_HK.BIG5HK |
Traditional Chinese (Hong Kong) BIG5-HKSCS |
zh_HK.UTF-8 |
Traditional Chinese (Hong Kong) UTF-8 (Unicode 3.2) |
The following table lists supported code sets for each Traditional Chinese locale.
Table 1–2 Traditional Chinese Code Sets
Locale |
Code Set |
---|---|
zh_TW.EUC (zh_TW) |
cns11643 |
zh_TW.BIG5 |
BIG5 |
zh_TW.UTF-8 |
UTF-8 |
zh_HK.BIG5HK |
BIG5-HKSCS |
zh_HK.UTF-8 |
Unicode 3.2 |
The Traditional Chinese Solaris Operating System provides input methods and fonts that enable you to enter, display, and print any character in any language. The following input methods are supported for the Traditional Chinese locales.
New ChuYin
ChuYin
TsangChieh
Array
Boshiamy
DaYi
JianYi
Cantonese
NeiMa (EUC, BIG5, BIG5-HKSCS)
English-Chinese
Optional codetable Input Methods, such as PinYin
For a complete list of fonts supported for the Traditional Chinese locales, see Chapter 9, Fonts.
The input method auxiliary window supports the following functions:
Input method switching
Input methods properties configuration
Lookup tables for the following character sets:
HKSCS
CNS11643
BIG5
Unicode
User-defined characters
Input method help
Virtual keyboard
You can use the following general and specific categories as defined by ANSI C for the Traditional Chinese and English locales.
General LC_ALL setting that invokes all of the categories for locale-related aspects of the system.
Specific settings for particular aspects of the system, which include:
LC_CTYPE
LC_TIME
LC_NUMERIC
LC_MONETARY
LC_COLLATE
LC_MESSAGES
For example, the Traditional Chinese and the English/ASCII locales have the LC_TIME category that defines the display of the time and date according to the cultural format, as well as the actual Traditional Chinese or English/ASCII characters used in the display.