Table 3-6 shows the Asian locales supported by these Asian products.
Table 3-5 Summary of Asian Locales
In December 1995, the Korean government announced a standard Korean codeset, KSC-5700, which is based on ISO-10646-1/Unicode 2.0. The standard codeset replaces KSC 5601, which was based on ISO-2022.
The ISO-10646 character set uses 2 (UCS-2; Universal Character Set two-byte form) or 4 (UCS-4) bytes to represent each character.
The ISO-10646 character set cannot be used directly on IBM-PC-based operating systems. For example, the kernel and many other modules of the Solaris operating environment interpret certain byte values as control instructions, such as a null character (0x00) in any string. The ISO-10646 character set can be encoded with any bit combinations in the first or subsequent bytes. The ISO-10646 characters cannot be freely transmitted through the Solaris system with these limitations. In order to establish a migration path, the ISO-10646 character set defines the UCS Transformation Format (UTF), which recodes the ISO-10646 characters without using C0 controls (0x00..0x1F), C1 controls (0x80..0x9F), space (0x20), and DEL (0x7F).
The ko.UTF-8 is a Solaris locale to support KSC-5700, the Korean standard codeset. It supports all characters in the previous KSC 5601 and all 11,172 Korean characters. Korean UTF-8 supports the Korean language-related ISO-10646 characters and fonts. Because ISO-10646 covers all characters in the world, all of the various input methods and fonts are supplied so that you may input and output any character in any language. Before Universal UTF/UCS becomes available, Korean UTF-8 supports the ISO-10646 code subset that is related to Korean characters as well as all other characters in the previous Korean standard codeset, and Extended ASCII.
Table 3-6 lists the Korean codesets.
Table 3-6 Codeset Conversions Supported for Korean ko, ko.UTF-8
Code |
Symbol |
TargetCode |
Symbol |
---|---|---|---|
UTF-8 |
ko_KR-UTF-8 |
Wansung |
ko_KR-euc |
UTF-8 |
ko_KR-UTF-8 |
Johap |
ko_KR-johap92 |
UTF-8 |
ko_KR-UTF-8 |
Packed |
ko_KR-johap |
UTF-8 |
ko_KR-UTF-8 |
ISO-2022-KR |
ko_KR-iso2022-7 |
Wansung |
ko_KR-euc |
UTF-8 |
ko_KR-UTF-8 |
Johap |
ko_KR-johap92 |
UTF-8 |
ko_KR-UTF-8 |
Packed |
ko_KR-johap |
UTF-8 |
ko_KR-UTF-8 |
ISO-2022-KR |
ko_KR-iso2022-7 |
UTF-8 |
ko_KR-UTF-8 |
Wansung |
ko_KR-euc |
Johap |
ko_KR-johap92 |
Wansung |
ko_KR-euc |
Packed |
ko_KR-johap |
Wansung |
ko_KR-euc |
N-Byte |
ko_KR-nbyte |
Wansung |
ko_KR-euc |
ISO-2022-KR |
ko_KR-iso2022-7 |
Johap |
ko_KR-johap92 |
Wansung |
ko_KR-euc |
Packed |
ko_KR-johap |
Wansung |
ko_KR-euc |
N-Byte |
ko_KR-nbyte |
Wansung |
ko_KR-euc |
ISO-2022-KR |
ko_KR-iso2022-7 |
Wansung |
ko_KR-euc |
Simplified Chinese in the Solaris 7 environment provides two locales: zh and zh.GBK. In the zh locale, the EUC scheme is usesd to encode GB2312-80 The zh.GBK locale supports the GBK codeset, which is a superset of GB2312-80.
Simplified Chinese is used mostly in the People's Republic of China (PRC) and in Singapore..
The following input methods are supported for the zh locale
New QuanPin
New ShuangPin
Quanpy
Location
PinYin
Stroke
Golden
Intelligent Pinyin
Simplified Chinese Symbol
The following input methods are supported for the zh.GBK locale
New QuanPin
New ShuangPin
Quanpy
GBK Code
Japanese
Hanja
Zhuyin
Unicode
Table 3-7 shows the TrueType Fonts for the zh Locale
Table 3-7 Solaris 7 TrueType Fonts for the zh LocaleFull Family Name | Subfamily | Format | Vendor | Encoding |
---|---|---|---|---|
Fangsong | R | TrueType | Hanyi | GB2312.1980 |
Hei | R | TrueType | Monotype | GB2312.1980 |
Kai | R | TrueType | Monotype | GB2312.1980 |
Song | R | TrueType | Monotype | GB2312.1980 |
Table 3-8 shows the Bitmap Fonts for the zh Locale
Table 3-8 Solaris 7 Bitmap Fonts for the zh LocaleFull Family Name | Subfamily | Format | Encoding |
---|---|---|---|
Song | B | PCF (14,16) | GB2312.1980 |
Song | R | PCF (12,14,16,20,24) | GB2312.1980 |
Table 3-9 shows the TrueType Fonts for the zh.GBK Locale
Table 3-9 TrueType Fonts for the zh.GBK LocaleFull Family NameS | Subfamily | Format | Vendor | Encoding |
---|---|---|---|---|
Fansong | R | TrueType | Zhongyi | GBK |
Hei | R | TrueType | Zhongyi | GBK |
Kai | R | TrueType | Zhongyi | GBK |
Song | R | TrueType | Zhongyi | GBK |
Table 3-10 shows the Bitmap Fonts for the zh.GBK Locale
Table 3-10 Bitmap Fonts for the zh.GBK LocaleFull Family Name | Subfamily | Format | Encoding |
---|---|---|---|
Song | R | PCF (12,14,16,20,24) | GBK |
Table 3-11 shows the supported codeset conversions for Simplified Chinese.
Table 3-11 Codeset Conversions for Simplified Chinese
Code |
Symbol |
TargetCode |
Symbol |
---|---|---|---|
GB2312-80 |
zh_CN.euc |
ISO 2022-7 |
zh_CN.iso2022-7 |
ISO 2022-7 |
zh_CN.iso2022-7 |
GB2312-80 |
zh_CN.euc |
GB2312-80 |
zh_CN.euc |
ISO 2022-CN |
zh_CN.iso2022-CN |
ISO-2022-CN |
zh_CN.iso2022-CN |
GB2312-80 |
zh_CN.euc |
UTF-8 |
UTF-8 |
GB2312-80 |
zh_CN.euc |
GB2312-80 |
zh_CN.euc |
UTF-8 |
UTF-8 |
zh.GBK |
zh_CN.gbk |
ISO2022-CN |
zh_CN.iso2022-CN |
ISO2022-CN | zh_CN.iso2022-CN | zh.GBK | zh_CN.gbk |
zh.GBK |
zh_CN.gbk |
Big-5 |
zh_TW-Big5 |
Big-5 |
zh_TW-Big5 |
zh.GBK |
zh_CN.gbk |
GB2312-80 |
zh_CN.euc |
Big-5 |
zh_TW-Big5 |
Big-5 | zh_TW-Big5 |
GB2312-80 |
zh_CN.euc |
UTF-8 |
UTF-8 | zh.GBK |
zh_CN.gbk |
zh.GBK |
zh_CN.gbk |
UTF-8 |
UTF-8 |
UTF-8 |
UTF-8 |
ISO2022-CN |
zh_CN.iso2022-CN |
ISO2022-CN |
zh_CN.iso2022-CN |
UTF-8 |
UTF-8 |
Traditional Chinese in the Solaris 7 product provides two locales: zh_TW and zh_TW.BIG5. In the zh_TW locale, the EUC scheme is used to encode CNS 11643.1992 codeset. The zh_TW.BIG5 locale supports the Big-5 codeset.
Traditional Chinese is used mostly in Taiwan and Hong Kong.
Traditional Chinese supports the following input methods:
Chuyin
I-Tien
Telecode
TsangChieh
CheinI
NeiMa
ChuangHsing
Array
BoShiaMy
DaYi
Table 3-12 shows Traditional Chinese Truetype Fonts for the zh_TW Locales
Full Family Name | Subfamily | Format | Vendor | Encoding |
---|---|---|---|---|
Hei | R | Truetype | Hanyi | CNS11643.1992 |
Kai | R | Truetype | Hanyi | CNS11643.1992 |
Ming | R | Truetype | Hanyi | CNS11643.1992 |
Table 3-13 shows the Traditional Chinese BitMap Fonts for the zh_TW Locales
Table 3-13 Traditional Chinese BitMap Fonts for the zh_TW LocalesFull Family Name | Subfamily | Format | Encoding |
---|---|---|---|
Ming | R | PCF (12,14,16,20,24) | CNS11643.1992 |
Table 3-14 shows the Traditional Chinese TrueType Fonts for the zh_TW.BIG5 Locales
Table 3-14 Traditional Chinese TrueType Fonts for the zh_TW.BIG5 LocalesFull Family Name | Subfamily | Format | Vendor | Encoding |
---|---|---|---|---|
Hei | R | TrueType | Hanyi | Big5 |
Kai | R | TrueType | Hanyi | Big5 |
Ming | R | TrueType | Hanyi | Big5 |
Table 3-15 shows the Traditional Chinese BitMap Fonts for the zh_TW.BIG5 Locales
Table 3-15 Traditional Chinese BitMap Fonts for the zh_TW.BIG5 LocalesFull Family Name | Subfamily | Format | Encoding |
---|---|---|---|
Ming | R | PCF (12,14,16,20,24) | Big5 |
Table 3-16 shows the supported codeset conversions for Traditional Chinese.
Table 3-16 Codeset Conversions for Traditional Chinese
Code |
Symbol |
TargetCode |
Symbol |
---|---|---|---|
CNS 11643 |
zh_TW-euc |
Big-5 |
zh_TW-Big5 |
CNS 11643 |
zh_TW-euc |
ISO 2022-7 |
zh_TW-iso2022-7 |
Big-5 |
zh_TW-Big5 |
CNS 11643 |
zh_TW-euc |
Big-5 |
zh_TW-Big5 |
ISO 2022-7 |
zh_TW-iso2022-7 |
ISO 2022-7 |
zh_TW-iso2022-7 |
CNS 11643 |
zh_TW-euc |
ISO 2022-7 |
zh_TW-iso2022-7 |
Big-5 |
zh_TW-Big5 |
CNS 11643 |
zh_TW-eu |
ISO 2022-CN-EXT |
zh_TW-iso2022-CN-EXT |
ISO 2022-CN-EXT |
zh_TW-iso2022-CN-EXT |
CNS 11643 |
zh_TW-euc |
Big-5 |
zh_TW-Big5 |
ISO 2022-CN |
zh_TW-iso2022-CN |
ISO 2022-CN |
zh_TW-iso2022-CN |
Big-5 |
zh_TW-Big5 |
UTF-8 |
UTF-8 |
CNS 11643 |
zh_TW-euc |
CNS 11643 |
zh_TW-euc |
UTF-8 |
UTF-8 |
UTF-8 |
UTF-8 |
Big-5 |
zh_TW-Big5 |
Big-5 |
zh_TW-Big5 |
UTF-8 |
UTF-8 |
UTF-8 |
UTF-8 |
ISO 2022-7 |
zh_TW-iso2022-7 |
ISO 2022-7 |
zh_TW-iso2022-7 |
UTF-8 |
UTF-8 |
ISO 2022-CN-EXT |
zh_TW-iso2022-CN-EX |
Big-5 |
zh_TW-Big5 |
Big-5 |
zh_TW-Big5 |
ISO 2022-CN-EXT |
zh_TW-iso2022-CN-EXT |
Three Japanese input systems are bundled in Japanese Solaris 7. They can be used in the ja, ja_JP.PCK and ja_JP.UTF-8 locales. However, some maintenance utilities do not support the PCK codeset.
The Japanese Input System is shown below in Table 3-17.
Table 3-17 Japanese Input Systems
Japanese TrueType Fonts are show below in Table 3-18.
Table 3-18 Japanese TrueType Fonts
Full Family Name |
Subfamily |
Format |
Vendor |
Encoding |
---|---|---|---|---|
hg gothic b |
R |
TrueType |
RICOH |
JISX0208.1983, JISX0201.1976 |
hg mincho l |
R |
TrueType |
RICOH |
JISX0208.1983, JISX0201.1976 |
heiseimin |
R |
TrueType |
RICOH |
JISX0212.1990 |
Japanese Bitmap Fonts are shown in Table 3-19 below.
Table 3-19 Japanese Bitmap Fonts
Full Family Name |
Subfamily |
Format |
Vendor |
Encoding |
---|---|---|---|---|
gothic |
R, B |
PCF(12,14,16,20,24) |
|
JISX0208.1983, JISX0201.1976 |
minchou |
R |
PCF(12,14,16,20,24) |
|
JISX0208.1983, JISX0201.1976 |
hg gothic b |
R |
PCF(12,14,16,18,20,24) |
RICOH |
JISX0208.1983, JISX0201.1976 |
hg mincho l |
R |
PCF(12,14,16,18,20,2) |
RICOH |
JISX0208.1983, JISX0201.1976 |
heiseimin |
R |
PCF(12,14,16,18,20,24) |
RICOH |
JISX0212.1990 |
Japanese Solaris 7 supports three locales. The ja locale is based on Japanese EUC. The ja_JP.PCK locale is based on PC-Kanji code (Shift JIS) and the ja_JP.UTF-8 locale is based on UTF-8.
Some messages and manual pages have been translated into Japanese in Japanese Solaris 7.
The following table shows supported conversion with iconv(1) and iconv(3). See the iconv_ja(5)man page for details.
Table 3-20 shows iconv Conversion Support.
Table 3-20 iconv Conversion Support
Source Code |
Target Code |
---|---|
eucJP |
PCK |
eucJP |
JIS7 |
eucJP |
SJIS |
eucJP |
UTF-8 |
eucJP |
jis |
eucJP |
ibmj |
SJIS |
eucJP |
SJIS |
ISO-2022-JP |
SJIS |
UTF-8 |
SJIS |
jis |
SJIS |
ibmj |
PCK |
eucJP |
PCK |
UTF-8 |
PCK |
ISO-2022-JP |
PCK |
jis |
PCK |
ibmj |
ISO-2022-JP |
eucJP |
ISO-2022-JP |
PCK |
ISO-2022-JP |
SJIS |
UTF-8 |
eucJP |
UTF-8 |
SJIS |
UTF-8 |
PCK |
JIS7 |
eucJP |
jis |
eucJP |
jis |
PCK |
jis |
SJIS |
ibmj |
eucJP |
ibmj |
PCK |
UTF-8 |
ISO-2022-JP |
ISO-2022-JP |
UTF-8 |
eucJP |
UTF-8-Java |
UTF-8-Java |
eucJP |
PCK |
UTF-8-Java |
UTF-8-Java |
PCK |
eucJP |
ISO-2022-JP.RFC1468 |
PCK |
ISO-2022-JP.RFC1468 |
UTF-8 |
ISO-2022-JP.RFC1468 |
eucJP |
ibmj-EBCDIK |
ibmj-EBCDIK |
eucJP |
PCK |
ibmj-EBCDIK |
ibmj-EBCDIK |
PCK |
There are TTY STREAMS modules that perform code conversion between an encoding for a specific terminal and an encoding for a specific locale. With an appropriate STREAMS module, a user can log in from a Japanese terminal into a Japanese locale, even if the encoding between the terminal and the Japanese locale does not match. tty(1) controls the behavior of those STREAMS modules.
The Japanese Solaris 7 product supports the following Japanese-specific printers:
Epson VP-5085 (based on ESC/P)
NEC PC-PR201 (based on 201PL)
Canon LASERSHOT (based on LIPS)
Japanese PostScript Printer
The Japanese Solaris 7 package also provides Japanese Solaris 1.1.x binary-compatibility packages that are the same as the base products.
To handle User-Defined Characters, sdtudctool has been available since the Solaris 2.6 release. Sdtudctool handles both outline (Type1) and bitmap (PCF) fonts. Some utilities are also available to migrate the UDC fonts that were created by old utilities, such as fontedit, type3creator andfontmanager in prior releases.