In December 1995, the Korean government announced a standard Korean codeset, KS C 5700, which is based on ISO 10646-1/Unicode 2.0.
The ISO-10646 character set uses 2 (UCS-2); Universal Character Set (two-byte form) or 4 (UCS-4) bytes to represent each character.
The ISO-10646 character set cannot be used directly on IBM-PC-based operating systems. For example, the kernel and many other modules of the Solaris operating environment interpret certain byte values as control instructions, such as a null character (0x00) in any string. The ISO-10646 character set can be encoded with any bit combinations in the first or subsequent bytes. The ISO-10646 characters cannot be freely transmitted through the Solaris system with these limitations. In order to establish a migration path, the ISO-10646 character set defines the UCS Transformation Format (UTF), which recodes the ISO-10646 characters without using C0 controls (0x00..0x1F), C1 controls (0x80..0x9F), space (0x20), and DEL (0x7F).
The ko.UTF-8 is a Solaris locale to support KSC-5700, the Korean standard codeset. It supports all characters in the previous KSC 5601 and all 11,172 Korean characters. Korean UTF-8 supports the Korean language-related ISO-10646 characters and fonts. Because ISO-10646 covers all characters in the world, all of the various input methods and fonts are supplied so that you can input and output any character in any language. Before Universal UTF/UCS becomes available, Korean UTF-8 supports the ISO-10646 code subset that is related to Korean characters as well as all other characters in the previous Korean standard codeset, and Extended ASCII.
In the ko locale, the EUC scheme is used to encode KSC 5601-1987. The ko.UTF-8 locale supports the KSC 5700-1995/Unicode 2.0 codeset, which is a super set of KSC 5601-1987. These two locales look the same to the end user, but the internal character encoding is different. The Korean Solaris product supports the following Input Methods:
For the ko locale:
Hangul 2-BeolSik (1 set of consonants and 1 set of vowels)
Hangul-Hanja conversion
Special character
Hexadecimal code
For the ko.UTF-8 locale:
Hangul 2-BeolSik (1 set of consonants and 1 set of vowels)
Hangul-Hanja conversion
Special character
Hexadecimal code
Full Family Name |
Subfamily |
Format |
Vendor |
Encoding |
---|---|---|---|---|
Gothic | R | CID/Type 1 | Hanyang | Adobe-Korean |
Graphic | R | CID/Type 1 | Hanyang | Adobe-Korean |
Haeso | R | CID/Type 1 | Hanyang | Adobe-Korean |
Kodig | R | CID/Type 1 | Hanyang | Adobe-Korean |
Myeongijo | R | CID/Type 1 | Hanyang | Adobe-Korean |
Pilki | R | CID/Type 1 | Hanyang | Adobe-Korean |
Roundgothic | R | CID/Type 1 | Hanyang | Adobe-Korean |
Table 3-33 Solaris 8 Korean Bitmap Fonts for the ko Locale
Full Family Name |
Subfamily |
Format |
Encoding |
---|---|---|---|
Gothic | R/B | PCF (12,14,16,18,20,24) | KSC 5601-1987 |
Graphic | R/B | PCF (12,14,16,18,20,24) | KSC 5601-1987 |
Haeso | R/B | PCF (12,14,16,18,20,24) | KSC 5601-1987 |
Kodig | R/B | PCF (12,14,16,18,20,24) | KSC 5601-1987 |
Myeongijo | R/B | PCF (12,14,16,18,20,24) | KSC 5601-1987 |
Pilki | R/B | PCF (12,14,16,18,20,24) | KSC 5601-1987 |
Roundgothic | R/B | PCF (12,14,16,18,20,24) | KSC 5601-1987 |
Table 3-34 Solaris 8 Korean CID/Type 1 Fonts for the ko.UTF-8 Locale
Full Family Name |
Subfamily |
Format |
Vendor |
Encoding |
---|---|---|---|---|
Gothic | R | CID/Type 1 | Hanyang | Adobe-Korean |
Graphic | R | CID/Type 1 | Hanyang | Adobe-Korean |
Haeso | R | CID/Type 1 | Hanyang | Adobe-Korean |
Kodig | R | CID/Type 1 | Hanyang | Adobe-Korean |
Myeongijo | R | CID/Type 1 | Hanyang | Adobe-Korean |
Pilki | R | CID/Type 1 | Hanyang | Adobe-Korean |
Table 3-35 Solaris 8 Korean Bitmap Fonts for the ko.UTF-8 Locale
Full Family Name |
Subfamily |
Format |
Encoding |
---|---|---|---|
Gothic | R/B | PCF (12,14,16,18,20,24) | KSC 5601-1992 (Johap) |
Graphic | R/B | PCF (12,14,16,18,20,24) | KSC 5601-1992 (Johap) |
Haeso | R/B | PCF (12,14,16,18,20,24) | KSC 5601-1992 (Johap) |
Kodig | R/B | PCF (12,14,16,18,20,24) | KSC 5601-1992 (Johap) |
Myeongijo | R/B | PCF (12,14,16,18,20,24) | KSC 5601-1992 (Johap) |
Pilki | R/B | PCF (12,14,16,18,20,24) | KSC 5601-1992 (Johap) |
Table 3-36 Solaris 8 Korean TrueType Fonts for the ko/ko.UTF-8 Locales
Full Family Name |
Subfamily |
Format |
Vendor |
Encoding |
---|---|---|---|---|
Kodig/Gothic |
R |
True Type |
Hanyang |
Unicode |
Myeongjo |
R |
True Type |
Hanyang |
Unicode |
Haeso |
R |
True Type |
Hanyang |
Unicode |
RoundGothic |
R |
True Type |
Hanyang |
Unicode |
Table 3-37 Korean ICONV
Code |
Symbol |
Target Code |
Symbol |
---|---|---|---|
KSC 5601-1987 | 1506 | UTF-8 | UTF-8 |
ISO 646 | 646 | KSC 5601-1987 | 5601 |
KSC 5601-1987 | EUC-KR | UTF-8 | UTF-8 |
KSC 5601-1987 | KSC5601 | UTF-8 | UTF-8 |
UTF-8 | UTF-8 | KSC 5601-1987 | 5601 |
UTF-8 | UTF-8 | KSC 5601-1987 | EUC-KR |
UTF-8 | UTF-8 | KSC 5601-1987 | KSC 5601 |
UTF-8 | ko-KR-UTF-8 | IBM CP 933 | cp 933 |
UTF-8 | ko-KR-UTF-8 | KSC 5601-1987 | ko_KR-euc |
UTF-8 | ko-KR-UTF-8 | ISO2022-KR | ko_KR-iso2022-7 |
UTF-8 | ko-KR-UTF-8 | KSC 5601-1987 - Johap | ko_KR-johap |
UTF-8 | ko-KR-UTF-8 | KSC5601-1992 - Johap | ko_KR-johap92 |
IBM CP933 | cp933 | UTF-8 | ko_KR-UTF-8 |
KSC 5601-1987 | ko_KR-euc | UTF-8 | ko_KR-UTF-8 |
KSC 5601-1987 | ko_KR-euc | ISO 2022-KR | ko_KR-iso2022-7 |
KSC 5601-1987 | ko_KR-euc | KSC 5601-1987 - Johap | ko_KR-johap |
KSC 5601-1987 | ko_KR-euc | KSC 5601-1992 - Johap | ko_KR-johap92 |
KSC 5601-1987 | ko_KR-euc | KSC 5601-1992-Annex:4 | ko_KR-nbyte |
ISO 2022-KR | iso2022-7 | UTF-8 | ko_KR-UTF-8 |
ISO 2022-KR | iso2022-7 | KSC 5601-1987 | ko_KR-euc |
KSC 5601-1987 - Johap | ko-KR-johap | UTF-8 | ko_KR-UTF-8 |
KSC 5601-1987 - Johap | ko-KR-johap | KSC 5601-1987 | ko_KR-euc |
KSC 5601-1992 - Johap | ko-KR-johap92 | UTF-8 | ko_KR-UTF-8 |
KSC 5601-1992 - Johap | ko-KR-johap92 | KSC 5601-1987 | ko_KR-euc |
KSC 5601-1992 - Annex:4 | ko-KR-nbyte | KSC 5601-1987 | ko_KR-euc |