International Language Environments Guide

Korean Localization

In December 1995, the Korean government announced a standard Korean codeset, KS C 5700, which is based on ISO 10646-1/Unicode 2.0.

The ISO-10646 character set uses 2 (UCS-2); Universal Character Set (two-byte form) or 4 (UCS-4) bytes to represent each character.

The ISO-10646 character set cannot be used directly on IBM-PC-based operating systems. For example, the kernel and many other modules of the Solaris operating environment interpret certain byte values as control instructions, such as a null character (0x00) in any string. The ISO-10646 character set can be encoded with any bit combinations in the first or subsequent bytes. The ISO-10646 characters cannot be freely transmitted through the Solaris system with these limitations. In order to establish a migration path, the ISO-10646 character set defines the UCS Transformation Format (UTF), which recodes the ISO-10646 characters without using C0 controls (0x00..0x1F), C1 controls (0x80..0x9F), space (0x20), and DEL (0x7F).

The ko.UTF-8 is a Solaris locale to support KSC-5700, the Korean standard codeset. It supports all characters in the previous KSC 5601 and all 11,172 Korean characters. Korean UTF-8 supports the Korean language-related ISO-10646 characters and fonts. Because ISO-10646 covers all characters in the world, all of the various input methods and fonts are supplied so that you can input and output any character in any language. Before Universal UTF/UCS becomes available, Korean UTF-8 supports the ISO-10646 code subset that is related to Korean characters as well as all other characters in the previous Korean standard codeset, and Extended ASCII.

In the ko locale, the EUC scheme is used to encode KSC 5601-1987. The ko.UTF-8 locale supports the KSC 5700-1995/Unicode 2.0 codeset, which is a super set of KSC 5601-1987. These two locales look the same to the end user, but the internal character encoding is different. The Korean Solaris product supports the following Input Methods:

For the ko locale:

For the ko.UTF-8 locale:

Table 3-32 Solaris 8 Korean CID/Type 1 Fonts for the ko Locale

Full Family Name 

Subfamily 

Format 

Vendor 

Encoding 

 Gothic R CID/Type 1 Hanyang Adobe-Korean
 Graphic R CID/Type 1 Hanyang Adobe-Korean
 Haeso R CID/Type 1 Hanyang Adobe-Korean
 Kodig R CID/Type 1 Hanyang Adobe-Korean
 Myeongijo R CID/Type 1 Hanyang Adobe-Korean
 Pilki R CID/Type 1 Hanyang Adobe-Korean
 Roundgothic R CID/Type 1 Hanyang Adobe-Korean

Table 3-33 Solaris 8 Korean Bitmap Fonts for the ko Locale

Full Family Name 

Subfamily 

Format 

Encoding 

 Gothic R/B PCF (12,14,16,18,20,24) KSC 5601-1987
 Graphic R/B PCF (12,14,16,18,20,24) KSC 5601-1987
 Haeso R/B PCF (12,14,16,18,20,24) KSC 5601-1987
 Kodig R/B PCF (12,14,16,18,20,24) KSC 5601-1987
 Myeongijo R/B PCF (12,14,16,18,20,24) KSC 5601-1987
 Pilki R/B PCF (12,14,16,18,20,24) KSC 5601-1987
 Roundgothic R/B PCF (12,14,16,18,20,24) KSC 5601-1987

Table 3-34 Solaris 8 Korean CID/Type 1 Fonts for the ko.UTF-8 Locale

Full Family Name 

Subfamily 

Format 

Vendor 

Encoding 

 Gothic R CID/Type 1 Hanyang Adobe-Korean
 Graphic R CID/Type 1 Hanyang Adobe-Korean
 Haeso R CID/Type 1 Hanyang Adobe-Korean
 Kodig R CID/Type 1 Hanyang Adobe-Korean
 Myeongijo R CID/Type 1 Hanyang Adobe-Korean
 Pilki R CID/Type 1 Hanyang Adobe-Korean

Table 3-35 Solaris 8 Korean Bitmap Fonts for the ko.UTF-8 Locale

Full Family Name 

Subfamily 

Format 

Encoding 

 Gothic R/B PCF (12,14,16,18,20,24) KSC 5601-1992 (Johap)
 Graphic R/B PCF (12,14,16,18,20,24) KSC 5601-1992 (Johap)
 Haeso R/B PCF (12,14,16,18,20,24) KSC 5601-1992 (Johap)
 Kodig R/B PCF (12,14,16,18,20,24) KSC 5601-1992 (Johap)
 Myeongijo R/B PCF (12,14,16,18,20,24) KSC 5601-1992 (Johap)
 Pilki R/B PCF (12,14,16,18,20,24) KSC 5601-1992 (Johap)

Table 3-36 Solaris 8 Korean TrueType Fonts for the ko/ko.UTF-8 Locales

Full Family Name 

Subfamily 

Format 

Vendor 

Encoding 

Kodig/Gothic 

True Type 

Hanyang 

Unicode 

Myeongjo 

True Type 

Hanyang 

Unicode 

Haeso 

True Type 

Hanyang 

Unicode 

RoundGothic 

True Type 

Hanyang 

Unicode 

Table 3-37 Korean ICONV

Code 

Symbol 

Target Code 

Symbol 

 KSC 5601-1987 1506 UTF-8 UTF-8
 ISO 646 646 KSC 5601-1987 5601
 KSC 5601-1987 EUC-KR UTF-8 UTF-8
 KSC 5601-1987 KSC5601 UTF-8 UTF-8
 UTF-8 UTF-8 KSC 5601-1987 5601
 UTF-8 UTF-8 KSC 5601-1987 EUC-KR
 UTF-8 UTF-8 KSC 5601-1987 KSC 5601
 UTF-8 ko-KR-UTF-8 IBM CP 933 cp 933
 UTF-8 ko-KR-UTF-8 KSC 5601-1987 ko_KR-euc
 UTF-8 ko-KR-UTF-8 ISO2022-KR ko_KR-iso2022-7
 UTF-8 ko-KR-UTF-8 KSC 5601-1987 - Johap ko_KR-johap
 UTF-8 ko-KR-UTF-8 KSC5601-1992 - Johap ko_KR-johap92
 IBM CP933 cp933 UTF-8 ko_KR-UTF-8
 KSC 5601-1987 ko_KR-euc UTF-8 ko_KR-UTF-8
 KSC 5601-1987 ko_KR-euc ISO 2022-KR ko_KR-iso2022-7
 KSC 5601-1987 ko_KR-euc KSC 5601-1987 - Johap ko_KR-johap
 KSC 5601-1987 ko_KR-euc KSC 5601-1992 - Johap ko_KR-johap92
 KSC 5601-1987 ko_KR-euc KSC 5601-1992-Annex:4 ko_KR-nbyte
 ISO 2022-KR iso2022-7 UTF-8 ko_KR-UTF-8
 ISO 2022-KR iso2022-7 KSC 5601-1987 ko_KR-euc
 KSC 5601-1987 - Johap ko-KR-johap UTF-8 ko_KR-UTF-8
 KSC 5601-1987 - Johap ko-KR-johap KSC 5601-1987 ko_KR-euc
 KSC 5601-1992 - Johap ko-KR-johap92 UTF-8 ko_KR-UTF-8
 KSC 5601-1992 - Johap ko-KR-johap92 KSC 5601-1987 ko_KR-euc
 KSC 5601-1992 - Annex:4 ko-KR-nbyte KSC 5601-1987 ko_KR-euc