Common Desktop Environment: Internationalization Programmer's Guide

eucKR

The EUC for Korean is an encoding consisting of single-byte and multibyte characters (shown in Table 3-5 ). The encoding conforms to ISO2022 and is based on Korean Standard Code (KSC) set and EUC definitions.

Table 3-5 Encoding for eucKR.

CS 

Encoding 

 

Character Set 

cs0 

0xxxxxxx 

 

ASCII 

cs1 

1xxxxxxx 

1xxxxxxx 

KS C 5601-1992 

cs2 

 

 

Not used 

cs3 

 

 

Not used 

KSC 5601-1992 (code of the Korean character set for information interchange, 1992 version) contains 432 special characters, 30 Arabic and Roman numeral characters, 94 Hangul alphabet characters, 52 Roman characters, 48 Greek characters, 27 Latin characters, 169 Japanese characters, 66 Russian characters, 68 line-drawing elements, 2344 precomposed Hangul characters, and 4888 Hanja characters.

One Hangul character can be comprised of several consonants and vowels. Most Hangul words can be expressed in Hanja words. Hanja is a set of Traditional Chinese characters, which is currently used by Korean people. Each Hanja character has its own meaning and is thus more specific than Hangul most of the time.