Common Desktop Environment: Internationalization Programmer's Guide

eucJP

The EUC for Japanese consists of single-byte and multibyte characters (2 and 3 bytes). The encoding conforms to ISO2022 and is based on JIS and EUC definitions, see Table 3-2.

Table 3-2 Encoding for eucJP

CS 

Encoding 

 

Character Set 

cs0 

0xxxxxxx 

 

ASCII 

cs1 

1xxxxxxx 

1xxxxxxx 

JIS X0208-1990 

cs2 

0x8E 

1xxxxxxx 

JIS X0201-1976 

cs3 

0x8F 

1xxxxxxx 1xxxxxxx 

JIS X0212-1990 

JIS X0208-1990

A code of the Japanese graphic character set for information interchange (1990 version) that contains 147 special characters, 10 numeric digits, 83 Hiragana characters, 86 Katakana characters, 52 Latin characters, 48 Greek characters, 66 Cyrillic characters, 32 line-drawing elements, and 6355 Kanji characters.

JIS X0201

A code for information interchange that contains 63 Katakana characters.

JIS X0212-1990

A code of the supplementary Japanese graphic character set for information interchange (1990 version) that contains 21 additional special characters, 21 additional Greek characters, 26 additional Cyrillic characters, 27 additional Latin characters, 171 Latin characters with diacritical marks, and 5801 additional Kanji characters.