Common Desktop Environment: Internationalization Programmer's Guide

eucTW

The EUC for Traditional Chinese is an encoding consisting of characters that contain single-byte and multibyte (2 and 4 bytes) characters. The EUC encoding conforms to ISO2022 and is based on the Chinese National Standard (CNS) as defined by the Republic of China and the EUC definition, see Table 3-3 .

Table 3-3 Encoding for eucTW

CS 

Encoding 

 

 

Character Set 

cs0 

0xxxxxxx 

 

 

ASCII 

cs1 

1xxxxxxx 

1xxxxxxx 

 

CNS 11643.1992 - plane 1 

cs2 

0x8EA2 

1xxxxxxx 

1xxxxxxx 

CNS 11643.1992 - plane 2 

cs3 

0x8EA3 

1xxxxxxx 

1xxxxxxx 

CNS 11643.1992 - plane 3 

 

0x8EB0 

1xxxxxxx 

1xxxxxxx 

CNS 11643.1992 - Plane 16 

CNS 11643-1992 defines 16 planes for the Chinese Standard Interchange Code, each plane can support up to 8836 characters (94x94). Currently, only planes 1 through 7 have characters assigned. Table 3-4 shows the 16 planes of the CNS 11643-1992 standard.

Table 3-4 16 Planes of the CNS 11643-1992 Standard

Plane 

Definition 

# of Character 

EUC Encoding 

Most frequently used 

6085 

A1A1-FDCB 

Secondary frequently 

7650 

8EA2 A1A1 - 8EA2 F2C4 

Exec.Yuen EDP 1 center

6148 

8EA3 A1A1 - 8EA3 E2C6 

RIS2, Vendor defined

7298 

8EA4 A1A1 - 8EA4 EEDC 

Rarely used by MOE3

8603 

8EA5 A1A1 - 8EA5 FCD1 

Variation char set 1 by MOE 

6388 

8EA6 A1A1 - 8EA6 E4FA 

Variation char set 2 by MOE 

6539 

8EA7 A1A1 - 8EA7 E6D5 

Undefined 

8EA8 A1A1 - 8EA8 FEFE 

Undefined 

8EA9 A1A1 - 8EA9 FEFE 

10 

Undefined 

8EAA A1A1 - 8EAA FEFE 

11 

Undefined 

8EAB A1A1 - 8EAB FEFE 

12 

User Defined Character (UDC) 

8EAC A1A1 - 8EAC FEFE 

13 

UDC 

8EAD A1A1 - 9EAD FEFE 

14 

UDC 

8EAE A1A1 - 8EAE FEFE 

15 

UDC 

8EAF A1A1 - 8EAF FEFE 

16 

UDC 

8EB0 A1A1 - 8EB0 FEFE 

1. EDP: Center of Directorate, General of Budget, Accounting, and Statistics

2. RIS: Residence Information System

3. MOE: Ministry of Education