Asian Application Developer's Guide

Chapter 3 The Codeset Conversion Utility

Asian Solaris software provides the iconv library as a conversion utility for character-based code conversion.

Code Set Conversion Modules for Korean

Conversion for Korean EUC Character Codes

The following modules perform character-based code conversion on the KS C 5601 character set. They convert KS C 5601 characters, also called Completion code or Wansung, to Combination code (Johap), and vice versa.

For further information, see the iconv(3) and iconv_ko(5) man pages.

Table 3-1 Korean iconv Code Conversion Modules (ko locale)

Code 

Symbol 

TargetCode 

Symbol 

Wansung 

ko_KR-euc 

Johap 

ko_KR-johap92 

Wansung 

ko_KR-euc 

Packed 

ko_KR-johap 

Wansung 

ko_KR-euc 

N-Byte 

ko_KR-nbyte 

Wansung 

ko_KR-euc 

ISO-2022-KR 

ko_KR-iso2022-7 

Johap 

ko_KR-johap92 

Wansung 

ko_KR-euc 

Packed 

ko_KR-johap 

Wansung 

ko_KR-euc 

N-Byte 

ko_KR-nbyte 

Wansung 

ko_KR-euc 

ISO-2022-KR 

ko_KR-iso2022-7 

Wansung 

ko_KR-euc 

Conversion for ko.UTF-8 Character Codes

The following modules perform character-based code conversion on the KS C 5700 character set. They convert KSC 5700 characters between Korean UTF-8, Completion code (Wansung), and Combination code (Johap).

For further information, see the iconv(3), iconv_ko.UTF-8(5), iconv_utf(5) man pages.

Table 3-2 Common Korean iconv Code Conversion Modules (ko and ko.UTF-8 locales)

Code 

Symbol 

Target Code 

Symbol 

UTF-8 

ko_KR-UTF-8 

Wansung 

ko_KR-euc 

UTF-8 

ko_KR-UTF-8 

Johap 

ko_KR-johap92 

UTF-8 

ko_KR-UTF-8 

Packed 

ko_KR-johap 

UTF-8 

ko_KR-UTF-8 

ISO-2022-KR 

ko_KR-iso2022-7 

Wansung 

ko_KR-euc 

UTF-8  

ko_KR-UTF-8 

Johap 

ko_KR-johap92 

UTF-8  

ko_KR-UTF-8 

Packed 

ko_KR-johap 

UTF-8  

ko_KR-UTF-8 

ISO-2022-KR 

ko_KR-iso2022-7 UTF-8 

UTF-8  

ko_KR-UTF-8 

Code Set Conversion Modules for Simplified Chinese

The following code set conversion modules are supported in Simplified Chinese Solaris software. For further information, see the iconv(3) and iconv_zh(5) man pages.

Table 3-3 Simplified Chinese iconv Code Conversion Modules (zh locale)

Code 

Symbol 

TargetCode 

Symbol 

GB2312-80 

zh_CN.euc 

ISO 2022-7 

zh_CN.iso2022-7 

ISO 2022-7 

zh_CN.iso2022-7 

GB2312-80 

zh_CN.euc 

GB2312-80 

zh_CN.euc 

ISO 2022-CN 

zh_CN.iso2022-CN 

ISO-2022-CN 

zh_CN.iso2022-CN 

GB2312-80 

zh_CN.euc 

UTF-8 

UTF-8 

GB2312-80 

zh_CN.euc 

GB2312-80 

zh_CN.euc 

UTF-8 

UTF-8  

Code Set Conversion Modules for Traditional Chinese

The following code set conversion modules are supported in Traditional Chinese Solaris. For further information, see the iconv(3) and iconv_zh_TW(5) man pages.

Table 3-4 Traditional Chinese iconv Code Conversion Modules (zh_TW and zh_TW.BIG5 locales)

Code 

Symbol 

Target Code 

Symbol 

CNS 11643 

zh_TW-euc 

Big-5 

zh_TW-big5 

CNS 11643 

zh_TW-euc 

ISO 2022-7 

zh_TW-iso2022-7 

Big-5 

zh_TW-big5 

CNS 11643 

zh_TW-euc 

Big-5 

zh_TW-big5 

ISO 2022-7 

zh_TW-iso2022-7 

ISO 2022-7 

zh_TW-iso2022-7 

CNS 11643 

zh_TW-euc 

ISO 2022-7 

zh_TW-iso2022-7 

Big-5 

zh_TW-big5 

CNS 11643 

zh_TW-euc 

ISO 2022-CN-EXT 

zh_TW-iso2022-CN-EXT 

ISO 2022-CN-EX 

zh_TW-iso2022-CN-EXT 

CNS 11643 

zh_TW-euc 

Big-5 

zh_TW-big5 

ISO 2022-CN 

zh_TW-iso2022-CN 

ISO 2022-CN 

zh_TW-iso2022-CN 

Big-5 

zh_TW-big5 

UTF-8 

UTF-8 

CNS 11643 

zh_TW-euc 

CNS 11643 

CNS 11643 

UTF-8 

UTF-8 

UTF-8 

UTF-8 

Big-5 

zh_TW-big5 

Big-5 

zh_TW-big5 

UTF-8 

UTF-8 

UTF-8 

UTF-8 

ISO 2022-7 

zh_TW-iso2022-7 

ISO 2022-7 

zh_TW-iso2022-7 

UTF-8 

UTF-8 

ISO 2022-CN-EXT 

zh_TW-iso2022-CN-EXT 

Big-5 

zh_TW-big5 

Big-5 

zh_TW-big5 

ISO 2022-CN-EXT 

zh_TW-iso2022-CN-EXT