Solaris Internationalization Guide For Developers

Code Conversions

The en_US.UTF-8 locale supports various code conversions among major codesets of several countries through iconv(1) and iconv(1).

The available fromcode and tocode names that can be applied to iconv(1) and iconv_open(3)are shown in Table 4-3.

Table 4-3 Available Code Conversions in en_US.UTF-8

From Code 

To Code 

Description 

646 

UTF-8 

ISO 646 (US-ASCII) to UTF-8 

UTF-8 

646 

UTF-8 to ISO 646 (US-ACII) 

UTF-8 

8859-1 

UTF-8 to ISO 8859-1 

UTF-8 

8859-2 

UTF-8 to ISO 8859-2 

UTF-8 

8859-3 

UTF-8 to ISO 8859-3 

UTF-8 

8859-4 

UTF-8 to ISO 8859-4 

UTF-8 

8859-5 

UTF-8 to ISO 8859-5 (Cyrillic) 

UTF-8 

8859-6 

UTF-8 to ISO 8859-6 (Arabic) 

UTF-8 

8859-7 

UTF-8 to ISO 8859-7 (Greek) 

UTF-8 

8859-8 

UTF-8 to ISO 8859-8 (Hebrew) 

UTF-8 

8859-9 

UTF-8 to ISO 8859-9 

UTF-8 

8859-10 

UTF-8 to ISO 8859-10 

UTF-8  

8859-11 

UTF-8 to TIS 620.2533 (Thai) 

UTF-8 

8859-15 

UTF-8 to ISO 8859-15 

8859-1 

UTF-8 

ISO 8859-1 to UTF-8 

8859-2 

UTF-8 

ISO 8859-2 to UTF-8 

8859-3 

UTF-8 

ISO 8859-3 to UTF-8 

8859-4 

UTF-8 

ISO 8859-4 to UTF-8 

8859-5 

UTF-8 

ISO 8859-5 (Cyrillic) to UTF-8 

8859-6 

UTF-8 

ISO 8859-6 (Arabic) to UTF-8 

8859-7 

UTF-8 

ISO 8859-7 (Greek) to UTF-8 

8859-8 

UTF-8 

ISO 8859-8 (Hebrew) to UTF-8 

8859-9 

UTF-8 

ISO 8859-9 to UTF-8 

8859-10 

UTF-8 

ISO 8859-10 to UTF-8 

8859-11 

UTF-8 

TIS 620.2553 to UTF-8 

8859-15 

UTF-8 

ISO 8859-15 to UTF-8 

UTF-8 

KOI8-R 

UTF-8 to KOI8-R (Cyrillic) 

KOI8-R 

UTF-8 

KOI8-R (Cyrillic) to UTF-8 

UTF-8 

UCS-2 

UTF-8 to UCS-2 

UCS-2 

UTF-8 

UCS-2 to UTF-8 

UTF-8 

UCS-4 

UTF-8 to UCS-4 

UCS-4 

UTF-8 

UCS-4 to UTF-8 

UTF-8 

UTF-7 

UTF-8 to UTF-7 

UTF-7 

UTF-8 

UTF-7 to UTF-8 

UTF-8 

UTF-16 

UTF-8 to UTF-16 

UTF-16 

UTF-8 

UTF-16 to UTF-8 

UTF-8 

eucJP 

UTF-8 to Japanese EUC (JIS X0201-1976, JIS X0208-1983, and JIS X0212-1990) 

UTF-8 

PCK 

UTF-8 to Japanese PC Kanji ( SJIS) 

UTF-8 

ISO-2022-JP 

UTF-8 to Japanese MIME character set ISO-2022-JP 

eucJP 

UTF-8 

Japanese EUC to UTF-8 

PCK 

UTF-8 

Japanese PC Kanji (SJIS) to UTF-8 

ISO-2022-JP 

UTF-8 

Japanese MIME character set to UTF-8 

UTF-8 

ko_KR-euc 

UTF-8 to Korean EUC (KS C 5636 and KS C 5601-1987) 

UTF-8 

ko_KR-johap 

UTF-8 to Korean Johap (KS C 5601-1987) 

UTF-8 

ko_KR-johap92 

UTF-8 to Korean Johap (KS C 5601-1992) 

UTF-8 

ko_KR-iso2022-7 

UTF-8 to ISO-2022-KR 

ko_KR-euc 

UTF-8 

Korean EUC to UTF-8 

ko_KR-johap 

UTF-8 

Korean Johap (KS C 5601-1987) to UTF-8 

ko_KR-johap92 

UTF-8 

Korean Johap (KS C 5601-1992) to UTF-8 

ko_KR-iso2022-7 

UTF-8 

ISO-2022-KR to UTF-8 

ko_KR-cp933 

UTF-8 

IBM MBCS CP933 to UTF-8 

UTF-8 

gb2312 

UTF-8 to Simplified Chinese EUC (GB 1988-1980 and GB2312-1980) 

UTF-8 

iso2022 

UTF-8 to ISimplified Chinese MIME character set (ISO-2022-cn) 

UTF-8 

GBK 

UTF-8 to Simplified Chinese MIME character set (ISO-2022-cn) 

gb2312 

UTF-8 

Chinese/PRC EUC (GB 2312-1980) to UTF-8 

iso2022 

UTF-8 

ISO-2022-CN to UTF-8 

GBK 

UTF-8 

Simplified Chinese GBK to UTF-8 

UTF-8 

zh_TW-euc 

UTF-8 to Traditional Chinese EUC (CNS 11643-1992) 

UTF-8 

zh_TW-big5 

UTF-8 to Traditional Chinese Big5 

UTF-8 

zh_TW-iso2022-7 

UTF-8 to Traditional Chinese MIME character set (ISO-2022-TW) 

UTF-8 

zh_TW-cp937 

UTF-8 to IBM MBCS CP937 

zh_TW-euc 

UTF-8 

Traditional Chinese EUC to UTF-8 

zh_TW-big5 

UTF-8 

Traditional Chinese Big5 to UTF-8 

zh_TW-iso2022-7 

UTF-8 

Traditional Chinese MIME character set (ISO-2022-TW) to UTF-8 

zh_TW-cp937 

UTF-8 

IBM MBCS CP937 to UTF-8 

For more details on iconv code conversion, see the , iconv(1) and iconv_open(3), iconv(3), and iconv_close(3) man pages. For more information on available code conversions, see iconv_en_US.UTF-8(5).