Solaris Internationalization Guide For Developers

Code Conversions

The en_US.UTF-8 locale supports various code conversions among major codesets of several countries through iconv(1) and iconv(1).

The available fromcode and tocode names that can be applied to iconv(1) and iconv_open(3)are shown in Table 4-3.

Table 4-3 Available Code Conversions in en_US.UTF-8


From Code	To Code	Description
646	UTF-8	ISO 646 (US-ASCII) to UTF-8
UTF-8	646	UTF-8 to ISO 646 (US-ACII)
UTF-8	8859-1	UTF-8 to ISO 8859-1
UTF-8	8859-2	UTF-8 to ISO 8859-2
UTF-8	8859-3	UTF-8 to ISO 8859-3
UTF-8	8859-4	UTF-8 to ISO 8859-4
UTF-8	8859-5	UTF-8 to ISO 8859-5 (Cyrillic)
UTF-8	8859-6	UTF-8 to ISO 8859-6 (Arabic)
UTF-8	8859-7	UTF-8 to ISO 8859-7 (Greek)
UTF-8	8859-8	UTF-8 to ISO 8859-8 (Hebrew)
UTF-8	8859-9	UTF-8 to ISO 8859-9
UTF-8	8859-10	UTF-8 to ISO 8859-10
UTF-8	8859-11	UTF-8 to TIS 620.2533 (Thai)
UTF-8	8859-15	UTF-8 to ISO 8859-15
8859-1	UTF-8	ISO 8859-1 to UTF-8
8859-2	UTF-8	ISO 8859-2 to UTF-8
8859-3	UTF-8	ISO 8859-3 to UTF-8
8859-4	UTF-8	ISO 8859-4 to UTF-8
8859-5	UTF-8	ISO 8859-5 (Cyrillic) to UTF-8
8859-6	UTF-8	ISO 8859-6 (Arabic) to UTF-8
8859-7	UTF-8	ISO 8859-7 (Greek) to UTF-8
8859-8	UTF-8	ISO 8859-8 (Hebrew) to UTF-8
8859-9	UTF-8	ISO 8859-9 to UTF-8
8859-10	UTF-8	ISO 8859-10 to UTF-8
8859-11	UTF-8	TIS 620.2553 to UTF-8
8859-15	UTF-8	ISO 8859-15 to UTF-8
UTF-8	KOI8-R	UTF-8 to KOI8-R (Cyrillic)
KOI8-R	UTF-8	KOI8-R (Cyrillic) to UTF-8
UTF-8	UCS-2	UTF-8 to UCS-2
UCS-2	UTF-8	UCS-2 to UTF-8
UTF-8	UCS-4	UTF-8 to UCS-4
UCS-4	UTF-8	UCS-4 to UTF-8
UTF-8	UTF-7	UTF-8 to UTF-7
UTF-7	UTF-8	UTF-7 to UTF-8
UTF-8	UTF-16	UTF-8 to UTF-16
UTF-16	UTF-8	UTF-16 to UTF-8
UTF-8	eucJP	UTF-8 to Japanese EUC (JIS X0201-1976, JIS X0208-1983, and JIS X0212-1990)
UTF-8	PCK	UTF-8 to Japanese PC Kanji ( SJIS)
UTF-8	ISO-2022-JP	UTF-8 to Japanese MIME character set ISO-2022-JP
eucJP	UTF-8	Japanese EUC to UTF-8
PCK	UTF-8	Japanese PC Kanji (SJIS) to UTF-8
ISO-2022-JP	UTF-8	Japanese MIME character set to UTF-8
UTF-8	ko_KR-euc	UTF-8 to Korean EUC (KS C 5636 and KS C 5601-1987)
UTF-8	ko_KR-johap	UTF-8 to Korean Johap (KS C 5601-1987)
UTF-8	ko_KR-johap92	UTF-8 to Korean Johap (KS C 5601-1992)
UTF-8	ko_KR-iso2022-7	UTF-8 to ISO-2022-KR
ko_KR-euc	UTF-8	Korean EUC to UTF-8
ko_KR-johap	UTF-8	Korean Johap (KS C 5601-1987) to UTF-8
ko_KR-johap92	UTF-8	Korean Johap (KS C 5601-1992) to UTF-8
ko_KR-iso2022-7	UTF-8	ISO-2022-KR to UTF-8
ko_KR-cp933	UTF-8	IBM MBCS CP933 to UTF-8
UTF-8	gb2312	UTF-8 to Simplified Chinese EUC (GB 1988-1980 and GB2312-1980)
UTF-8	iso2022	UTF-8 to ISimplified Chinese MIME character set (ISO-2022-cn)
UTF-8	GBK	UTF-8 to Simplified Chinese MIME character set (ISO-2022-cn)
gb2312	UTF-8	Chinese/PRC EUC (GB 2312-1980) to UTF-8
iso2022	UTF-8	ISO-2022-CN to UTF-8
GBK	UTF-8	Simplified Chinese GBK to UTF-8
UTF-8	zh_TW-euc	UTF-8 to Traditional Chinese EUC (CNS 11643-1992)
UTF-8	zh_TW-big5	UTF-8 to Traditional Chinese Big5
UTF-8	zh_TW-iso2022-7	UTF-8 to Traditional Chinese MIME character set (ISO-2022-TW)
UTF-8	zh_TW-cp937	UTF-8 to IBM MBCS CP937
zh_TW-euc	UTF-8	Traditional Chinese EUC to UTF-8
zh_TW-big5	UTF-8	Traditional Chinese Big5 to UTF-8
zh_TW-iso2022-7	UTF-8	Traditional Chinese MIME character set (ISO-2022-TW) to UTF-8
zh_TW-cp937	UTF-8	IBM MBCS CP937 to UTF-8

For more details on iconv code conversion, see the , iconv(1) and iconv_open(3), iconv(3), and iconv_close(3) man pages. For more information on available code conversions, see iconv_en_US.UTF-8(5).