The en_US.UTF-8 locale supports various code conversions among major codesets of several countries through iconv(1) and iconv(1).
The available fromcode and tocode names that can be applied to iconv(1) and iconv_open(3)are shown in Table 4-3.
Table 4-3 Available Code Conversions in en_US.UTF-8
From Code |
To Code |
Description |
---|---|---|
646 |
UTF-8 |
ISO 646 (US-ASCII) to UTF-8 |
UTF-8 |
646 |
UTF-8 to ISO 646 (US-ACII) |
UTF-8 |
8859-1 |
UTF-8 to ISO 8859-1 |
UTF-8 |
8859-2 |
UTF-8 to ISO 8859-2 |
UTF-8 |
8859-3 |
UTF-8 to ISO 8859-3 |
UTF-8 |
8859-4 |
UTF-8 to ISO 8859-4 |
UTF-8 |
8859-5 |
UTF-8 to ISO 8859-5 (Cyrillic) |
UTF-8 |
8859-6 |
UTF-8 to ISO 8859-6 (Arabic) |
UTF-8 |
8859-7 |
UTF-8 to ISO 8859-7 (Greek) |
UTF-8 |
8859-8 |
UTF-8 to ISO 8859-8 (Hebrew) |
UTF-8 |
8859-9 |
UTF-8 to ISO 8859-9 |
UTF-8 |
8859-10 |
UTF-8 to ISO 8859-10 |
UTF-8 |
8859-11 |
UTF-8 to TIS 620.2533 (Thai) |
UTF-8 |
8859-15 |
UTF-8 to ISO 8859-15 |
8859-1 |
UTF-8 |
ISO 8859-1 to UTF-8 |
8859-2 |
UTF-8 |
ISO 8859-2 to UTF-8 |
8859-3 |
UTF-8 |
ISO 8859-3 to UTF-8 |
8859-4 |
UTF-8 |
ISO 8859-4 to UTF-8 |
8859-5 |
UTF-8 |
ISO 8859-5 (Cyrillic) to UTF-8 |
8859-6 |
UTF-8 |
ISO 8859-6 (Arabic) to UTF-8 |
8859-7 |
UTF-8 |
ISO 8859-7 (Greek) to UTF-8 |
8859-8 |
UTF-8 |
ISO 8859-8 (Hebrew) to UTF-8 |
8859-9 |
UTF-8 |
ISO 8859-9 to UTF-8 |
8859-10 |
UTF-8 |
ISO 8859-10 to UTF-8 |
8859-11 |
UTF-8 |
TIS 620.2553 to UTF-8 |
8859-15 |
UTF-8 |
ISO 8859-15 to UTF-8 |
UTF-8 |
KOI8-R |
UTF-8 to KOI8-R (Cyrillic) |
KOI8-R |
UTF-8 |
KOI8-R (Cyrillic) to UTF-8 |
UTF-8 |
UCS-2 |
UTF-8 to UCS-2 |
UCS-2 |
UTF-8 |
UCS-2 to UTF-8 |
UTF-8 |
UCS-4 |
UTF-8 to UCS-4 |
UCS-4 |
UTF-8 |
UCS-4 to UTF-8 |
UTF-8 |
UTF-7 |
UTF-8 to UTF-7 |
UTF-7 |
UTF-8 |
UTF-7 to UTF-8 |
UTF-8 |
UTF-16 |
UTF-8 to UTF-16 |
UTF-16 |
UTF-8 |
UTF-16 to UTF-8 |
UTF-8 |
eucJP |
UTF-8 to Japanese EUC (JIS X0201-1976, JIS X0208-1983, and JIS X0212-1990) |
UTF-8 |
PCK |
UTF-8 to Japanese PC Kanji ( SJIS) |
UTF-8 |
ISO-2022-JP |
UTF-8 to Japanese MIME character set ISO-2022-JP |
eucJP |
UTF-8 |
Japanese EUC to UTF-8 |
PCK |
UTF-8 |
Japanese PC Kanji (SJIS) to UTF-8 |
ISO-2022-JP |
UTF-8 |
Japanese MIME character set to UTF-8 |
UTF-8 |
ko_KR-euc |
UTF-8 to Korean EUC (KS C 5636 and KS C 5601-1987) |
UTF-8 |
ko_KR-johap |
UTF-8 to Korean Johap (KS C 5601-1987) |
UTF-8 |
ko_KR-johap92 |
UTF-8 to Korean Johap (KS C 5601-1992) |
UTF-8 |
ko_KR-iso2022-7 |
UTF-8 to ISO-2022-KR |
ko_KR-euc |
UTF-8 |
Korean EUC to UTF-8 |
ko_KR-johap |
UTF-8 |
Korean Johap (KS C 5601-1987) to UTF-8 |
ko_KR-johap92 |
UTF-8 |
Korean Johap (KS C 5601-1992) to UTF-8 |
ko_KR-iso2022-7 |
UTF-8 |
ISO-2022-KR to UTF-8 |
ko_KR-cp933 |
UTF-8 |
IBM MBCS CP933 to UTF-8 |
UTF-8 |
gb2312 |
UTF-8 to Simplified Chinese EUC (GB 1988-1980 and GB2312-1980) |
UTF-8 |
iso2022 |
UTF-8 to ISimplified Chinese MIME character set (ISO-2022-cn) |
UTF-8 |
GBK |
UTF-8 to Simplified Chinese MIME character set (ISO-2022-cn) |
gb2312 |
UTF-8 |
Chinese/PRC EUC (GB 2312-1980) to UTF-8 |
iso2022 |
UTF-8 |
ISO-2022-CN to UTF-8 |
GBK |
UTF-8 |
Simplified Chinese GBK to UTF-8 |
UTF-8 |
zh_TW-euc |
UTF-8 to Traditional Chinese EUC (CNS 11643-1992) |
UTF-8 |
zh_TW-big5 |
UTF-8 to Traditional Chinese Big5 |
UTF-8 |
zh_TW-iso2022-7 |
UTF-8 to Traditional Chinese MIME character set (ISO-2022-TW) |
UTF-8 |
zh_TW-cp937 |
UTF-8 to IBM MBCS CP937 |
zh_TW-euc |
UTF-8 |
Traditional Chinese EUC to UTF-8 |
zh_TW-big5 |
UTF-8 |
Traditional Chinese Big5 to UTF-8 |
zh_TW-iso2022-7 |
UTF-8 |
Traditional Chinese MIME character set (ISO-2022-TW) to UTF-8 |
zh_TW-cp937 |
UTF-8 |
IBM MBCS CP937 to UTF-8 |
For more details on iconv code conversion, see the , iconv(1) and iconv_open(3), iconv(3), and iconv_close(3) man pages. For more information on available code conversions, see iconv_en_US.UTF-8(5).