iconv_ko - man pages section 7: Standards, Environments, Macros, Character Sets, and Miscellany

Language:

iconv_ko (7)

Name

iconv_ko - codeset conversion for Korean encodings

Description

The table below provides basic information on available Korean codeset code conversions.

The listed names in the table are canonical names. There are aliases, e.g., Wansung, eucKR, and such for EUC-KR, that are also supported. For the supported aliases, please refer to "iconv -l" output.

Code Sets	Description
`EUC-KR`	Korean Extended UNIX Code representation using `KS X 1003` or US-ASCII as the primary single byte codeset and `KS X 1001` as the supplementary two-byte codeset. Also known as Wansung and Completion code.
`IBM-933`	IBM EBCDIC mixed multibyte code page for Korean.
`ISO-2022-KR`	Korean 7-bit encoded codeset based on `ISO/IEC 2022` extension mechanism and as specified in the `RFC 1557`. Can represent characters of `KS X 1003` or US-ASCII and `KS X 1001` character sets.
`JOHAP82`	Korean multibyte codeset using `KS X 1003` or US-ASCII as the single byte sub-codeset and `KS C 5601-1987` Annex 3: Supplementary Code System (2 Byte Johap Code System) as the double-byte sub-codeset. Also known as Packed and old Combination code.
`JOHAP92`	Korean multibyte codeset using `KS X 1003` or US-ASCII as the single byte sub-codeset and `KS X 1001` Annex 3: Supplementary Code System (2 Byte Johap Code System) as the double-byte sub-codeset. Also known as Combination and Combination code.
`CP949`	Windows code page 949 for Korean extends `EUC-KR` by including pre-composed 8,822 modern day Hangul syllables that are not present in the `KS X 1001` pre-composed Hangul blocks. Also known as Unified Hangul code.
`N-BYTE`	Korean 7-bit encoding for Hangul and `KS X 1003` or US-ASCII characters as specified in the `KS X 1001` Annex 4: 7 Bit Hangul Alphabet codes. Do not include Hanja or special symbols.

For any characters that exist and are valid in fromcode that do not exist in tocode of any of the above codesets, each of such characters will be converted into '?' (0x3f) as a non-identical code conversion. If the tocode is a Unicode encoding, U+FFFD (replacement character) will be used instead.

Files

/usr/lib/iconv/*.so: iconv conversion modules
/usr/lib/iconv/*.bt: cconv code conversion binary tables for iconv(1), cconv(3C), and iconv(3C)

Notes

Available iconv and cconv code conversions related to Korean codesets and their aliases can be obtained by running "iconv -l" as described in the iconv(1) manual page.

Selected cconv code conversions related to Korean codesets also support the following variations based on the Unihan database:

level 1:: kSimplifiedVariant
level 2:: kTraditionalVariant
level 3:: kSemanticVariant
level 4:: kZVariant
level 5:: kCompatibilityVariant
level 6:: kSpecializedSemanticVariant

The conversions supporting these variant levels are: EUC-KR, IBM-933, ISO-2022-KR, JOHAP92 and CP949.

man pages section 7: Standards, Environments, Macros, Character Sets, and Miscellany

iconv_ko (7)

Name

Description

Files

See Also

Notes