Go to main content

man pages section 7: Standards, Environments, Macros, Character Sets, and Miscellany

Exit Print View

Updated: Wednesday, August 8, 2018
 
 

iconv_ko (7)

Name

iconv_ko - codeset conversion for Korean encodings

Description

The table below provides basic information on available Korean codeset code conversions.

The listed names in the table are canonical names. There are aliases, e.g., Wansung, eucKR, and such for EUC-KR, that are also supported. For the supported aliases, please refer to "iconv -l" output.

Code Sets
Description
EUC-KR
Korean Extended Unix Code representation using KS X 1003 or US-ASCII as the primary single byte codeset and KS X 1001 as the supplementary two-byte codeset. Also known as Wansung and Completion code.
IBM-933
IBM EBCDIC mixed multibyte code page for Korean.
ISO-2022-KR
Korean 7-bit encoded codeset based on ISO/IEC 2022 extension mechanism and as specified in the RFC 1557. Can represent characters of KS X 1003 or US-ASCII and KS X 1001 character sets.
JOHAP82
Korean multibyte codeset using KS X 1003 or US-ASCII as the single byte sub-codeset and KS C 5601-1987 Annex 3: Supplementary Code System (2 Byte Johap Code System) as the double-byte sub-codeset. Also known as Packed and old Combination code.
JOHAP92
Korean multibyte codeset using KS X 1003 or US-ASCII as the single byte sub-codeset and KS X 1001 Annex 3: Supplementary Code System (2 Byte Johap Code System) as the double-byte sub-codeset. Also known as Combination and Combination code.
CP949
Windows code page 949 for Korean extends EUC-KR by including pre-composed 8,822 modern day Hangul syllables that are not present in the KS X 1001 pre-composed Hangul blocks. Also known as Unified Hangul code.
N-BYTE
Korean 7-bit encoding for Hangul and KS X 1003 or US-ASCII characters as specified in the KS X 1001 Annex 4: 7 Bit Hangul Alphabet codes. Do not include Hanja or special symbols.

For any characters that exist and are valid in fromcode that do not exist in tocode of any of the above codesets, each of such characters will be converted into '?' (0x3f) as a non-identical code conversion. If the tocode is a Unicode encoding, U+FFFD (replacement character) will be used instead.

Files

/usr/lib/iconv/*.so

iconv conversion modules

/usr/lib/iconv/*.bt

cconv code conversion binary tables for iconv(1), cconv(3C), and iconv(3C)

See Also

geniconvtbl(1), iconv(1), cconv_close(3C), cconv_open(3C), cconvctl(3C), iconv(3C), iconv_close(3C), iconv_open(3C), iconvctl(3C), alias(5), geniconvtbl-cconv(5), iconv_extra(7), iconv_unicode(7), iconv_zh(7), iconv_zh_HK(7), iconv_zh_TW(7)

Chon, K., H. Je Park, and U. Choi, Korean Character Encoding for Internet Messages, RFC 1557, Solvit Chosun Media, December 1993.

KS X 1003 Code For Information Interchange, 2008.12.31.

KS X 1001 Code For Information Interchange (Hangeul and hanja), 2009.12.29.

Unicode Standard Annex #38

Unicode Han Database (Unihan)

http://www.unicode.org/reports/tr38/

Notes

Available iconv and cconv code conversions related to Korean codesets and their aliases can be obtained by running "iconv -l" as described in the iconv(1) manual page.

Selected cconv code conversions related to Korean codesets also support the following variations based on the Unihan database:

level 1:

kSimplifiedVariant

level 2:

kTraditionalVariant

level 3:

kSemanticVariant

level 4:

kZVariant

level 5:

kCompatibilityVariant

level 6:

kSpecializedSemanticVariant

The conversions supporting these variant levels are: EUC-KR, IBM-933, ISO-2022-KR, JOHAP92 and CP949.