Unified Hangul Code

From Wikipedia, the free encyclopedia
  (Redirected from Code page 949)
Jump to navigation Jump to search
Unified Hangul Code
Unified Hangul Code.svg
Layout of the Unified Hangul Code
Alias(es)Windows Code Page 949
StandardWHATWG Encoding Standard (as "EUC-KR")[1]
ClassificationExtended ISO 646,[a] Variable-width encoding, CJK encoding
  1. ^ Not in the strictest sense of the term, as ASCII bytes can appear as trail bytes, although this is limited to letter bytes.

Unified Hangul Code (UHC,[2] Korean: 통합형 한글 코드[3], translit. Tonghabhyeong Hangeul Kodeu), also known under Microsoft Windows as Code Page 949 (Windows-949), is the Microsoft Windows code page for the Korean language. It is an extension of Wansung Code (KS C 5601:1987, encoded as EUC-KR) to include all 11172 Hangul syllables present in Johab (KS C 5601:1992 annex 3).[2] This corresponds to the pre-composed syllables available in Unicode 2.0 and later.

IBM's code page for Unified Hangul Code is called Code page 1363 (IBM-1363), or "Korean MS-Win". It is a combination of Code page 1126 and Code page 1362.[4] It differs in having a single byte mapping of 0x5C to the Won sign (U+20A9);[5] Windows maps 0x5C to U+005C (the Unicode code point for the backslash) as in ASCII,[6] although fonts often still render it as a Won sign.[7] IBM's code page 949 is a different extension of EUC-KR.

The code page is not registered with IANA as a standard to communicate information over the Internet.[8] Alternatives include UTF-8. Microsoft assigns it the label ks_c_5601-1987,[9][10] which properly applies to KS X 1001 itself. However, the W3C/WHATWG Encoding Standard used by HTML5 incorporates the Unified Hangul Code extensions into its definition of "EUC-KR",[1] which it treats interchangeably with "ks_c_5601-1987" with the intent of being "compatible with deployed content".[11]


  1. ^ a b "5. Indexes (§ index EUC-KR)", Encoding Standard, WHATWG
  2. ^ a b "INFO: Hangul (Korean) Character Sets", Microsoft Support, Microsoft
  3. ^ "한글 코드에 대하여" (in Korean). W3C.
  4. ^ "Coded character set identifiers - CCSID 1363", IBM Globalization, IBM, archived from the original on 2014-11-29
  5. ^ "IBM-1363", Converter Explorer, International Components for Unicode
  6. ^ "Windows-949", Converter Explorer, International Components for Unicode
  7. ^ Kaplan, Michael S. (2005-09-17), "When is a backslash not a backslash?", Sorting it all out
  8. ^ "Character Sets". Iana.org. Retrieved 2017-01-11.
  9. ^ "Encoding.WindowsCodePage Property - .NET Framework (current version)". MSDN. Microsoft.
  10. ^ "Code Page Identifiers", Windows Dev Center, Microsoft
  11. ^ "4.2. Names and labels". Encoding Standard. WHATWG.

External links[edit]