Unified Hangul Code
Layout of the Unified Hangul Code
|Alias(es)||Windows Code Page 949|
|Standard||WHATWG Encoding Standard (as "EUC-KR")|
|Classification||Extended ISO 646, Variable-width encoding, CJK encoding|
Unified Hangul Code (UHC, Korean: 통합형 한글 코드, translit. Tonghabhyeong Hangeul Kodeu), known on Microsoft Windows as Code Page 949 (Windows-949), is the Microsoft Windows code page for the Korean language. It extends Wansung Code (KS C 5601:1987, encoded as EUC-KR) to include all 11,172 Hangul syllables present in Johab (KS C 5601:1992 annex 3). These are the same precomposed syllables that are available in Unicode 2.0 and later.
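The relationship between the Wansung subset and the extension can be checked with a short script. The following is a minimal sketch, assuming Python's built-in `cp949` codec as a stand-in for Windows-949 and assuming that a syllable belongs to the Wansung (EUC-KR) subset exactly when both of its bytes are 0xA1 or higher; the function name `classify_syllables` is made up for illustration.

```python
# Precomposed Hangul syllables occupy U+AC00..U+D7A3 in Unicode.
HANGUL_FIRST, HANGUL_LAST = 0xAC00, 0xD7A3


def classify_syllables():
    """Count syllables in the Wansung byte range versus the UHC extension.

    Assumption: both bytes >= 0xA1 means the code lies in the EUC-KR
    (Wansung) area; anything else is treated as a UHC extension code.
    """
    wansung = extended = 0
    for cp in range(HANGUL_FIRST, HANGUL_LAST + 1):
        # Unpacking fails loudly if a syllable is not exactly two bytes.
        lead, trail = chr(cp).encode("cp949")
        if lead >= 0xA1 and trail >= 0xA1:
            wansung += 1
        else:
            extended += 1
    return wansung, extended


wansung_count, extended_count = classify_syllables()
print("syllables encodable:", wansung_count + extended_count)
print("in Wansung byte range:", wansung_count)
print("in UHC extension:", extended_count)
```

Every syllable in the Unicode block should encode to exactly two bytes; the split between the two counts shows how much of the repertoire comes from the extension rather than from Wansung Code.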
IBM's code page for Unified Hangul Code is Code page 1363 (IBM-1363), also called "Korean MS-Win". It is a combination of Code page 1126 and Code page 1362. It differs from Windows-949 in mapping the single byte 0x5C to the won sign (U+20A9); Windows maps 0x5C to U+005C (the Unicode code point for the backslash), as in ASCII, although fonts often still render it as a won sign. IBM's code page 949 is a different extension of EUC-KR.
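The 0x5C difference can be illustrated as follows. This is only a sketch: Python ships a `cp949` codec but no IBM-1363 codec, so `decode_ibm_1363_like` is a hypothetical helper that models the single 0x5C difference described above and nothing else about IBM-1363.

```python
WON_SIGN = "\u20a9"  # U+20A9 WON SIGN


def decode_windows_949(data: bytes) -> str:
    """Decode with Python's cp949 codec, where byte 0x5C is U+005C."""
    return data.decode("cp949")


def decode_ibm_1363_like(data: bytes) -> str:
    """Hypothetical approximation of IBM-1363 decoding.

    Assumption: the only modelled difference is that the single byte 0x5C
    becomes U+20A9. No double-byte cp949 sequence decodes to U+005C, so
    replacing the character after decoding affects only single 0x5C bytes.
    """
    return data.decode("cp949").replace("\\", WON_SIGN)


sample = b"\x5c1000"
print(repr(decode_windows_949(sample)))    # backslash followed by 1000
print(repr(decode_ibm_1363_like(sample)))  # won sign followed by 1000
```

The same bytes therefore yield different Unicode text depending on which table is applied, even though a Korean font may draw both results with the same glyph.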
The code page is not registered with IANA as a character set for communicating information over the Internet; UTF-8 is one alternative. Microsoft assigns it the label ks_c_5601-1987, which properly applies to KS X 1001 itself. However, the W3C/WHATWG Encoding Standard used by HTML5 incorporates the Unified Hangul Code extensions into its definition of "EUC-KR", which it treats interchangeably with "ks_c_5601-1987" with the intent of being "compatible with deployed content".
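The labelling mismatch matters in practice because some libraries take "ks_c_5601-1987" literally. The sketch below assumes Python, whose standard alias table resolves that label to the stricter `euc_kr` codec; `WEB_EUC_KR_LABELS` (a partial list of labels) and `decode_web_korean` are hypothetical names, and routing these labels to `cp949` only approximates the Encoding Standard's behaviour.

```python
import codecs

# A few of the labels the Encoding Standard treats as "EUC-KR"
# (illustrative subset, not the full list).
WEB_EUC_KR_LABELS = {"euc-kr", "ks_c_5601-1987", "windows-949"}


def decode_web_korean(data: bytes, label: str) -> str:
    """Decode content the way a browser-compatible decoder might.

    Assumption: Python's cp949 codec is close enough to the Encoding
    Standard's EUC-KR index for illustration purposes.
    """
    if label.strip().lower() in WEB_EUC_KR_LABELS:
        return data.decode("cp949")
    return data.decode(label)


# U+AC02 lies outside Wansung Code, so its bytes use the UHC extension.
raw = "\uac02".encode("cp949")

print(codecs.lookup("ks_c_5601-1987").name)       # Python's own resolution
print(decode_web_korean(raw, "ks_c_5601-1987"))   # decodes via cp949

try:
    raw.decode("ks_c_5601-1987")
except UnicodeDecodeError as exc:
    print("strict decoder rejected the bytes:", exc.reason)
```

Content labelled "ks_c_5601-1987" that actually contains extension bytes decodes under the permissive mapping but is rejected when the label is taken at face value.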