Code page 950

From Wikipedia, the free encyclopedia
Jump to: navigation, search

Code page 950 is Microsoft's implementation of the de facto standard Big5. The code page is not registered with IANA,[1] and hence, is not a standard to communicate information over the internet. The major difference between code page 950 and Big5 is the incorporation of some ETEN characters at F9D6-F9FE (碁, 銹, 裏, 墻, 恒, 粧, and 嫺) and 34 box drawing characters and block elements. It was updated in 2000 with the addition of the euro sign (€), at 0xA3E1. This update is called Code page 1370 by IBM (as IBM did not change Code page 950), but Microsoft Windows refers to it as a Code page 950 update.

Code page 951[edit]

Code page 951 is a misnomer for a hack used in HKSCS-2001 support in Windows XP, derived from the file name.[2] The "new" code page is a replacement for CP950 with Unicode mappings for some Extended User-defined Characters (EUDC) found in HKSCS. HKSCS characters without a Unicode mapping are assigned a Unicode Private Use Area (PUA) code point following previous practises.

Private Use Area usage[edit]

Mapping from Big5 EUDC to PUA code points[3]
Big5 range Unicode range Formula[4]
81 40–8D FE U+EEB8–U+F6B0 0xeeb8 + (157 * (H-0x81)) + (L<0x80)?(L-0x40):(L-0x62)
8E 40–A0 FE U+E311–U+EEB7 0xe311 + (157 * (H-0x8e)) + (L<0x80)?(L-0x40):(L-0x62)
C6 A1–C8 FE U+F6B1–U+F848 0xf672 + (157 * (H-0xc6)) + (L<0x80)?(L-0x40):(L-0x62)
FA 40–FE FE U+E000–U+E310 0xe000 + (157 * (H-0xfa)) + (L<0x80)?(L-0x40):(L-0x62)

This mapping is also used in HKSCS where a given glyph is not yet found in the Unicode revision specified.[5]

See also[edit]

References[edit]

  1. ^ [1]
  2. ^ Steele, Shawn. "CP 951 & HKSCS". I'm not a Klingon. MS Dev Blog. Retrieved 13 September 2016. 
  3. ^ "Windows Best Fit Chart: CP950". unicode.org. Retrieved 13 September 2016. 
  4. ^ "Big5". Kanji Database. Retrieved 13 September 2016. 
  5. ^ "Big5-HKSCS:2008". Archived from the original on 2016-09-13. 

External links[edit]