Template talk:Character encoding
WikiProject Computing (Rated Template-class)
Encoding vs. TES
HZ is a Transfer Encoding Syntax (TES; see UTR17) of GB2312, not a character encoding proper. Nor is it a national standard. If it is kept in this template at all, it should be in the misc section.
Similarly, UTF-7 is also a TES, not a UTF (despite the name). So I was thinking of removing UTF-7 from this template. It's included in the "Table Unicode" template, and I think that is enough.
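To illustrate the TES point, here is a rough sketch using Python's standard `gb2312`, `hz`, and `utf-7` codecs. The sample strings and the helper `is_7bit` are just my own choices for illustration, and Python's codec behaviour is assumed to follow the usual HZ and UTF-7 descriptions. It shows HZ carrying the same GB2312 codes in a 7-bit-safe wrapper, and UTF-7 likewise producing only 7-bit bytes.

```python
# Sketch: HZ and UTF-7 re-express an underlying encoding as 7-bit bytes.
text = "中文"

gb2312_bytes = text.encode("gb2312")  # bytes of the underlying GB2312 encoding
hz_bytes = text.encode("hz")          # the same GB2312 codes, HZ-wrapped

# HZ brackets a GB2312 run with ~{ and ~} and clears the high bit of each byte.
hz_by_hand = b"~{" + bytes(b & 0x7F for b in gb2312_bytes) + b"~}"

# UTF-7 similarly wraps non-ASCII code units in a base64-style shifted run.
utf7_text = "Hi €"
utf7_bytes = utf7_text.encode("utf-7")


def is_7bit(data: bytes) -> bool:
    """Return True if every byte has its high bit clear."""
    return all(b < 0x80 for b in data)


print("GB2312:", gb2312_bytes, "7-bit:", is_7bit(gb2312_bytes))
print("HZ:    ", hz_bytes, "7-bit:", is_7bit(hz_bytes))
print("UTF-7: ", utf7_bytes, "7-bit:", is_7bit(utf7_bytes))
```

In both cases the wrapped form decodes back to the same text, so neither HZ nor UTF-7 adds any characters of its own.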
I've tried to group certain encodings in a "logical" way. For instance, even if the GOST standard is/was a national standard, it's for 4-, 5-, and 6-bit character encodings, which are not something used in modern computers, so it's amongst the "misc" items. Likewise, HKSCS is near Big5 and CP950 since they are so closely related. Etc.
The Big5-HKSCS encoding is not really supported by Windows. Windows 950 should not be considered HKSCS compatible by default. Windows Vista only supports the Unicode characters of Big5-HKSCS. Microsoft HKSCS —Preceding unsigned comment added by 18.104.22.168 (talk) 04:57, 26 July 2009 (UTC)
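For anyone who wants to check individual characters, here is a small sketch using Python's `big5`, `cp950`, and `big5hkscs` codecs. It assumes those codec tables approximate the real encodings, which may not match what a given Windows version actually does. The helper name `can_encode` and the sample characters are my own choices, and the second character is only assumed to be an HKSCS addition.

```python
def can_encode(ch: str, codec: str) -> bool:
    """Return True if `ch` can be encoded with the named Python codec."""
    try:
        ch.encode(codec)
        return True
    except UnicodeEncodeError:
        return False


# First character: common in plain Big5. Second: assumed to be an HKSCS addition.
samples = ["中", "㗎"]
codecs_to_try = ("big5", "cp950", "big5hkscs")

for ch in samples:
    support = {codec: can_encode(ch, codec) for codec in codecs_to_try}
    print(f"U+{ord(ch):04X}", support)
```

A character that encodes under `big5hkscs` but not under `cp950` would be one concrete case where CP950 is not HKSCS compatible.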
UTF-8, read that article please. It is not a "single character" (like horizontal tabulation, backspace etc.), it is a piece of encoding troubles related to line separation. Incnis Mrsi (talk) 09:06, 15 March 2010 (UTC)
I notice that a few code pages are missing, namely the following:
- Code page 708 (Arabic ASMO);
- Code page 851 (Greek III);
- Code page 853 (Latin III);
- Code page 868 (IBM Persian);
- Code page 934 (MS-DOS Korean);
- Code page 938 (MS-DOS Taiwanese);
- Code page 999 (Yugoslavian ASCII-7).
I have the Korean edition of MS-DOS 6.2, which uses code page 934. Both it and code page 938 are also referenced in the MS-DOS 6.22 COUNTRY.TXT file.
MS-DOS code page 999 seems to be the code page form of the Yugoslavian ASCII-7 encoding, which was commonly used, especially in Croatia and Slovenia, before the advent of code page 852. One notable user of it is the software of the Slovenian programming company SAOP.