|This article is of interest to the following WikiProjects:|
Shouldn't the Unicode input methods require a page of its own? Or should it be made an item in Unicode? — Hhielscher
moved it to Unicode, as (Unicode in use).(Input methods). --Mac-arena the Bored Zo 15:21, 2004 Dec 28 (UTC)
Charset table as ACII
I removed the following section. See below for rationale. --Pjacobi 10:51, September 5, 2005 (UTC)
- Extended ASCII Table
- Extended ASCII uses 8 digits of 1's and 0's for a total of 256 characters. The first 32 however cannot be shown as they are special control sequences and thus they cannot be printed. Also the two blanks are space characters.
IMHO this section is a bad idea for two reasons:
- Giving some specific charset, whereas the article correctly states, that there are plenty.
- Using graphics for a text table
10:51, September 5, 2005 (UTC)
- I agree on the second reson just some of the charecters cant be displayed on the web as far as I know If anyone knows a way a regular table would be nicer. However on the first part ASCII has a table plus I believe a table could be beneficial and a major point of interest it can teach you a fair amount about computers. Or maybe a text table of 00100001 through 01111110 (! through ~ on the chart) as they are the most common used? or just 0-9 and A-Z? --Shimonnyman 11:28, 5 September 2005 (UTC)
- There is an ASCII table in ASCII.
- Extended ASCII tables are in ISO 8859-1, ISO 8859-2, ISO 8859-3, ISO 8859-4, ISO 8859-5, ISO 8859-6, ISO 8859-7, ISO 8859-8, ISO 8859-9, ISO 8859-10, ISO 8859-11, ISO 8859-13, ISO 8859-14, ISO 8859-15, ISO 8859-16, Code page 437, Code page 850, Code page 858, KOI8-R, KOI8-U, TSCII, Mac-Roman encoding, Kamenicky encoding (list is incomplete).
- Pjacobi 13:19, September 5, 2005 (UTC)
Those arent Extended ASCII binary tables, thats what i was refering to thinking could help but the ASCII table on the ASCII page appears to be extended ASCII (a bit incomplete) I didnt check every code but every one i checked was extended ASCII and they arent the same in both because well obviously 7-bit and 8-bit arent going to look identicle. So maybe it belongs here, I dont know anyways just observing.
How are They Stored?
If you have a string of characters, how does one differentiate between two different code pages? I'd imagine that some form of escape character would be used. For example, if I'm writing using ISO-88591 and want to switch over to greek, what is the internal change, on non-unicode terminals? 188.8.131.52 07:23, 26 November 2005 (UTC)
- In general, the only way to conclusively know the encoding of a sequence of bytes is with some out of band meta-information. For example, MIME and HTTP use the Content-Type header to specify the character set and encoding.
- For your second question, it sounds like you want to mix characters from different character sets. That is difficult or impossible to do without Unicode. There are some encoding formats, such as ISO-2022 that permit mixing encodings, but Unicode is generally the best choice. 184.108.40.206 20:04, 21 December 2005 (UTC)
- Some programs like Microsoft Internet Explorer try to "guess" the code page based on how the text looks like in each code page 220.127.116.11 21:27, 25 April 2007 (UTC)
ANSI or ASCII?
What is the difference between ANSI and Extended ASCII?
- ANSI's not a character set (it's used to refer to the ANSI escape sequences). Tedickey (talk) 21:44, 30 October 2009 (UTC)
- ‘ANSI’ as used in Microsoft Windows programming means any character encoding other than Unicode (the exact meaning depending on the codepage setting and particular language version of Windows in use). A number of APIs have both A (‘ANSI’) and W (wide, i.e. Unicode) versions of functions.
- I guess this is an example of double metonymy: ‘ANSI’ being used not to refer to just one of the organization’s standards (as with the X3.64 escape sequences mentioned by Mr Dickey) but to the whole kind of character encoding they might standardize (i.e. byte-level).
- (Note that although the original 7-bit ASCII was an ANSI standard (X3.4), it’s unlikely to be referred to as ANSI: the ASCII name is so much better known.)
- So to answer your question, they’re both vague terms that can mean vaguely the same thing. --18.104.22.168 (talk) 18:59, 3 April 2011 (UTC)
- You are right that wikipedia is not a correct reference, but the above page provide this reference http://msdn.microsoft.com/en-us/goglobal/bb964658.aspx#a where MSDN explain why ANSI is a misnommer.
520256644 identified as vandalism
Variations and extensions
As hundreds or thouthands standards do exist with many variations and common part from one to one other it is difficult to have an overview of each relationship. Next table give an illustration of how ASCII and ASCII extended and variants, have central influence in technology evolutions.
|Original Baudot code (International alphabet n°1)|
|International alphabet n°2 (IA n°2)|
|Variants of EBCDIC and other character encodings|
|ISO 646 - IRV (international reference variant)||Arinc|
|ISO 646 - US (United States)|
|ISO 646: Other countries|
|ISO 646: Other countries|
|ASCII||Code page DOS (437, 850, ...)||ISO 8859 series (for example ISO 8859-1, ISO 8859-15)||ISO 2022 (supports more than 256 characters)||↓|
|Windows code page such as Windows-1252 (or Ansinew)||↓|
|ISO 10646 / Unicode|
|IA n°5||GSM 03.38 (SMS)|
|ASCII||ASCII standard or standards very close from ASCII|
|Extended ASCII||Add additional characters to ASCII ones.|
|Extended ASCII||Add additional characters to ASCII ones, but ASCII bytes may represent other characters depenfing on context|
|ASCII variants||Mostly ASCII, but with some code points representing different characters|
|ASCII subset||Mostly ASCII, but with some code points reserved for national variants|
|Unrelated to ASCII||Not related to ASCII|
|⇟||New encoding; no conservation of previous set of characters|
|⇣||New encoding providing the set of characters yet avilable in the previous one|