GEM character set
The GEM character set is the character set of Digital Research's graphical user interface GEM on Intel platforms. It is based on code page 437, the original character set of the IBM PC, and like that set includes ASCII codes 32–126, extended codes for accented letters (diacritics), and other symbols. It differs from code page 437 in using other dingbats at code points 0–31, in exchanging the box-drawing characters 176–223 for international characters and other symbols, and exchanging code point 236 with the symbol for line integral. However, GEM is more similar to code page 865 because the codepoints of Ø and ø match the codepoints in that codepage.
The Motorola-based GEM adaptation for the Atari ST family of computers utilized the similar Atari ST character set. It has swapped ¢ and ø and has also swapped ¥ and Ø (to match code page 437 more). It also has the ß (sharp s) at code point 158, reversed not sign (⌐) at code point 169 (as in code page 437), not sign (¬) at code point 170 (as in code page 437), ½ at code point 171 (as in code page 437), ¼ at code point 172 (as in code page 437), ¨ (diaeresis) at code point 184, ´ (acute) at code point 185, ij at code point 192, IJ at code point 193, Hebrew characters at code points 194-220, section sign (§) at code point 221, logical and at code point 222, infinity sign at code point 223, bullet (•) at codepoint 249, cubed sign (superscript three) at code point 254, the macron at code point 255, ATARI-specific characters at codepoints 5, 6, 7, 14, 15, 28, 29, 30, and 31, LED 0-9 at codepoints 16-25, and ə (Schwa) at codepoint 26. Codepoints 12, 13, and 27 are mapped to the C0 controls.
A slight adaptation for Ventura Publisher is the similar Ventura International character set, it has code points 0-31, 127, and 218-255 empty, and has swapped ¢ and ø and has also swapped ¥ and Ø (to match code page 437 more).
In contrast to this, the GEM-derived file manager ViewMAX, which shipped with some versions of DR DOS as a DOSSHELL replacement, does not use the GEM character set, but loads its display fonts from DOS .CPI files[1][2][3][4][5][6] depending on the system's current code page.
Character set
The following table shows the GEM character set. Each character is shown with a potential Unicode equivalent, although some codes do not have a unique, single Unicode equivalent; the correct choice may depend upon context. Note that code point 20 (1416) has an unfilled paragraph sign, and code point 188 (BC16) has a filled paragraph sign.
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
0x | NUL | ⇧[a] | ⇩[a] | ⇨[a] | ⇦[a] | ◼[a] | 🗗[a] | ◆[a] | ✓[a] | 🕒︎ | 🔔︎ | ♪ | ▴ | ▾ | ▸ | ◂ |
1x | ► | ◄ | ⧓[a] | ▂[a] | ¶ | § | ↕ | ↨ | ↑ | ↓ | → | ← | ∟ | ↔ | ▲ | ▼ |
2x | SP | ! | " | # | $ | % | & | ' | ( | ) | * | + | , | - | . | / |
3x | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | : | ; | < | = | > | ? |
4x | @ | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O |
5x | P | Q | R | S | T | U | V | W | X | Y | Z | [ | \ | ] | ^ | _ |
6x | ` | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o |
7x | p | q | r | s | t | u | v | w | x | y | z | { | | | } | ~ | ⌂ |
8x | Ç | ü | é | â | ä | à | å | ç | ê | ë | è | ï | î | ì | Ä | Å |
9x | É | æ | Æ | ô | ö | ò | û | ù | ÿ | Ö | Ü | ø | £ | Ø | ¤ | ƒ |
Ax | á | í | ó | ú | ñ | Ñ | ª | º | ¿ | “ | ” | ‹ | › | ¡ | « | » |
Bx | ã | õ | ¥ | ¢ | œ | Œ | À | à | Õ | § | ‡ | † | ¶ | © | ® | ™ |
Cx | „ | … | ‰ | • | – | — | ° | Á | Â | È | Ê | Ë | Ì | Í | Î | Ï |
Dx | Ò | Ó | Ô | Š | š | Ù | Ú | Û | Ÿ | ß | ||||||
Ex | α | β | Γ | π | Σ | σ | µ[b] | τ | Φ | Θ | Ω[c] | δ[d] | ∮ | ɸ | ∈ | ∩ |
Fx | ≡ | ± | ≥ | ≤ | ⌠ | ⌡ | ÷ | ≈ | ° | ∙ | · | √ | ⁿ | ² | ■ | ∅ |
- ^ a b c d e f g h i j 1–8 and 18-19 are used in GEM AES user interface elements such as icons for closing, resizing and maximizing windows.
- ^ 230 (E6hex) is both the micro sign (U+00B5, µ) and the Greek lowercase mu (U+03BC, μ).
- ^ 234 (EAhex) is both the ohm sign (U+2126, Ω) and the Greek uppercase omega (U+03A9, Ω). (Unicode considers the ohm sign to be equivalent to uppercase omega, and suggests that the latter be used in both contexts.[10])
- ^ 235 (EBhex) is the Greek lowercase delta (U+03B4, δ), but it has also been used as a surrogate for the Icelandic lowercase eth (U+00F0, ð) and the partial derivative sign (U+2202, ∂).
See also
- Atari ST character set
- Western Latin character sets (computing)
- Alt codes
- Bitstream International Character Set
- Ventura International Character Set
References
- ^ Paul, Matthias R. (2001-06-10) [1995]. "Format description of DOS, OS/2, and Windows NT .CPI, and Linux .CP files" (CPI.LST file) (1.30 ed.). Archived from the original on 2016-04-20. Retrieved 2016-08-20.
- ^ Elliott, John C. (2006-10-14). "CPI file format". Seasip.info. Archived from the original on 2016-09-22. Retrieved 2016-09-22.
- ^ a b Elliott, John C. (2006-09-03). "Codepage-related software". Seasip.info. Archived from the original on 2016-11-08. Retrieved 2016-11-09.
- ^ Brouwer, Andries Evert (2001-02-10). "CPI fonts". 0.2. Archived from the original on 2016-09-22. Retrieved 2016-09-22.
- ^ Haralambous, Yannis (September 2007). Fonts & Encodings. Translated by Horne, P. Scott (1 ed.). Sebastopol, California, USA: O'Reilly Media, Inc. pp. 601–602, 611. ISBN 978-0-596-10242-5.
- ^ MS-DOS Programmer's Reference. Microsoft Press. 1991. ISBN 1-55615-329-5.
- ^ "WordPlus GEM character set". Archived from the original on 2017-01-23. Retrieved 2017-01-23.
- ^ Lineback, Nathan. "GEM 3.11 Screen Shots". Toastytech.com. Archived from the original on 2019-11-18. Retrieved 2016-08-01.
- ^ Elliott, John C. (2006-09-04). "Fonts for Intel GEM - System Fonts". Seasip.info. Archived from the original on 2017-02-06. Retrieved 2016-09-22.
- ^ The Unicode Consortium (2003-05-21). "Chapter 7: European Alphabetic Scripts" (PDF). The Unicode Standard 4.0. Addison-Wesley (published August 2003). p. 176. ISBN 0-321-18578-1. Retrieved 2016-06-09.