Latin-1 Supplement
C1 Controls and Latin-1 Supplement | |
---|---|
Range | U+0080..U+00FF (128 code points) |
Plane | BMP |
Scripts | Latin (64 char.) Common (64 char.) |
Major alphabets | French German Spanish Icelandic Vietnamese |
Symbol sets | Punctuation Mathematics Currency |
Assigned | 128 code points 33 Control or Format |
Unused | 0 reserved code points |
Source standards | ISO/IEC 8859-1 |
Unicode version history | |
1.0.0 (1991) | 128 (+128) |
Unicode documentation | |
Code chart ∣ Web page | |
Note: [1][2] |
The Latin-1 Supplement (also called C1 Controls and Latin-1 Supplement) is the second Unicode block in the Unicode standard. It encodes the upper range of ISO 8859-1: 80 (U+0080) - FF (U+00FF). Controls C1 (0080–009F) are not graphic.
The C1 Controls and Latin-1 Supplement block has been included in its present form, with the same character repertoire since version 1.0 of the Unicode Standard, where it was known as Latin 1.[2]
Character table
Subheadings
The C1 Controls and Latin-1 Supplement block has four subheadings within its character collection: C1 Controls, Latin-1 Punctuation and Symbols, Letters, and Mathematical Operator(s).[3]
C1 controls
The C1 Controls subheading contains 32 supplementary control codes inherited from ISO/IEC 8859-1 and many other 8-bit character standards. The alias names for the C0 and C1 control codes are taken from ISO/IEC 6429:1992.[3]
Latin-1 punctuation and symbols
The Latin-1 Punctuation and Symbols subheading contains 32 characters of common international punctuation characters, such as inverted exclamation and question marks, and a middle dot; and symbols like foreign currency signs, spacing diacritic marks, vulgar fraction, and superscript numbers.[3]
Letters
The Letters subheading contains 30 pairs of majuscule and minuscule accented or novel Latin characters for western European languages, and two extra minuscule characters not commonly used word-initially.[3]
Mathematical operator
The Mathematical Operator subheading is used for the multiplication and division signs.[3]
Compact table
C1 Controls and Latin-1 Supplement[1] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+008x | XXX | XXX | BPH | NBH | IND | NEL | SSA | ESA | HTS | HTJ | VTS | PLD | PLU | RI | SS2 | SS3 |
U+009x | DCS | PU1 | PU2 | STS | CCH | MW | SPA | EPA | SOS | XXX | SCI | CSI | ST | OSC | PM | APC |
U+00Ax | NBSP | ¡ | ¢ | £ | ¤ | ¥ | ¦ | § | ¨ | © | ª | « | ¬ | SHY | ® | ¯ |
U+00Bx | ° | ± | ² | ³ | ´ | µ | ¶ | · | ¸ | ¹ | º | » | ¼ | ½ | ¾ | ¿ |
U+00Cx | À | Á | Â | Ã | Ä | Å | Æ | Ç | È | É | Ê | Ë | Ì | Í | Î | Ï |
U+00Dx | Ð | Ñ | Ò | Ó | Ô | Õ | Ö | × | Ø | Ù | Ú | Û | Ü | Ý | Þ | ß |
U+00Ex | à | á | â | ã | ä | å | æ | ç | è | é | ê | ë | ì | í | î | ï |
U+00Fx | ð | ñ | ò | ó | ô | õ | ö | ÷ | ø | ù | ú | û | ü | ý | þ | ÿ |
1.^ As of Unicode version 16.0 |
Emoji
The Latin-1 Supplement block contains two emoji: U+00A9 and U+00AE.[4][5]
The block has four standardized variants defined to specify emoji-style (U+FE0F VS16) or text presentation (U+FE0E VS15) for the two emoji, both of which default to a text presentation.[6][7]
U+ | 00A9 | 00AE |
base codepoint | © | ® |
base+VS15 (text) | ©︎ | ®︎ |
base+VS16 (emoji) | ©️ | ®️ |
See also
External links
- ^ "Unicode character database". The Unicode Standard. Retrieved 22 March 2013.
- ^ a b The Unicode Standard Version 1.0, Volume 1. Addison-Wesley Publishing Company, Inc. 1991 [1990]. ISBN 0-201-56788-1.
- ^ a b c d e "Unicode 6.2 code charts" (PDF). The Unicode Standard. Retrieved 1 April 2013.
- ^ "UTR #51: Unicode Emoji". Unicode Consortium. 2015-11-12.
- ^ "UCD: Emoji Data for UTR #51". Unicode Consortium. 2015-11-11.
- ^ "Unicode Character Database: Standardized Variants". The Unicode Consortium.
- ^ "Unicode Character Database: Standardized Variation Sequences". The Unicode Consortium.