Jump to content

Latin-1 Supplement

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by 2620:101:f000:3400::dad (talk) at 22:46, 14 March 2016 (Character table: Provided proper links for letters.). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

C1 Controls and Latin-1 Supplement
RangeU+0080..U+00FF
(128 code points)
PlaneBMP
ScriptsLatin (64 char.)
Common (64 char.)
Major alphabetsFrench
German
Spanish
Icelandic
Vietnamese
Symbol setsPunctuation
Mathematics
Currency
Assigned128 code points
33 Control or Format
Unused0 reserved code points
Source standardsISO/IEC 8859-1
Unicode version history
1.0.0 (1991)128 (+128)
Unicode documentation
Code chart ∣ Web page
Note: [1][2]

The Latin-1 Supplement (also called C1 Controls and Latin-1 Supplement) is the second Unicode block in the Unicode standard. It encodes the upper range of ISO 8859-1: 80 (U+0080) - FF (U+00FF). Controls C1 (0080–009F) are not graphic.

The C1 Controls and Latin-1 Supplement block has been included in its present form, with the same character repertoire since version 1.0 of the Unicode Standard, where it was known as Latin 1.[2]

Character table

Code Result Description
C1 Controls
U+0080 PAD – Padding Character
U+0081 HOP – High Octet Preset
U+0082 BPH – Break Permitted Here
U+0083 NBH – No Break Here
U+0084 IND – Index
U+0085 NEL – Next Line
U+0086 SSA – Start of Selected Area
U+0087 ESA – End of Selected Area
U+0088 HTS – Character (Horizontal) Tabulation Set
U+0089 HTJ – Character (Horizontal) Tabulation with Justification
U+008A LTS – Line (Vertical) Tabulation Set
U+008B PLD – Partial Line Forward (Down)
U+008C PLU – Partial Line Backward (Up)
U+008D RI – Reverse Line Feed (Index)
U+008E SS2 – Single-Shift Two
U+008F SS3 – Single-Shift Three
U+0090 DCS – Device Control String
U+0091 PU1 – Private Use 1
U+0092 PU2 – Private Use 2
U+0093 STS – Set Transmit State
U+0094 CCH – Cancel character
U+0095 MW – Message Waiting
U+0096 SPA – Start of Protected Area
U+0097 EPA – End of Protected Area
U+0098 SOS – Start of String
U+0099 SGCI – Single Graphic Character Introducer
U+009A SCI – Single Character Introducer
U+009B CSI – Control Sequence Introducer
U+009C ST – String Terminator
U+009D OSC – Operating System Command
U+009E PM – Private Message
U+009F APC – Application Program Command
Latin-1 Punctuation and Symbols
U+00A0   Non-breaking space
U+00A1 ¡ Inverted Exclamation Mark
U+00A2 ¢ Cent sign
U+00A3 £ Pound sign
U+00A4 ¤ Currency sign
U+00A5 ¥ Yen sign
U+00A6 ¦ Broken bar
U+00A7 § Section sign
U+00A8 ¨ Diaeresis
U+00A9 © Copyright sign
U+00AA ª Feminine Ordinal Indicator
U+00AB « Left-pointing double angle quotation mark
U+00AC ¬ Not sign
U+00AD Soft hyphen
U+00AE ® Registered sign
U+00AF ¯ Macron
U+00B0 ° Degree symbol
U+00B1 ± Plus-minus sign
U+00B2 ² Superscript two
U+00B3 ³ Superscript three
U+00B4 ´ Acute accent
U+00B5 µ Micro sign
U+00B6 Pilcrow sign
U+00B7 · Middle dot
U+00B8 ¸ Cedilla
U+00B9 ¹ Superscript one
U+00BA º Masculine ordinal indicator
U+00BB » Right-pointing double-angle quotation mark
U+00BC ¼ Vulgar fraction one quarter
U+00BD ½ Vulgar fraction one half
U+00BE ¾ Vulgar fraction three quarters
U+00BF ¿ Inverted Question Mark
Letters
U+00C0 À Latin Capital Letter A with grave
U+00C1 Á Latin Capital letter A with acute
U+00C2 Â Latin Capital letter A with circumflex
U+00C3 Ã Latin Capital letter A with tilde
U+00C4 Ä Latin Capital letter A with diaeresis
U+00C5 Å Latin Capital letter A with ring above
U+00C6 Æ Latin Capital letter AE
U+00C7 Ç Latin Capital letter C with cedilla
U+00C8 È Latin Capital letter E with grave
U+00C9 É Latin Capital letter E with acute
U+00CA Ê Latin Capital letter E with circumflex
U+00CB Ë Latin Capital letter E with diaeresis
U+00CC Ì Latin Capital letter I with grave
U+00CD Í Latin Capital letter I with acute
U+00CE Î Latin Capital letter I with circumflex
U+00CF Ï Latin Capital letter I with diaeresis
U+00D0 Ð Latin Capital letter Eth
U+00D1 Ñ Latin Capital letter N with tilde
U+00D2 Ò Latin Capital letter O with grave
U+00D3 Ó Latin Capital letter O with acute
U+00D4 Ô Latin Capital letter O with circumflex
U+00D5 Õ Latin Capital letter O with tilde
U+00D6 Ö Latin Capital letter O with diaeresis
Mathematical Operator
U+00D7 × Multiplication sign
Letters
U+00D8 Ø Latin Capital letter O with stroke
U+00D9 Ù Latin Capital letter U with grave
U+00DA Ú Latin Capital letter U with acute
U+00DB Û Latin Capital Letter U with circumflex
U+00DC Ü Latin Capital Letter U with diaeresis
U+00DD Ý Latin Capital Letter Y with acute
U+00DE Þ Latin Capital Letter Thorn
U+00DF ß Latin Small Letter sharp S
U+00E0 à Latin Small Letter A with grave
U+00E1 á Latin Small Letter A with acute
U+00E2 â Latin Small Letter A with circumflex
U+00E3 ã Latin Small Letter A with tilde
U+00E4 ä Latin Small Letter A with diaeresis
U+00E5 å Latin Small Letter A with ring above
U+00E6 æ Latin Small Letter AE
U+00E7 ç Latin Small Letter C with cedilla
U+00E8 è Latin Small Letter E with grave
U+00E9 é Latin Small Letter E with acute
U+00EA ê Latin Small Letter E with circumflex
U+00EB ë Latin Small Letter E with diaeresis
U+00EC ì Latin Small Letter I with grave
U+00ED í Latin Small Letter I with acute
U+00EE î Latin Small Letter I with circumflex
U+00EF ï Latin Small Letter I with diaeresis
U+00F0 ð Latin Small Letter Eth
U+00F1 ñ Latin Small Letter N with tilde
U+00F2 ò Latin Small Letter O with grave
U+00F3 ó Latin Small Letter O with acute
U+00F4 ô Latin Small Letter O with circumflex
U+00F5 õ Latin Small Letter O with tilde
U+00F6 ö Latin Small Letter O with diaeresis
Mathematical Operator
U+00F7 ÷ Division sign
Letters
U+00F8 ø Latin Small Letter O with stroke
U+00F9 ù Latin Small Letter U with grave
U+00FA ú Latin Small Letter U with acute
U+00FB û Latin Small Letter U with circumflex
U+00FC ü Latin Small Letter U with diaeresis
U+00FD ý Latin Small Letter Y with acute
U+00FE þ Latin Small Letter Thorn
U+00FF ÿ Latin Small Letter Y with diaeresis

Subheadings

The C1 Controls and Latin-1 Supplement block has four subheadings within its character collection: C1 Controls, Latin-1 Punctuation and Symbols, Letters, and Mathematical Operator(s).[3]

C1 controls

The C1 Controls subheading contains 32 supplementary control codes inherited from ISO/IEC 8859-1 and many other 8-bit character standards. The alias names for the C0 and C1 control codes are taken from ISO/IEC 6429:1992.[3]

Latin-1 punctuation and symbols

The Latin-1 Punctuation and Symbols subheading contains 32 characters of common international punctuation characters, such as inverted exclamation and question marks, and a middle dot; and symbols like foreign currency signs, spacing diacritic marks, vulgar fraction, and superscript numbers.[3]

Letters

The Letters subheading contains 30 pairs of majuscule and minuscule accented or novel Latin characters for western European languages, and two extra minuscule characters not commonly used word-initially.[3]

Mathematical operator

The Mathematical Operator subheading is used for the multiplication and division signs.[3]

Compact table

C1 Controls and Latin-1 Supplement[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+008x XXX XXX BPH NBH  IND NEL SSA ESA HTS HTJ VTS PLD PLU  RI   SS2 SS3
U+009x DCS PU1 PU2 STS CCH  MW  SPA EPA SOS XXX SCI  CSI   ST  OSC  PM  APC
U+00Ax NBSP ¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬ SHY ® ¯
U+00Bx ° ± ² ³ ´ µ · ¸ ¹ º » ¼ ½ ¾ ¿
U+00Cx À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï
U+00Dx Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß
U+00Ex à á â ã ä å æ ç è é ê ë ì í î ï
U+00Fx ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ
1.^ As of Unicode version 16.0

Emoji

The Latin-1 Supplement block contains two emoji: U+00A9 and U+00AE.[4][5]

The block has four standardized variants defined to specify emoji-style (U+FE0F VS16) or text presentation (U+FE0E VS15) for the two emoji, both of which default to a text presentation.[6][7]

Emoji variation sequences
U+ 00A9 00AE
base codepoint © ®
base+VS15 (text) ©︎ ®︎
base+VS16 (emoji) ©️ ®️

See also

  1. ^ "Unicode character database". The Unicode Standard. Retrieved 22 March 2013.
  2. ^ a b The Unicode Standard Version 1.0, Volume 1. Addison-Wesley Publishing Company, Inc. 1991 [1990]. ISBN 0-201-56788-1.
  3. ^ a b c d e "Unicode 6.2 code charts" (PDF). The Unicode Standard. Retrieved 1 April 2013.
  4. ^ "UTR #51: Unicode Emoji". Unicode Consortium. 2015-11-12.
  5. ^ "UCD: Emoji Data for UTR #51". Unicode Consortium. 2015-11-11.
  6. ^ "Unicode Character Database: Standardized Variants". The Unicode Consortium.
  7. ^ "Unicode Character Database: Standardized Variation Sequences". The Unicode Consortium.