Latin-1 Supplement (Unicode block)

From Wikipedia, the free encyclopedia
Jump to: navigation, search
C1 controls and Latin-1 supplement
Range U+0080..U+00FF
(128 code points)
Plane BMP
Scripts Latin
Common
Symbol sets Punctuation
Mathematics
Currency
Major alphabets French
German
Spanish
Icelandic
Vietnamese
Assigned 128 code points
33 Control or Format
Unused 0 reserved code points
Source standard(s) ISO/IEC 8859-1
Unicode version history
1.0.0 128 (+128)
Note: [1][2]

The Latin-1 Supplement (also called C1 controls and Latin-1 supplement) is the second Unicode block in the Unicode standard. It encodes the upper range of ISO 8859-1: 80 (U+0080) - FF (U+00FF). Controls C1 (0080–009F) are not graphic.

The C1 Controls and Latin-1 Supplement block has been included in its present form, with the same character repertoire since version 1.0 of the Unicode Standard, where it was known as Latin 1.[2]

Character table[edit]

Code Result Description
C1 Controls
U+0080 PAD – Padding Character
U+0081 HOP – High Octet Preset
U+0082 BPH – Break Permitted Here
U+0083 NBH – No Break Here
U+0084 IND – Index
U+0085 NEL – Next Line
U+0086 SSA – Start of Selected Area
U+0087 ESA – End of Selected Area
U+0088 HTS – Character (Horizontal) Tabulation Set
U+0089 HTJ – Character (Horizontal) Tabulation with Justification
U+008A LTS – Line (Vertical) Tabulation Set
U+008B PLD – Partial Line Forward (Down)
U+008C PLU – Partial Line Backward (Up)
U+008D RI – Reverse Line Feed (Index)
U+008E SS2 – Single-Shift Two
U+008F SS3 – Single-Shift Three
U+0090 DCS – Device Control String
U+0091 PU1 – Private Use 1
U+0092 PU2 – Private Use 2
U+0093 STS – Set Transmit State
U+0094 CCH – Cancel character
U+0095 MW – Message Waiting
U+0096 SPA – Start of Protected Area
U+0097 EPA – End of Protected Area
U+0098 SOS – Start of String
U+0099 SGCI – Single Graphic Character Introducer
U+009A SCI – Single Character Introducer
U+009B CSI – Control Sequence Introducer
U+009C ST – String Terminator
U+009D OSC – Operating System Command
U+009E PM – Private Message
U+009F APC – Application Program Command
Latin-1 Punctuation and Symbols
U+00A0   Non-breaking space
U+00A1 ¡ Inverted Exclamation Mark
U+00A2 ¢ Cent sign
U+00A3 £ Pound sign
U+00A4 ¤ Currency sign
U+00A5 ¥ Yen sign
U+00A6 ¦ Broken bar
U+00A7 § Section sign
U+00A8 ¨ Diaeresis
U+00A9 © Copyright sign
U+00AA ª Feminine Ordinal Indicator
U+00AB « Left-pointing double angle quotation mark
U+00AC ¬ Not sign
U+00AD ­ Soft hyphen
U+00AE ­® Registered sign
U+00AF ¯ Macron
U+00B0 ° Degree symbol
U+00B1 ± Plus-minus sign
U+00B2 ² Superscript two
U+00B3 ³ Superscript three
U+00B4 ´ Acute accent
U+00B5 µ Micro sign
U+00B6 Pilcrow sign
U+00B7 · Middle dot
U+00B8 ¸ Cedilla
U+00B9 ¹ Superscript one
U+00BA º Masculine ordinal indicator
U+00BB » Right-pointing double-angle quotation mark
U+00BC ¼ Vulgar fraction one quarter
U+00BD ½ Vulgar fraction one half
U+00BE ¾ Vulgar fraction three quarters
U+00BF ¿ Inverted Question Mark
Letters
U+00C0 À Latin Capital Letter A with grave
U+00C1 Á Latin Capital letter A with acute
U+00C2 Â Latin Capital letter A with circumflex
U+00C3 Ã Latin Capital letter A with tilde
U+00C4 Ä Latin Capital letter A with diaeresis
U+00C5 Å Latin Capital letter A with ring above
U+00C6 Æ Latin Capital letter AE
U+00C7 Ç Latin Capital letter C with cedilla
U+00C8 È Latin Capital letter E with grave
U+00C9 É Latin Capital letter E with acute
U+00CA Ê Latin Capital letter E with circumflex
U+00CB Ë Latin Capital letter E with diaeresis
U+00CC Ì Latin Capital letter I with grave
U+00CD Í Latin Capital letter I with acute
U+00CE Î Latin Capital letter I with circumflex
U+00CF Ï Latin Capital letter I with diaeresis
U+00D0 Ð Latin Capital letter Eth
U+00D1 Ñ Latin Capital letter N with tilde
U+00D2 Ò Latin Capital letter O with grave
U+00D3 Ó Latin Capital letter O with acute
U+00D4 Ô Latin Capital letter O with circumflex
U+00D5 Õ Latin Capital letter O with tilde
U+00D6 Ö Latin Capital letter O with diaeresis
Mathematical Operator
U+00D7 × Multiplication sign
Letters
U+00D8 Ø Latin Capital letter O with stroke
U+00D9 Ù Latin Capital letter U with grave
U+00DA Ú Latin Capital letter U with acute
U+00DB Û Latin Capital Letter U with circumflex
U+00DC Ü Latin Capital Letter U with diaeresis
U+00DD Ý Latin Capital Letter Y with acute
U+00DE Þ Latin Capital Letter Thorn
U+00DF ß Latin Small Letter sharp S
U+00E0 à Latin Small Letter A with grave
U+00E1 á Latin Small Letter A with acute
U+00E2 â Latin Small Letter A with circumflex
U+00E3 ã Latin Small Letter A with tilde
U+00E4 ä Latin Small Letter A with diaeresis
U+00E5 å Latin Small Letter A with ring above
U+00E6 æ Latin Small Letter AE
U+00E7 ç Latin Small Letter C with cedilla
U+00E8 è Latin Small Letter E with grave
U+00E9 é Latin Small Letter E with acute
U+00EA ê Latin Small Letter E with circumflex
U+00EB ë Latin Small Letter E with diaeresis
U+00EC ì Latin Small Letter I with grave
U+00ED í Latin Small Letter I with acute
U+00EE î Latin Small Letter I with circumflex
U+00EF ï Latin Small Letter I with diaeresis
U+00F0 ð Latin Small Letter Eth
U+00F1 ñ Latin Small Letter N with tilde
U+00F2 ò Latin Small Letter O with grave
U+00F3 ó Latin Small Letter O with acute
U+00F4 ô Latin Small Letter O with circumflex
U+00F5 õ Latin Small Letter O with tilde
U+00F6 ö Latin Small Letter O with diaeresis
Mathematical Operator
U+00F7 ÷ Division sign
Letters
U+00F8 ø Latin Small Letter O with stroke
U+00F9 ù Latin Small Letter U with grave
U+00FA ú Latin Small Letter U with acute
U+00FB û Latin Small Letter U with circumflex
U+00FC ü Latin Small Letter U with diaeresis
U+00FD ý Latin Small Letter Y with acute
U+00FE þ Latin Small Letter Thorn
U+00FF ÿ Latin Small Letter Y with diaeresis

Subheadings[edit]

The C1 Controls and Latin-1 Supplement block has four subheadings within its character collection: C1 Controls, Latin-1 Punctuation and Symbols, Letters, and Mathematical Operator(s).[3]

C1 Controls[edit]

The C1 Controls subheading contains 32 supplementary control codes inherited from ISO/IEC 8859-1 and many other 8-bit character standards. The alias names for the C0 and C1 control codes are taken from ISO/IEC 6429:1992.[3]

Latin-1 Punctuation and Symbols[edit]

The Latin-1 Punctuation and Symbols subheading contains 32 characters of common international punctuation characters, such as inverted exclamation and question marks, and a middle dot; and symbols like foreign currency signs, spacing diacritic marks, vulgar fraction, and superscript numbers.[3]

Letters[edit]

The Letters subheading contains 30 pairs of majuscule and minuscule accented or novel Latin characters for western European languages, and two extra minuscule characters not commonly used word-initially.[3]

Mathematical Operator(s)[edit]

The Mathematical Operator subheading is used for the multiplication and division signs.[3]

External links[edit]

  1. ^ "Unicode character database". The Unicode Standard. Retrieved 22 March 2013. 
  2. ^ a b The Unicode Standard Version 1.0, Volume 1. Addison-Wesley Publishing Company, Inc. 1990, 1991. ISBN 0-201-56788-1. 
  3. ^ a b c d e "Unicode 6.2 code charts". The Unicode Standard. Retrieved 1 April 2013.