User:Incnis Mrsi/UCS map

From Wikipedia, the free encyclopedia
Jump to: navigation, search

Mapping of important parts of the Universal Character Set (usually referred to as Unicode) for English Wikipedia.

The table[edit]

Block(s)  0  1  2  3  4  5  6  7  8  9 10
0a
11
0b
12
0c
13
0d
14
0e
15
0f
16
10
17
11
18
12
19
13
20
14
21
15
22
16
23
17
24
18
25
19
26
1a
27
1b
28
1c
29
1d
30
1e
31
1f
U+0000: C0 controls                       
       (skipped)
U+0020: Basic Latin ! " # $ % & ' ( ) * + , - . / 0 1 2 3 4 5 6 7 8 9 : ; < = > ?
U+0040: Basic Latin @ A B C D E F G H I J K L M N O P Q R S T U V W X Y Z [ \ ] ^ _
U+0060: Basic Latin ` a b c d e f g h i j k l m n o p q r s t u v w x y z { | } ~  
U+0080: C1 controls (disallowed in HTML)
U+00a0: Latin-1 Supplement   ¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬ ­ ® ¯ ° ± ² ³ ´ µ · ¸ ¹ º » ¼ ½ ¾ ¿
U+00c0: Latin-1 Supplement À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß
U+00e0: Latin-1 Supplement à á â ã ä å æ ç è é ê ë ì í î ï ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ
U+0100: Latin Extended-A Ā ā Ă ă Ą ą Ć ć Ĉ ĉ Ċ ċ Č č Ď ď Đ đ Ē ē Ĕ ĕ Ė ė Ę ę Ě ě Ĝ ĝ Ğ ğ
U+0120: Latin Extended-A Ġ ġ Ģ ģ Ĥ ĥ Ħ ħ Ĩ ĩ Ī ī Ĭ ĭ Į į İ ı IJ ij Ĵ ĵ Ķ ķ ĸ Ĺ ĺ Ļ ļ Ľ ľ Ŀ
U+0140: Latin Extended-A ŀ Ł ł Ń ń Ņ ņ Ň ň ʼn Ŋ ŋ Ō ō Ŏ ŏ Ő ő Œ œ Ŕ ŕ Ŗ ŗ Ř ř Ś ś Ŝ ŝ Ş ş
U+0160: Latin Extended-A Š š Ţ ţ Ť ť Ŧ ŧ Ũ ũ Ū ū Ŭ ŭ Ů ů Ű ű Ų ų Ŵ ŵ Ŷ ŷ Ÿ Ź ź Ż ż Ž ž ſ
U+0180: Latin Extended-B
U+0250: IPA Extensions ɐ ɑ ɒ ɓ ɔ ɕ ɖ ɗ ɘ ə ɚ ɛ ɜ ɝ ɞ ɟ
U+0260: IPA Extensions ɠ ɡ ɢ ɣ ɤ ɥ ɦ ɧ ɨ ɩ ɪ ɫ ɬ ɭ ɮ ɯ ɰ ɱ ɲ ɳ ɴ ɵ ɶ ɷ ɸ ɹ ɺ ɻ ɼ ɽ ɾ ɿ
U+0280: IPA Extensions ʀ ʁ ʂ ʃ ʄ ʅ ʆ ʇ ʈ ʉ ʊ ʋ ʌ ʍ ʎ ʏ ʐ ʑ ʒ ʓ ʔ ʕ ʖ ʗ ʘ ʙ ʚ ʛ ʜ ʝ ʞ ʟ
U+02a0: IPA Extensions ʠ ʡ ʢ ʣ ʤ ʥ ʦ ʧ ʨ ʩ ʪ ʫ ʬ ʭ ʮ ʯ (skipped)
U+0300: Combining Diacritical Marks ◌̀ ◌́ ◌̂ ◌̃ ◌̄ ◌̅ ◌̆ ◌̇ ◌̈ ◌̉ ◌̊ ◌̋ ◌̌ ◌̍ ◌̎ ◌̏ ◌̐ ◌̑ ◌̒ ◌̓ ◌̔ ◌̕ ◌̖ ◌̗ ◌̘ ◌̙ ◌̚ ◌̛ ◌̜ ◌̝ ◌̞ ◌̟
U+0320: Combining Diacritical Marks ◌̠ ◌̡ ◌̢ ◌̣ ◌̤ ◌̥ ◌̦ ◌̧ ◌̨ ◌̩ ◌̪ ◌̫ ◌̬ ◌̭ ◌̮ ◌̯ ◌̰ ◌̱ ◌̲ ◌̳ ◌̴ ◌̵ ◌̶ ◌̷ ◌̸ ◌̹ ◌̺ ◌̻ ◌̼ ◌̽ ◌̾ ◌̿
U+0340: Combining Diacritical Marks ◌̀ ◌́ ◌͂ ◌̓ ◌̈́ ◌ͅ ◌͆ ◌͇ ◌͈ ◌͉ ◌͊ ◌͋ ◌͌ ◌͍ ◌͎ ◌͏ ◌͐ ◌͑ ◌͒ ◌͓ ◌͔ ◌͕ ◌͖ ◌͗ ◌͘ ◌͙ ◌͚ ◌͛ ◌͜ ◌͝ ◌͞ ◌͟
U+0360: Combining Diacritical Marks ◌͠ ◌͡ ◌͢ ◌ͣ ◌ͤ ◌ͥ ◌ͦ ◌ͧ ◌ͨ ◌ͩ ◌ͪ ◌ͫ ◌ͬ ◌ͭ ◌ͮ ◌ͯ Ͱ ͱ Ͳ ͳ ʹ ͵ Ͷ ͷ ͸ ͹ ͺ ͻ ͼ ͽ ; Ϳ
U+0380: Greek alphabet ΀ ΁ ΂ ΃ ◌΄ ◌΅ Ά · Έ Ή Ί ΋ Ό ΍ Ύ Ώ ΐ Α Β Γ Δ Ε Ζ Η Θ Ι Κ Λ Μ Ν Ξ Ο
U+03a0: Greek alphabet Π Ρ ΢ Σ Τ Υ Φ Χ Ψ Ω Ϊ Ϋ ά έ ή ί ΰ α β γ δ ε ζ η θ ι κ λ μ ν ξ ο
U+03c0: Greek alphabet π ρ ς σ τ υ φ χ ψ ω ϊ ϋ ό ύ ώ Ϗ ϐ ϑ ϒ ϓ ϔ ϕ ϖ ϗ Ϙ ϙ Ϛ ϛ Ϝ ϝ Ϟ ϟ
U+03e0: Greek alphabet Ϡ ϡ Ϣ ϣ Ϥ ϥ Ϧ ϧ Ϩ ϩ Ϫ ϫ Ϭ ϭ Ϯ ϯ ϰ ϱ ϲ ϳ ϴ ϵ ϶ Ϸ ϸ Ϲ Ϻ ϻ ϼ Ͻ Ͼ Ͽ
U+0400: Cyrillic and Armenian scripts
U+0590: Right-to-left scripts Unfinished row
U+0900: Left-to-right scripts again, and some extensions
U+2000: General Punctuation
U+2020: General Punctuation
U+2040: General Punctuation
U+2060: USS
U+2080: USS
U+20a0: Currency
U+20c0: Currency
U+20e0: Currency
U+2100: Letterlike Symbols
U+2120: Letterlike Symbols Ω K Å
U+2140: Letterlike Symbols
U+2160: Roman numerals
U+2180: Number Forms (skipped)
U+2190: Arrows
U+21a0: Arrows
U+21c0: Arrows
U+21e0: Arrows
U+2200: Mathematical Operators
U+2220: Mathematical Operators
U+2240: Mathematical Operators
U+2260: Mathematical Operators
U+2280: Mathematical Operators
U+22a0: Mathematical Operators
U+22c0: Mathematical Operators
U+22e0: Mathematical Operators
U+2300: Miscellaneous Technical
U+2320: Miscellaneous Technical
U+2340: Miscellaneous Technical
U+2360: Miscellaneous Technical
U+2380: Miscellaneous Technical
U+23a0: Miscellaneous Technical
U+23c0: Miscellaneous Technical
U+23e0: Miscellaneous Technical
U+2400: Control Pictures
U+2420: Control Pictures
U+2440: Optical Character Recognition (little use in texts)
U+2460: Enclosed Alphanumerics (little use in texts)
U+2500: Box-drawing characters
U+2520: Box-drawing characters
U+2540: Box-drawing characters
U+2560: Block Elements
U+2580: Geometric Shapes
U+25a0: Geometric Shapes
U+25c0: Geometric Shapes
U+25e0: Geometric Shapes
U+2600: Miscellaneous Symbols
U+2620: Miscellaneous Symbols
U+2640: Miscellaneous Symbols
U+2660: Miscellaneous Symbols
U+2680: Miscellaneous Symbols
U+26a0: Miscellaneous Symbols (skipped)
U+2700: Unicode Dingbats
U+27c0: Miscellaneous Mathematical Symbols-A
U+27e0: Miscellaneous Mathematical Symbols-A
U+2800: A heap of various characters
U+2e80: East Asian scripts and symbols; the majority of BMP characters
U+a500: Miscellaneous scripts
U+d800: UTF-16 Surrogates (non-characters)
U+e000: Private Use Area (little use in texts)
U+f900: Miscellaneous rubbish
U+ff00: Fullwidth forms
U+ff20: Fullwidth forms _
U+ff40: Fullwidth forms
U+ff60: Some alien rubbish from the Halfwidth and Fullwidth Forms block
U+fff0: Specials                      
U+10000: Astral characters start here 𐀀 𐀁 𐀂 𐀃 𐀄 𐀅 𐀆 𐀇 𐀈 𐀉 𐀊 𐀋 𐀌 𐀍 𐀎 𐀏 𐀐 𐀑 𐀒 𐀓 𐀔 𐀕 𐀖 𐀗 𐀘 𐀙 𐀚 𐀛 𐀜 𐀝 𐀞 𐀟

Arithmetic[edit]

Addition: “+”.

Subtraction and opposite/negative: “−” (&minus;) in text, “-” in arguments of templates.

Multiplication: “×” (&times;), “⋅” (&sdot;); no substitutes shall be used.

Division: “/” (usually), “ ⁄ ” (virtually unused), “ ∕ ” (sometimes).

Specific symbols[edit]

Asterisk[edit]

Not a multiplication sign (except in source code in some languages).

Hyphen-minus[edit]

Dual use: as a hyphen in the text and minus sign in certain wiki contexts.

Slash[edit]

Dual use: as a punctuation and as a #division sign.

Degree sign[edit]

“°” is a separate character! Shall never be substituted with “0” or “o” in superscript!

Dashes[edit]

Several flavours of, but – is the preferred.

Quotation marks[edit]

All but “ " ” and “ ' ” are frowned upon. Also, “ ’ ” is sometimes mistakenly used as a substitute for prime (′) and vice versa.

Junk characters[edit]

Standalone “accents”[edit]

Both “ ´ ” and “ ` ” are virtually useless without the overstrike feature, but are sometimes mistakenly used instead of quotation marks. Though, “`” is an ASCII character and is actually used in some scripting languages.

Superscripts and fractions[edit]

Unicode superscripts and precomposed fractions are generally discouraged. Though, they may be used in edit summaries where text formatting is unavailable.

Ordinal indicators[edit]

“º” and “ª” are useless is English language, but the former is sometimes mistakenly used as a substitute for #Degree sign.

Arrows[edit]

Left, right, and left–right arrows are very common in Wikipedia. The “⇌” symbols is used for reactions, but is rendered as image instead of the character: see {{eqm}}.