Wikipedia:Naming conventions (Unicode) (draft)

From Wikipedia, the free encyclopedia
Jump to: navigation, search

Right now this page just contains info on special characters that are likely to be interesting/usefull for en. Hopefully it can be expanded into a policy on their use.

Unicode provides an international standard which has the goal of providing the means to encode the text of every document people want to store on computers. This includes all scripts in active use today, many scripts known only by scholars, and symbols which do not strictly represent scripts, like mathematical, linguistic and APL symbols.

ASCII[edit]

C0 Controls and Basic Latin[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+000x  NUL   SOH   STX   ETX   EOT   ENQ   ACK   BEL    BS     HT     LF     VT     FF     CR     SO     SI  
U+001x  DLE   DC1   DC2   DC3   DC4   NAK   SYN   ETB   CAN    EM     SUB     ESC     FS     GS     RS     US  
U+002x   SP    ! " # $  % & ' ( ) * + , - . /
U+003x 0 1 2 3 4 5 6 7 8 9  :  ; < = >  ?
U+004x @ A B C D E F G H I J K L M N O
U+005x P Q R S T U V W X Y Z [ \ ] ^ _
U+006x ` a b c d e f g h i j k l m n o
U+007x p q r s t u v w x y z { | } ~  DEL 
Notes
1.^ As of Unicode version 7.0

These are our bread and butter and have been dealt with exhaustively elsewhere. Plugwash 28 June 2005 20:37 (UTC)

Latin-1[edit]

C1 Controls and Latin-1 Supplement[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+008x  XXX   XXX   BPH   NBH   IND   NEL   SSA   ESA   HTS   HTJ   VTS   PLD   PLU     RI     SS2   SS3 
U+009x  DCS   PU1   PU2   STS   CCH     MW     SPA   EPA   SOS   XXX   SCI   CSI     ST     OSC     PM     APC 
U+00Ax NB
  SP  
¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬  SHY 
­
® ¯
U+00Bx ° ± ² ³ ´ µ · ¸ ¹ º » ¼ ½ ¾ ¿
U+00Cx À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï
U+00Dx Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß
U+00Ex à á â ã ä å æ ç è é ê ë ì í î ï
U+00Fx ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ
Notes
1.^ As of Unicode version 7.0

Same as above pretty much Plugwash 28 June 2005 20:41 (UTC)

The degree sign and the masculine ordinal indicator are often conflated, and are sometimes misused in place of the raised "o" in approximations of the numero sign. The feminine ordinal indicator is also occasionally misused as a superscript "1". The passability of such approximations is predicated on the output media being visual (screen, print) and the characteristics of certain fonts at certain sizes. In other contexts, the semantics of the text are changed when the wrong characters are used. Therefore, these characters should only ever be used for their intended purposes: use the degree sign to mean degrees, and the ordinal indicators as appropriate for the languages that need them.

Latin extended A[edit]

Default font   Unicode font
Latin Extended-A[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+010x Ā ā Ă ă Ą ą Ć ć Ĉ ĉ Ċ ċ Č č Ď ď
U+011x Đ đ Ē ē Ĕ ĕ Ė ė Ę ę Ě ě Ĝ ĝ Ğ ğ
U+012x Ġ ġ Ģ ģ Ĥ ĥ Ħ ħ Ĩ ĩ Ī ī Ĭ ĭ Į į
U+013x İ ı IJ ij Ĵ ĵ Ķ ķ ĸ Ĺ ĺ Ļ ļ Ľ ľ Ŀ
U+014x ŀ Ł ł Ń ń Ņ ņ Ň ň ʼn Ŋ ŋ Ō ō Ŏ ŏ
U+015x Ő ő Œ œ Ŕ ŕ Ŗ ŗ Ř ř Ś ś Ŝ ŝ Ş ş
U+016x Š š Ţ ţ Ť ť Ŧ ŧ Ũ ũ Ū ū Ŭ ŭ Ů ů
U+017x Ű ű Ų ų Ŵ ŵ Ŷ ŷ Ÿ Ź ź Ż ż Ž ž ſ
Notes
1.^ As of Unicode version 7.0
2.^ Unicode code point U+0149 is deprecated as of Unicode version 5.2
 
Latin Extended-A[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+010x Ā ā Ă ă Ą ą Ć ć Ĉ ĉ Ċ ċ Č č Ď ď
U+011x Đ đ Ē ē Ĕ ĕ Ė ė Ę ę Ě ě Ĝ ĝ Ğ ğ
U+012x Ġ ġ Ģ ģ Ĥ ĥ Ħ ħ Ĩ ĩ Ī ī Ĭ ĭ Į į
U+013x İ ı IJ ij Ĵ ĵ Ķ ķ ĸ Ĺ ĺ Ļ ļ Ľ ľ Ŀ
U+014x ŀ Ł ł Ń ń Ņ ņ Ň ň ʼn Ŋ ŋ Ō ō Ŏ ŏ
U+015x Ő ő Œ œ Ŕ ŕ Ŗ ŗ Ř ř Ś ś Ŝ ŝ Ş ş
U+016x Š š Ţ ţ Ť ť Ŧ ŧ Ũ ũ Ū ū Ŭ ŭ Ů ů
U+017x Ű ű Ų ų Ŵ ŵ Ŷ ŷ Ÿ Ź ź Ż ż Ž ž ſ
Notes
1.^ As of Unicode version 7.0
2.^ Unicode code point U+0149 is deprecated as of Unicode version 5.2

This is mostly letters with less common diacritics. One or two of these are already in article titles due to conversions from windows-1252 (most browsers interpret iso-8859-1 as windows-1252 and so that stuff got into article names here). The extra diacritics are probably a good thing as long as redirects are in place from the diacritic-less names and these are likely to be widely supported so I think it's pretty safe to have them in article titles. Plugwash 28 June 2005 20:47 (UTC)

Example: Paul Erdős. Arbor 6 July 2005 12:59 (UTC)

It is erroneous to refer to these as "less common" diacritics. Many of these are diacritics used for Eastern European languages, including Polish, Czech, Slovakian, Croatian, Slovenian, Serbian in Latin script, Hungarian, Turkish, Romanian, Latvian, Lithuanian, etc. and since the introduction of MediaWiki 1.5 they are already extensively used in article titles. The fact that Latin 1 is limited to Western European languages is just an artifact of the Cold War. -- Curps 15:05, 2 August 2005 (UTC)

Latin extended B[edit]

Default font   Unicode font
Latin Extended-B[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+018x ƀ Ɓ Ƃ ƃ Ƅ ƅ Ɔ Ƈ ƈ Ɖ Ɗ Ƌ ƌ ƍ Ǝ Ə
U+019x Ɛ Ƒ ƒ Ɠ Ɣ ƕ Ɩ Ɨ Ƙ ƙ ƚ ƛ Ɯ Ɲ ƞ Ɵ
U+01Ax Ơ ơ Ƣ ƣ Ƥ ƥ Ʀ Ƨ ƨ Ʃ ƪ ƫ Ƭ ƭ Ʈ Ư
U+01Bx ư Ʊ Ʋ Ƴ ƴ Ƶ ƶ Ʒ Ƹ ƹ ƺ ƻ Ƽ ƽ ƾ ƿ
U+01Cx ǀ ǁ ǂ ǃ DŽ Dž dž LJ Lj lj NJ Nj nj Ǎ ǎ Ǐ
U+01Dx ǐ Ǒ ǒ Ǔ ǔ Ǖ ǖ Ǘ ǘ Ǚ ǚ Ǜ ǜ ǝ Ǟ ǟ
U+01Ex Ǡ ǡ Ǣ ǣ Ǥ ǥ Ǧ ǧ Ǩ ǩ Ǫ ǫ Ǭ ǭ Ǯ ǯ
U+01Fx ǰ DZ Dz dz Ǵ ǵ Ƕ Ƿ Ǹ ǹ Ǻ ǻ Ǽ ǽ Ǿ ǿ
U+020x Ȁ ȁ Ȃ ȃ Ȅ ȅ Ȇ ȇ Ȉ ȉ Ȋ ȋ Ȍ ȍ Ȏ ȏ
U+021x Ȑ ȑ Ȓ ȓ Ȕ ȕ Ȗ ȗ Ș ș Ț ț Ȝ ȝ Ȟ ȟ
U+022x Ƞ ȡ Ȣ ȣ Ȥ ȥ Ȧ ȧ Ȩ ȩ Ȫ ȫ Ȭ ȭ Ȯ ȯ
U+023x Ȱ ȱ Ȳ ȳ ȴ ȵ ȶ ȷ ȸ ȹ Ⱥ Ȼ ȼ Ƚ Ⱦ ȿ
U+024x ɀ Ɂ ɂ Ƀ Ʉ Ʌ Ɇ ɇ Ɉ ɉ Ɋ ɋ Ɍ ɍ Ɏ ɏ
Notes
1.^ As of Unicode version 7.0
 
Latin Extended-B[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+018x ƀ Ɓ Ƃ ƃ Ƅ ƅ Ɔ Ƈ ƈ Ɖ Ɗ Ƌ ƌ ƍ Ǝ Ə
U+019x Ɛ Ƒ ƒ Ɠ Ɣ ƕ Ɩ Ɨ Ƙ ƙ ƚ ƛ Ɯ Ɲ ƞ Ɵ
U+01Ax Ơ ơ Ƣ ƣ Ƥ ƥ Ʀ Ƨ ƨ Ʃ ƪ ƫ Ƭ ƭ Ʈ Ư
U+01Bx ư Ʊ Ʋ Ƴ ƴ Ƶ ƶ Ʒ Ƹ ƹ ƺ ƻ Ƽ ƽ ƾ ƿ
U+01Cx ǀ ǁ ǂ ǃ DŽ Dž dž LJ Lj lj NJ Nj nj Ǎ ǎ Ǐ
U+01Dx ǐ Ǒ ǒ Ǔ ǔ Ǖ ǖ Ǘ ǘ Ǚ ǚ Ǜ ǜ ǝ Ǟ ǟ
U+01Ex Ǡ ǡ Ǣ ǣ Ǥ ǥ Ǧ ǧ Ǩ ǩ Ǫ ǫ Ǭ ǭ Ǯ ǯ
U+01Fx ǰ DZ Dz dz Ǵ ǵ Ƕ Ƿ Ǹ ǹ Ǻ ǻ Ǽ ǽ Ǿ ǿ
U+020x Ȁ ȁ Ȃ ȃ Ȅ ȅ Ȇ ȇ Ȉ ȉ Ȋ ȋ Ȍ ȍ Ȏ ȏ
U+021x Ȑ ȑ Ȓ ȓ Ȕ ȕ Ȗ ȗ Ș ș Ț ț Ȝ ȝ Ȟ ȟ
U+022x Ƞ ȡ Ȣ ȣ Ȥ ȥ Ȧ ȧ Ȩ ȩ Ȫ ȫ Ȭ ȭ Ȯ ȯ
U+023x Ȱ ȱ Ȳ ȳ ȴ ȵ ȶ ȷ ȸ ȹ Ⱥ Ȼ ȼ Ƚ Ⱦ ȿ
U+024x ɀ Ɂ ɂ Ƀ Ʉ Ʌ Ɇ ɇ Ɉ ɉ Ɋ ɋ Ɍ ɍ Ɏ ɏ
Notes
1.^ As of Unicode version 7.0

Here, U+018F is uppercase schwa, used in Azerbaijani. Note the lowercase schwa is U+0259 in the IPA section; the character U+01DD is "Latin small letter turned e", whose uppercase is U+018E, which is used in pan-Nigerian alphabets.

Also, U+01A0, U+01A1 is "o with horn" and U+01AF, U+01B0 is "u with horn", used in Vietnamese.

Also, U+01CD through U+01DC are a, i, o, u, u-umlaut with caron, used in Chinese pinyin for the third tone. Notice that Ě/ě (e with caron) is U+011A/U+011B and is in Latin extended A, since it is also a letter of Czech alphabet.

Also, U+0218 through U+021B are "s with comma below" and "t with comma below", used in Romanian. Fonts are sometimes not available for these, so "s with cedilla" and "t with cedilla" from the Latin extended A section are sometimes used instead of these. See Special Romanian Unicode characters -- Curps 21:10, 24 August 2005 (UTC)

out of interest do you know if the romanian wp has a policy on using comma below or cedilla? Plugwash 22:46, 24 August 2005 (UTC)
I'm pretty sure they use the cedilla forms, simply because my browser can't display the comma versions at Special Romanian Unicode characters and when I go to the http://ro.wikipedia.org/ I can see the letters. The comma versions only display in the above table because the whole table is set up to use special Unicode fonts rather than the default fonts. I edited the above table to show the default-font version alongside the Unicode font version. -- Curps 03:39, 25 August 2005 (UTC)

Latin extended additional[edit]

Default font   Unicode font
Latin Extended Additional[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+1E0x
U+1E1x
U+1E2x
U+1E3x ḿ
U+1E4x
U+1E5x
U+1E6x
U+1E7x ṿ
U+1E8x
U+1E9x
U+1EAx
U+1EBx ế
U+1ECx
U+1EDx
U+1EEx
U+1EFx ỿ
Notes
1.^ As of Unicode version 7.0
 
Latin Extended Additional[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+1E0x
U+1E1x
U+1E2x
U+1E3x ḿ
U+1E4x
U+1E5x
U+1E6x
U+1E7x ṿ
U+1E8x
U+1E9x
U+1EAx
U+1EBx ế
U+1ECx
U+1EDx
U+1EEx
U+1EFx ỿ
Notes
1.^ As of Unicode version 7.0

The range U+01EA0 through U+01EF9 is used for Vietnamese. -- Curps 21:10, 24 August 2005 (UTC)

IPA extensions[edit]

Default font Unicode font IPA Font (for comparison purposes only)
IPA Extensions[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+025x ɐ ɑ ɒ ɓ ɔ ɕ ɖ ɗ ɘ ə ɚ ɛ ɜ ɝ ɞ ɟ
U+026x ɠ ɡ ɢ ɣ ɤ ɥ ɦ ɧ ɨ ɩ ɪ ɫ ɬ ɭ ɮ ɯ
U+027x ɰ ɱ ɲ ɳ ɴ ɵ ɶ ɷ ɸ ɹ ɺ ɻ ɼ ɽ ɾ ɿ
U+028x ʀ ʁ ʂ ʃ ʄ ʅ ʆ ʇ ʈ ʉ ʊ ʋ ʌ ʍ ʎ ʏ
U+029x ʐ ʑ ʒ ʓ ʔ ʕ ʖ ʗ ʘ ʙ ʚ ʛ ʜ ʝ ʞ ʟ
U+02Ax ʠ ʡ ʢ ʣ ʤ ʥ ʦ ʧ ʨ ʩ ʪ ʫ ʬ ʭ ʮ ʯ
Notes
1.^ As of Unicode version 7.0
IPA Extensions[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+025x ɐ ɑ ɒ ɓ ɔ ɕ ɖ ɗ ɘ ə ɚ ɛ ɜ ɝ ɞ ɟ
U+026x ɠ ɡ ɢ ɣ ɤ ɥ ɦ ɧ ɨ ɩ ɪ ɫ ɬ ɭ ɮ ɯ
U+027x ɰ ɱ ɲ ɳ ɴ ɵ ɶ ɷ ɸ ɹ ɺ ɻ ɼ ɽ ɾ ɿ
U+028x ʀ ʁ ʂ ʃ ʄ ʅ ʆ ʇ ʈ ʉ ʊ ʋ ʌ ʍ ʎ ʏ
U+029x ʐ ʑ ʒ ʓ ʔ ʕ ʖ ʗ ʘ ʙ ʚ ʛ ʜ ʝ ʞ ʟ
U+02Ax ʠ ʡ ʢ ʣ ʤ ʥ ʦ ʧ ʨ ʩ ʪ ʫ ʬ ʭ ʮ ʯ
Notes
1.^ As of Unicode version 7.0
IPA Extensions[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+025x ɐ ɑ ɒ ɓ ɔ ɕ ɖ ɗ ɘ ə ɚ ɛ ɜ ɝ ɞ ɟ
U+026x ɠ ɡ ɢ ɣ ɤ ɥ ɦ ɧ ɨ ɩ ɪ ɫ ɬ ɭ ɮ ɯ
U+027x ɰ ɱ ɲ ɳ ɴ ɵ ɶ ɷ ɸ ɹ ɺ ɻ ɼ ɽ ɾ ɿ
U+028x ʀ ʁ ʂ ʃ ʄ ʅ ʆ ʇ ʈ ʉ ʊ ʋ ʌ ʍ ʎ ʏ
U+029x ʐ ʑ ʒ ʓ ʔ ʕ ʖ ʗ ʘ ʙ ʚ ʛ ʜ ʝ ʞ ʟ
U+02Ax ʠ ʡ ʢ ʣ ʤ ʥ ʦ ʧ ʨ ʩ ʪ ʫ ʬ ʭ ʮ ʯ
Notes
1.^ As of Unicode version 7.0

Used for IPA in body text (with a special template to persuade IE to render them right) probably not appropriate for article titles here. Plugwash 28 June 2005 20:48 (UTC)

Note however that schwa (U+0259) is also a letter in the Azerbaijani alphabet. Lowercase schwa is in the IPA section, but uppercase (U+018F) is in the Latin B section. -- Curps 15:05, 2 August 2005 (UTC)

Spacing modifier letters[edit]

Default font   Unicode font
Spacing Modifier Letters[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+02Bx ʰ ʱ ʲ ʳ ʴ ʵ ʶ ʷ ʸ ʹ ʺ ʻ ʼ ʽ ʾ ʿ
U+02Cx ˀ ˁ ˂ ˃ ˄ ˅ ˆ ˇ ˈ ˉ ˊ ˋ ˌ ˍ ˎ ˏ
U+02Dx ː ˑ ˒ ˓ ˔ ˕ ˖ ˗ ˘ ˙ ˚ ˛ ˜ ˝ ˞ ˟
U+02Ex ˠ ˡ ˢ ˣ ˤ ˥ ˦ ˧ ˨ ˩ ˪ ˫ ˬ ˭ ˮ ˯
U+02Fx ˰ ˱ ˲ ˳ ˴ ˵ ˶ ˷ ˸ ˹ ˺ ˻ ˼ ˽ ˾ ˿
Notes
1.^ As of Unicode version 7.0
 
Spacing Modifier Letters[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+02Bx ʰ ʱ ʲ ʳ ʴ ʵ ʶ ʷ ʸ ʹ ʺ ʻ ʼ ʽ ʾ ʿ
U+02Cx ˀ ˁ ˂ ˃ ˄ ˅ ˆ ˇ ˈ ˉ ˊ ˋ ˌ ˍ ˎ ˏ
U+02Dx ː ˑ ˒ ˓ ˔ ˕ ˖ ˗ ˘ ˙ ˚ ˛ ˜ ˝ ˞ ˟
U+02Ex ˠ ˡ ˢ ˣ ˤ ˥ ˦ ˧ ˨ ˩ ˪ ˫ ˬ ˭ ˮ ˯
U+02Fx ˰ ˱ ˲ ˳ ˴ ˵ ˶ ˷ ˸ ˹ ˺ ˻ ˼ ˽ ˾ ˿
Notes
1.^ As of Unicode version 7.0

I am not formatting this correctly, but there is at least one interesting item in this code block:

  • MODIFIER LETTER TURNED COMMA (U+02BB): used in Hawaiian, where it is called ʻOkina
Unfortunately, this character does not seem to be available in either the default or Unicode fonts on Windows. -- Curps 21:38, 4 September 2005 (UTC)
Works fine on mine, although I couldn't tell you precisely which font is being so obliging. —Phil | Talk 14:14, 14 October 2005 (UTC)

Combining diacritical marks[edit]

Default font   Unicode font
Combining Diacritical Marks[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+030x  ̀  ́  ̂  ̃  ̄  ̅  ̆  ̇  ̈  ̉  ̊  ̋  ̌  ̍  ̎  ̏
U+031x  ̐  ̑  ̒  ̓  ̔  ̕  ̖  ̗  ̘  ̙  ̚  ̛  ̜  ̝  ̞  ̟
U+032x  ̠  ̡  ̢  ̣  ̤  ̥  ̦  ̧  ̨  ̩  ̪  ̫  ̬  ̭  ̮  ̯
U+033x  ̰  ̱  ̲  ̳  ̴  ̵  ̶  ̷  ̸  ̹  ̺  ̻  ̼  ̽  ̾  ̿
U+034x  ̀  ́  ͂  ̓  ̈́  ͅ  ͆  ͇  ͈  ͉  ͊  ͋  ͌  ͍  ͎  CGJ 
U+035x  ͐  ͑  ͒  ͓  ͔  ͕  ͖  ͗  ͘  ͙  ͚  ͛  ͜   ͝   ͞   ͟ 
U+036x  ͠   ͡   ͢   ͣ  ͤ  ͥ  ͦ  ͧ  ͨ  ͩ  ͪ  ͫ  ͬ  ͭ  ͮ  ͯ
Notes
1.^ As of Unicode version 7.0
 
Combining Diacritical Marks[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+030x  ̀  ́  ̂  ̃  ̄  ̅  ̆  ̇  ̈  ̉  ̊  ̋  ̌  ̍  ̎  ̏
U+031x  ̐  ̑  ̒  ̓  ̔  ̕  ̖  ̗  ̘  ̙  ̚  ̛  ̜  ̝  ̞  ̟
U+032x  ̠  ̡  ̢  ̣  ̤  ̥  ̦  ̧  ̨  ̩  ̪  ̫  ̬  ̭  ̮  ̯
U+033x  ̰  ̱  ̲  ̳  ̴  ̵  ̶  ̷  ̸  ̹  ̺  ̻  ̼  ̽  ̾  ̿
U+034x  ̀  ́  ͂  ̓  ̈́  ͅ  ͆  ͇  ͈  ͉  ͊  ͋  ͌  ͍  ͎  CGJ 
U+035x  ͐  ͑  ͒  ͓  ͔  ͕  ͖  ͗  ͘  ͙  ͚  ͛  ͜   ͝   ͞   ͟ 
U+036x  ͠   ͡   ͢   ͣ  ͤ  ͥ  ͦ  ͧ  ͨ  ͩ  ͪ  ͫ  ͬ  ͭ  ͮ  ͯ
Notes
1.^ As of Unicode version 7.0

Notice how the marks show up now they have something to combine with… —Phil | Talk 10:49, 24 October 2005 (UTC)

Greek[edit]

Default font Unicode font Polytonic font (for comparison purposes)
Greek and Coptic[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+037x Ͱ ͱ Ͳ ͳ ʹ ͵ Ͷ ͷ ͺ ͻ ͼ ͽ ; Ϳ
U+038x ΄ ΅ Ά · Έ Ή Ί Ό Ύ Ώ
U+039x ΐ Α Β Γ Δ Ε Ζ Η Θ Ι Κ Λ Μ Ν Ξ Ο
U+03Ax Π Ρ Σ Τ Υ Φ Χ Ψ Ω Ϊ Ϋ ά έ ή ί
U+03Bx ΰ α β γ δ ε ζ η θ ι κ λ μ ν ξ ο
U+03Cx π ρ ς σ τ υ φ χ ψ ω ϊ ϋ ό ύ ώ Ϗ
U+03Dx ϐ ϑ ϒ ϓ ϔ ϕ ϖ ϗ Ϙ ϙ Ϛ ϛ Ϝ ϝ Ϟ ϟ
U+03Ex Ϡ ϡ Ϣ ϣ Ϥ ϥ Ϧ ϧ Ϩ ϩ Ϫ ϫ Ϭ ϭ Ϯ ϯ
U+03Fx ϰ ϱ ϲ ϳ ϴ ϵ ϶ Ϸ ϸ Ϲ Ϻ ϻ ϼ Ͻ Ͼ Ͽ
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points
Greek and Coptic[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+037x Ͱ ͱ Ͳ ͳ ʹ ͵ Ͷ ͷ ͺ ͻ ͼ ͽ ; Ϳ
U+038x ΄ ΅ Ά · Έ Ή Ί Ό Ύ Ώ
U+039x ΐ Α Β Γ Δ Ε Ζ Η Θ Ι Κ Λ Μ Ν Ξ Ο
U+03Ax Π Ρ Σ Τ Υ Φ Χ Ψ Ω Ϊ Ϋ ά έ ή ί
U+03Bx ΰ α β γ δ ε ζ η θ ι κ λ μ ν ξ ο
U+03Cx π ρ ς σ τ υ φ χ ψ ω ϊ ϋ ό ύ ώ Ϗ
U+03Dx ϐ ϑ ϒ ϓ ϔ ϕ ϖ ϗ Ϙ ϙ Ϛ ϛ Ϝ ϝ Ϟ ϟ
U+03Ex Ϡ ϡ Ϣ ϣ Ϥ ϥ Ϧ ϧ Ϩ ϩ Ϫ ϫ Ϭ ϭ Ϯ ϯ
U+03Fx ϰ ϱ ϲ ϳ ϴ ϵ ϶ Ϸ ϸ Ϲ Ϻ ϻ ϼ Ͻ Ͼ Ͽ
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points
Greek and Coptic[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+037x Ͱ ͱ Ͳ ͳ ʹ ͵ Ͷ ͷ ͺ ͻ ͼ ͽ ; Ϳ
U+038x ΄ ΅ Ά · Έ Ή Ί Ό Ύ Ώ
U+039x ΐ Α Β Γ Δ Ε Ζ Η Θ Ι Κ Λ Μ Ν Ξ Ο
U+03Ax Π Ρ Σ Τ Υ Φ Χ Ψ Ω Ϊ Ϋ ά έ ή ί
U+03Bx ΰ α β γ δ ε ζ η θ ι κ λ μ ν ξ ο
U+03Cx π ρ ς σ τ υ φ χ ψ ω ϊ ϋ ό ύ ώ Ϗ
U+03Dx ϐ ϑ ϒ ϓ ϔ ϕ ϖ ϗ Ϙ ϙ Ϛ ϛ Ϝ ϝ Ϟ ϟ
U+03Ex Ϡ ϡ Ϣ ϣ Ϥ ϥ Ϧ ϧ Ϩ ϩ Ϫ ϫ Ϭ ϭ Ϯ ϯ
U+03Fx ϰ ϱ ϲ ϳ ϴ ϵ ϶ Ϸ ϸ Ϲ Ϻ ϻ ϼ Ͻ Ͼ Ͽ
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points

Already used heavily for math type stuff but probably not too appropriate for article titles in English. Plugwash 28 June 2005 20:38 (UTC)

Well, here are some English articles that could use those letters in the title. Some of these articles claim to have the “wrong title due to technical limitations”: Pi, C omega, Omega constant, Chi-squared distribution, Gamma function, Cronbach's alpha, Beta particle, Beta distribution, and many, many more. Arbor 6 July 2005 19:07 (UTC)

Also some star names such as α And as a redirect for Alpha Andromedae. But here we run into the "initial letter is capitalized" issue, which applies to Greek letters too. -- Curps 04:44, 25 August 2005 (UTC)

Greek extended[edit]

Default font Unicode font Polytonic font (for comparison purposes)
Greek Extended[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+1F0x
U+1F1x
U+1F2x
U+1F3x Ἷ
U+1F4x
U+1F5x
U+1F6x
U+1F7x
U+1F8x
U+1F9x
U+1FAx
U+1FBx ᾿
U+1FCx
U+1FDx
U+1FEx
U+1FFx
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points
Greek Extended[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+1F0x
U+1F1x
U+1F2x
U+1F3x Ἷ
U+1F4x
U+1F5x
U+1F6x
U+1F7x
U+1F8x
U+1F9x
U+1FAx
U+1FBx ᾿
U+1FCx
U+1FDx
U+1FEx
U+1FFx
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points
Greek Extended[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+1F0x
U+1F1x
U+1F2x
U+1F3x Ἷ
U+1F4x
U+1F5x
U+1F6x
U+1F7x
U+1F8x
U+1F9x
U+1FAx
U+1FBx ᾿
U+1FCx
U+1FDx
U+1FEx
U+1FFx
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points

Cyrillic[edit]

Default font   Unicode font
Cyrillic[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+040x Ѐ Ё Ђ Ѓ Є Ѕ І Ї Ј Љ Њ Ћ Ќ Ѝ Ў Џ
U+041x А Б В Г Д Е Ж З И Й К Л М Н О П
U+042x Р С Т У Ф Х Ц Ч Ш Щ Ъ Ы Ь Э Ю Я
U+043x а б в г д е ж з и й к л м н о п
U+044x р с т у ф х ц ч ш щ ъ ы ь э ю я
U+045x ѐ ё ђ ѓ є ѕ і ї ј љ њ ћ ќ ѝ ў џ
U+046x Ѡ ѡ Ѣ ѣ Ѥ ѥ Ѧ ѧ Ѩ ѩ Ѫ ѫ Ѭ ѭ Ѯ ѯ
U+047x Ѱ ѱ Ѳ ѳ Ѵ ѵ Ѷ ѷ Ѹ ѹ Ѻ ѻ Ѽ ѽ Ѿ ѿ
U+048x Ҁ ҁ ҂  ҃  ҄  ҅  ҆  ҇  ҈  ҉ Ҋ ҋ Ҍ ҍ Ҏ ҏ
U+049x Ґ ґ Ғ ғ Ҕ ҕ Җ җ Ҙ ҙ Қ қ Ҝ ҝ Ҟ ҟ
U+04Ax Ҡ ҡ Ң ң Ҥ ҥ Ҧ ҧ Ҩ ҩ Ҫ ҫ Ҭ ҭ Ү ү
U+04Bx Ұ ұ Ҳ ҳ Ҵ ҵ Ҷ ҷ Ҹ ҹ Һ һ Ҽ ҽ Ҿ ҿ
U+04Cx Ӏ Ӂ ӂ Ӄ ӄ Ӆ ӆ Ӈ ӈ Ӊ ӊ Ӌ ӌ Ӎ ӎ ӏ
U+04Dx Ӑ ӑ Ӓ ӓ Ӕ ӕ Ӗ ӗ Ә ә Ӛ ӛ Ӝ ӝ Ӟ ӟ
U+04Ex Ӡ ӡ Ӣ ӣ Ӥ ӥ Ӧ ӧ Ө ө Ӫ ӫ Ӭ ӭ Ӯ ӯ
U+04Fx Ӱ ӱ Ӳ ӳ Ӵ ӵ Ӷ ӷ Ӹ ӹ Ӻ ӻ Ӽ ӽ Ӿ ӿ
Notes
1.^ As of Unicode version 7.0
 
Cyrillic[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+040x Ѐ Ё Ђ Ѓ Є Ѕ І Ї Ј Љ Њ Ћ Ќ Ѝ Ў Џ
U+041x А Б В Г Д Е Ж З И Й К Л М Н О П
U+042x Р С Т У Ф Х Ц Ч Ш Щ Ъ Ы Ь Э Ю Я
U+043x а б в г д е ж з и й к л м н о п
U+044x р с т у ф х ц ч ш щ ъ ы ь э ю я
U+045x ѐ ё ђ ѓ є ѕ і ї ј љ њ ћ ќ ѝ ў џ
U+046x Ѡ ѡ Ѣ ѣ Ѥ ѥ Ѧ ѧ Ѩ ѩ Ѫ ѫ Ѭ ѭ Ѯ ѯ
U+047x Ѱ ѱ Ѳ ѳ Ѵ ѵ Ѷ ѷ Ѹ ѹ Ѻ ѻ Ѽ ѽ Ѿ ѿ
U+048x Ҁ ҁ ҂  ҃  ҄  ҅  ҆  ҇  ҈  ҉ Ҋ ҋ Ҍ ҍ Ҏ ҏ
U+049x Ґ ґ Ғ ғ Ҕ ҕ Җ җ Ҙ ҙ Қ қ Ҝ ҝ Ҟ ҟ
U+04Ax Ҡ ҡ Ң ң Ҥ ҥ Ҧ ҧ Ҩ ҩ Ҫ ҫ Ҭ ҭ Ү ү
U+04Bx Ұ ұ Ҳ ҳ Ҵ ҵ Ҷ ҷ Ҹ ҹ Һ һ Ҽ ҽ Ҿ ҿ
U+04Cx Ӏ Ӂ ӂ Ӄ ӄ Ӆ ӆ Ӈ ӈ Ӊ ӊ Ӌ ӌ Ӎ ӎ ӏ
U+04Dx Ӑ ӑ Ӓ ӓ Ӕ ӕ Ӗ ӗ Ә ә Ӛ ӛ Ӝ ӝ Ӟ ӟ
U+04Ex Ӡ ӡ Ӣ ӣ Ӥ ӥ Ӧ ӧ Ө ө Ӫ ӫ Ӭ ӭ Ӯ ӯ
U+04Fx Ӱ ӱ Ӳ ӳ Ӵ ӵ Ӷ ӷ Ӹ ӹ Ӻ ӻ Ӽ ӽ Ӿ ӿ
Notes
1.^ As of Unicode version 7.0

Cyrillic supplement[edit]

Default font   Unicode font
Cyrillic Supplement[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+050x Ԁ ԁ Ԃ ԃ Ԅ ԅ Ԇ ԇ Ԉ ԉ Ԋ ԋ Ԍ ԍ Ԏ ԏ
U+051x Ԑ ԑ Ԓ ԓ Ԕ ԕ Ԗ ԗ Ԙ ԙ Ԛ ԛ Ԝ ԝ Ԟ ԟ
U+052x Ԡ ԡ Ԣ ԣ Ԥ ԥ Ԧ ԧ Ԩ ԩ Ԫ ԫ Ԭ ԭ Ԯ ԯ
Notes
1.^ As of Unicode version 7.0
 
Cyrillic Supplement[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+050x Ԁ ԁ Ԃ ԃ Ԅ ԅ Ԇ ԇ Ԉ ԉ Ԋ ԋ Ԍ ԍ Ԏ ԏ
U+051x Ԑ ԑ Ԓ ԓ Ԕ ԕ Ԗ ԗ Ԙ ԙ Ԛ ԛ Ԝ ԝ Ԟ ԟ
U+052x Ԡ ԡ Ԣ ԣ Ԥ ԥ Ԧ ԧ Ԩ ԩ Ԫ ԫ Ԭ ԭ Ԯ ԯ
Notes
1.^ As of Unicode version 7.0

Armenian[edit]

Default font   Unicode font
Armenian[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+053x Ա Բ Գ Դ Ե Զ Է Ը Թ Ժ Ի Լ Խ Ծ Կ
U+054x Հ Ձ Ղ Ճ Մ Յ Ն Շ Ո Չ Պ Ջ Ռ Ս Վ Տ
U+055x Ր Ց Ւ Փ Ք Օ Ֆ ՙ ՚ ՛ ՜ ՝ ՞ ՟
U+056x ա բ գ դ ե զ է ը թ ժ ի լ խ ծ կ
U+057x հ ձ ղ ճ մ յ ն շ ո չ պ ջ ռ ս վ տ
U+058x ր ց ւ փ ք օ ֆ և ։ ֊ ֍ ֎ ֏
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points
 
Armenian[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+053x Ա Բ Գ Դ Ե Զ Է Ը Թ Ժ Ի Լ Խ Ծ Կ
U+054x Հ Ձ Ղ Ճ Մ Յ Ն Շ Ո Չ Պ Ջ Ռ Ս Վ Տ
U+055x Ր Ց Ւ Փ Ք Օ Ֆ ՙ ՚ ՛ ՜ ՝ ՞ ՟
U+056x ա բ գ դ ե զ է ը թ ժ ի լ խ ծ կ
U+057x հ ձ ղ ճ մ յ ն շ ո չ պ ջ ռ ս վ տ
U+058x ր ց ւ փ ք օ ֆ և ։ ֊ ֍ ֎ ֏
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points

Hebrew[edit]

Default font   Unicode font
Hebrew[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+059x ֑  ֒  ֓  ֔  ֕  ֖  ֗  ֘  ֙  ֚  ֛  ֜  ֝  ֞  ֟ 
U+05Ax ֠  ֡  ֢  ֣  ֤  ֥  ֦  ֧  ֨  ֩  ֪  ֫  ֬  ֭  ֮  ֯ 
U+05Bx ְ  ֱ  ֲ  ֳ  ִ  ֵ  ֶ  ַ  ָ  ֹ  ֺ  ֻ  ּ  ֽ  ־ ֿ 
U+05Cx ׀ ׁ  ׂ  ׃ ׄ  ׅ  ׆ ׇ 
U+05Dx א ב ג ד ה ו ז ח ט י ך כ ל ם מ ן
U+05Ex נ ס ע ף פ ץ צ ק ר ש ת
U+05Fx װ ױ ײ ׳ ״
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points
 
Hebrew[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+059x ֑  ֒  ֓  ֔  ֕  ֖  ֗  ֘  ֙  ֚  ֛  ֜  ֝  ֞  ֟ 
U+05Ax ֠  ֡  ֢  ֣  ֤  ֥  ֦  ֧  ֨  ֩  ֪  ֫  ֬  ֭  ֮  ֯ 
U+05Bx ְ  ֱ  ֲ  ֳ  ִ  ֵ  ֶ  ַ  ָ  ֹ  ֺ  ֻ  ּ  ֽ  ־ ֿ 
U+05Cx ׀ ׁ  ׂ  ׃ ׄ  ׅ  ׆ ׇ 
U+05Dx א ב ג ד ה ו ז ח ט י ך כ ל ם מ ן
U+05Ex נ ס ע ף פ ץ צ ק ר ש ת
U+05Fx װ ױ ײ ׳ ״
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points

Note that the mathematical symbol for Aleph (eg Aleph-0) is in the math section at U+2135. -- Curps 21:10, 24 August 2005 (UTC)

See also #Alphabetical presentation forms for forms used by Yiddish. -- Curps 00:56, 11 September 2005 (UTC)

Arabic[edit]

Default font   Unicode font
Arabic[1][2][3]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+060x  ؀   ؁   ؂   ؃   ؄   ؅  ؆ ؇ ؈ ؉ ؊ ؋ ، ؍ ؎ ؏
U+061x ؐ ؑ ؒ ؓ ؔ ؕ ؖ ؗ ؘ ؙ ؚ ؛  ALM  ؞ ؟
U+062x ؠ ء آ أ ؤ إ ئ ا ب ة ت ث ج ح خ د
U+063x ذ ر ز س ش ص ض ط ظ ع غ ػ ؼ ؽ ؾ ؿ
U+064x ـ ف ق ك ل م ن ه و ى ي ً ٌ ٍ َ ُ
U+065x ِ ّ ْ ٓ ٔ ٕ ٖ ٗ ٘ ٙ ٚ ٛ ٜ ٝ ٞ ٟ
U+066x ٠ ١ ٢ ٣ ٤ ٥ ٦ ٧ ٨ ٩ ٪ ٫ ٬ ٭ ٮ ٯ
U+067x ٰ ٱ ٲ ٳ ٴ ٵ ٶ ٷ ٸ ٹ ٺ ٻ ټ ٽ پ ٿ
U+068x ڀ ځ ڂ ڃ ڄ څ چ ڇ ڈ ډ ڊ ڋ ڌ ڍ ڎ ڏ
U+069x ڐ ڑ ڒ ړ ڔ ڕ ږ ڗ ژ ڙ ښ ڛ ڜ ڝ ڞ ڟ
U+06Ax ڠ ڡ ڢ ڣ ڤ ڥ ڦ ڧ ڨ ک ڪ ګ ڬ ڭ ڮ گ
U+06Bx ڰ ڱ ڲ ڳ ڴ ڵ ڶ ڷ ڸ ڹ ں ڻ ڼ ڽ ھ ڿ
U+06Cx ۀ ہ ۂ ۃ ۄ ۅ ۆ ۇ ۈ ۉ ۊ ۋ ی ۍ ێ ۏ
U+06Dx ې ۑ ے ۓ ۔ ە ۖ ۗ ۘ ۙ ۚ ۛ ۜ  ۝  ۞ ۟
U+06Ex ۠ ۡ ۢ ۣ ۤ ۥ ۦ ۧ ۨ ۩ ۪ ۫ ۬ ۭ ۮ ۯ
U+06Fx ۰ ۱ ۲ ۳ ۴ ۵ ۶ ۷ ۸ ۹ ۺ ۻ ۼ ۽ ۾ ۿ
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points
3.^ Unicode code point U+0673 is deprecated as of Unicode version 6.0
 
Arabic[1][2][3]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+060x  ؀   ؁   ؂   ؃   ؄   ؅  ؆ ؇ ؈ ؉ ؊ ؋ ، ؍ ؎ ؏
U+061x ؐ ؑ ؒ ؓ ؔ ؕ ؖ ؗ ؘ ؙ ؚ ؛  ALM  ؞ ؟
U+062x ؠ ء آ أ ؤ إ ئ ا ب ة ت ث ج ح خ د
U+063x ذ ر ز س ش ص ض ط ظ ع غ ػ ؼ ؽ ؾ ؿ
U+064x ـ ف ق ك ل م ن ه و ى ي ً ٌ ٍ َ ُ
U+065x ِ ّ ْ ٓ ٔ ٕ ٖ ٗ ٘ ٙ ٚ ٛ ٜ ٝ ٞ ٟ
U+066x ٠ ١ ٢ ٣ ٤ ٥ ٦ ٧ ٨ ٩ ٪ ٫ ٬ ٭ ٮ ٯ
U+067x ٰ ٱ ٲ ٳ ٴ ٵ ٶ ٷ ٸ ٹ ٺ ٻ ټ ٽ پ ٿ
U+068x ڀ ځ ڂ ڃ ڄ څ چ ڇ ڈ ډ ڊ ڋ ڌ ڍ ڎ ڏ
U+069x ڐ ڑ ڒ ړ ڔ ڕ ږ ڗ ژ ڙ ښ ڛ ڜ ڝ ڞ ڟ
U+06Ax ڠ ڡ ڢ ڣ ڤ ڥ ڦ ڧ ڨ ک ڪ ګ ڬ ڭ ڮ گ
U+06Bx ڰ ڱ ڲ ڳ ڴ ڵ ڶ ڷ ڸ ڹ ں ڻ ڼ ڽ ھ ڿ
U+06Cx ۀ ہ ۂ ۃ ۄ ۅ ۆ ۇ ۈ ۉ ۊ ۋ ی ۍ ێ ۏ
U+06Dx ې ۑ ے ۓ ۔ ە ۖ ۗ ۘ ۙ ۚ ۛ ۜ  ۝  ۞ ۟
U+06Ex ۠ ۡ ۢ ۣ ۤ ۥ ۦ ۧ ۨ ۩ ۪ ۫ ۬ ۭ ۮ ۯ
U+06Fx ۰ ۱ ۲ ۳ ۴ ۵ ۶ ۷ ۸ ۹ ۺ ۻ ۼ ۽ ۾ ۿ
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points
3.^ Unicode code point U+0673 is deprecated as of Unicode version 6.0

Actually I'm getting better coverage with the default font here. Phil | Talk 12:30, 24 October 2005 (UTC)

Arabic supplement[edit]

Default font   Unicode font
Arabic Supplement[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+075x ݐ ݑ ݒ ݓ ݔ ݕ ݖ ݗ ݘ ݙ ݚ ݛ ݜ ݝ ݞ ݟ
U+076x ݠ ݡ ݢ ݣ ݤ ݥ ݦ ݧ ݨ ݩ ݪ ݫ ݬ ݭ ݮ ݯ
U+077x ݰ ݱ ݲ ݳ ݴ ݵ ݶ ݷ ݸ ݹ ݺ ݻ ݼ ݽ ݾ ݿ
Notes
1.^ As of Unicode version 7.0
 
Arabic Supplement[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+075x ݐ ݑ ݒ ݓ ݔ ݕ ݖ ݗ ݘ ݙ ݚ ݛ ݜ ݝ ݞ ݟ
U+076x ݠ ݡ ݢ ݣ ݤ ݥ ݦ ݧ ݨ ݩ ݪ ݫ ݬ ݭ ݮ ݯ
U+077x ݰ ݱ ݲ ݳ ݴ ݵ ݶ ݷ ݸ ݹ ݺ ݻ ݼ ݽ ݾ ݿ
Notes
1.^ As of Unicode version 7.0

Not getting anything here with either option: is there a better alternative for displaying Arabic? —Phil | Talk 12:45, 24 October 2005 (UTC)

Syriac[edit]

Default font   Unicode font
Syriac[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+070x ܀ ܁ ܂ ܃ ܄ ܅ ܆ ܇ ܈ ܉ ܊ ܋ ܌ ܍ ܏
 SAM 
U+071x ܐ ܑ ܒ ܓ ܔ ܕ ܖ ܗ ܘ ܙ ܚ ܛ ܜ ܝ ܞ ܟ
U+072x ܠ ܡ ܢ ܣ ܤ ܥ ܦ ܧ ܨ ܩ ܪ ܫ ܬ ܭ ܮ ܯ
U+073x ܰ ܱ ܲ ܳ ܴ ܵ ܶ ܷ ܸ ܹ ܺ ܻ ܼ ܽ ܾ ܿ
U+074x ݀ ݁ ݂ ݃ ݄ ݅ ݆ ݇ ݈ ݉ ݊ ݍ ݎ ݏ
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points
 
Syriac[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+070x ܀ ܁ ܂ ܃ ܄ ܅ ܆ ܇ ܈ ܉ ܊ ܋ ܌ ܍ ܏
 SAM 
U+071x ܐ ܑ ܒ ܓ ܔ ܕ ܖ ܗ ܘ ܙ ܚ ܛ ܜ ܝ ܞ ܟ
U+072x ܠ ܡ ܢ ܣ ܤ ܥ ܦ ܧ ܨ ܩ ܪ ܫ ܬ ܭ ܮ ܯ
U+073x ܰ ܱ ܲ ܳ ܴ ܵ ܶ ܷ ܸ ܹ ܺ ܻ ܼ ܽ ܾ ܿ
U+074x ݀ ݁ ݂ ݃ ݄ ݅ ݆ ݇ ݈ ݉ ݊ ݍ ݎ ݏ
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points

Thaana[edit]

Default font   Unicode font
Thaana[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+078x ހ ށ ނ ރ ބ ޅ ކ އ ވ މ ފ ދ ތ ލ ގ ޏ
U+079x ސ ޑ ޒ ޓ ޔ ޕ ޖ ޗ ޘ ޙ ޚ ޛ ޜ ޝ ޞ ޟ
U+07Ax ޠ ޡ ޢ ޣ ޤ ޥ ަ ާ ި ީ ު ޫ ެ ޭ ޮ ޯ
U+07Bx ް ޱ
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points
 
Thaana[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+078x ހ ށ ނ ރ ބ ޅ ކ އ ވ މ ފ ދ ތ ލ ގ ޏ
U+079x ސ ޑ ޒ ޓ ޔ ޕ ޖ ޗ ޘ ޙ ޚ ޛ ޜ ޝ ޞ ޟ
U+07Ax ޠ ޡ ޢ ޣ ޤ ޥ ަ ާ ި ީ ު ޫ ެ ޭ ޮ ޯ
U+07Bx ް ޱ
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points

Devanagari[edit]

Default font   Unicode font
Devanagari[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+090x
U+091x
U+092x
U+093x ि
U+094x
U+095x
U+096x
U+097x ॿ
Notes
1.^ As of Unicode version 7.0
 
Devanagari[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+090x
U+091x
U+092x
U+093x ि
U+094x
U+095x
U+096x
U+097x ॿ
Notes
1.^ As of Unicode version 7.0

Bengali[edit]

Default font   Unicode font
Bengali[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+098x
U+099x
U+09Ax
U+09Bx ি
U+09Cx
U+09Dx
U+09Ex
U+09Fx
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points
 
Bengali[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+098x
U+099x
U+09Ax
U+09Bx ি
U+09Cx
U+09Dx
U+09Ex
U+09Fx
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points

Gujarati[edit]

Default font   Unicode font
Gujarati[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+0A8x
U+0A9x
U+0AAx
U+0ABx િ
U+0ACx
U+0ADx
U+0AEx
U+0AFx
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points
 
Gujarati[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+0A8x
U+0A9x
U+0AAx
U+0ABx િ
U+0ACx
U+0ADx
U+0AEx
U+0AFx
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points

Tamil[edit]

Default font   Unicode font
Tamil[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+0B8x
U+0B9x
U+0BAx
U+0BBx ி
U+0BCx
U+0BDx
U+0BEx
U+0BFx
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points
 
Tamil[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+0B8x
U+0B9x
U+0BAx
U+0BBx ி
U+0BCx
U+0BDx
U+0BEx
U+0BFx
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points

Telugu[edit]

Default font   Unicode font
Telugu[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+0C0x
U+0C1x
U+0C2x
U+0C3x ి
U+0C4x
U+0C5x
U+0C6x
U+0C7x ౿
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points
 
Telugu[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+0C0x
U+0C1x
U+0C2x
U+0C3x ి
U+0C4x
U+0C5x
U+0C6x
U+0C7x ౿
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points

Kannada[edit]

Default font   Unicode font
Kannada[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+0C8x
U+0C9x
U+0CAx
U+0CBx ಿ
U+0CCx
U+0CDx
U+0CEx
U+0CFx
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points
 
Kannada[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+0C8x
U+0C9x
U+0CAx
U+0CBx ಿ
U+0CCx
U+0CDx
U+0CEx
U+0CFx
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points

Malayalam[edit]

Default font   Unicode font
Malayalam[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+0D0x
U+0D1x
U+0D2x
U+0D3x ി
U+0D4x     
U+0D5x
U+0D6x
U+0D7x ൿ
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points
 
Malayalam[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+0D0x
U+0D1x
U+0D2x
U+0D3x ി
U+0D4x     
U+0D5x
U+0D6x
U+0D7x ൿ
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points

Thai[edit]

Default font   Unicode font
Thai[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+0E0x
U+0E1x
U+0E2x
U+0E3x ฿
U+0E4x
U+0E5x
U+0E6x
U+0E7x
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points
 
Thai[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+0E0x
U+0E1x
U+0E2x
U+0E3x ฿
U+0E4x
U+0E5x
U+0E6x
U+0E7x
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points

Georgian[edit]

Default font   Unicode font
Georgian[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+10Ax
U+10Bx
U+10Cx
U+10Dx
U+10Ex
U+10Fx
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points
 
Georgian[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+10Ax
U+10Bx
U+10Cx
U+10Dx
U+10Ex
U+10Fx
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points

General punctuation[edit]

Default font   Unicode font
General Punctuation[1][2][3]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+200x NQ
 SP 
MQ
 SP 
EN
 SP 
EM
 SP 
 3/M 
SP
 4/M 
SP
 6/M 
SP
F
 SP 
P
 SP 
TH
 SP 
H
 SP 
ZW
 SP 
ZW
 NJ 
 ZW 
J
 LRM   RLM 
U+201x  NB 
U+202x L
 SEP 
P
 SEP 
 LRE   RLE   PDF   LRO   RLO   NNB 
SP
U+203x
U+204x
U+205x MM
  SP  
U+206x  WJ   ƒ()    ×     ,     +    LRI   RLI   FSI   PDI  I
 SS 
A
 SS 
I
 AFS 
A
 AFS 
NA
 DS 
NO
 DS 
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points
3.^ Unicode code points U+206A - U+206F are deprecated as of Unicode version 3.0
 
General Punctuation[1][2][3]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+200x NQ
 SP 
MQ
 SP 
EN
 SP 
EM
 SP 
 3/M 
SP
 4/M 
SP
 6/M 
SP
F
 SP 
P
 SP 
TH
 SP 
H
 SP 
ZW
 SP 
ZW
 NJ 
 ZW 
J
 LRM   RLM 
U+201x  NB 
U+202x L
 SEP 
P
 SEP 
 LRE   RLE   PDF   LRO   RLO   NNB 
SP
U+203x
U+204x
U+205x MM
  SP  
U+206x  WJ   ƒ()    ×     ,     +    LRI   RLI   FSI   PDI  I
 SS 
A
 SS 
I
 AFS 
A
 AFS 
NA
 DS 
NO
 DS 
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points
3.^ Unicode code points U+206A - U+206F are deprecated as of Unicode version 3.0

Pages that already use such letters:

Candidate pages for use:

  • RIGHT SINGLE QUOTATION MARK (’): Mother's day, St. John's Cathedral, and thousands more that use a possessive. Also T'Pol and many more
  • LEFT SINGLE QUOTATION MARK (‘): 'Okina, Hawaii, Ayin and others, but all of those are more correctly spelt with MODIFIER LETTER TURNED COMMA (U+02BB in the "Spacing modifier letters" section)
  • EN DASH (–): Hasse-Minkowski theorem and hundreds more
  • LEFT and RIGHT DOUBLE QUOTATION MARK (“ and ”): Knights who say Ni and others that use a quotation in the title

Superscripts and subscripts[edit]

Default font   Unicode font
Superscripts and Subscripts[1][2][3]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+207x
U+208x
U+209x
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points
3.^ Refer to the Latin-1 Supplement Unicode block for characters ¹ (U+00B9), ² (U+00B2) and ³ (U+00B3).
 
Superscripts and Subscripts[1][2][3]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+207x
U+208x
U+209x
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points
3.^ Refer to the Latin-1 Supplement Unicode block for characters ¹ (U+00B9), ² (U+00B2) and ³ (U+00B3).

Probably not relevant for titles and in body text its probably safer to use html subscript/superscript.

Besides requiring some heavy-duty fonts to be rendered correctly, the lesser-known of these characters (all but ² and ³, really) tend to appear too small to be legible, at least compared with <sub>. The policy in place for minor planets (whose designations often include subscripts) is 1) to use <sub> systematically in the article and in the wrongtitle template, 2) use non-subscripts in the title, and 3) put in place a redirect from the subscripted title. Urhixidur 14:51, 2005 August 2 (UTC)

Currency symbols[edit]

Default font   Unicode font
Currency Symbols[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+20Ax
U+20Bx
U+20Cx
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points
 
Currency Symbols[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+20Ax
U+20Bx
U+20Cx
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points

The Euro (U+20AC) is heavily used within articles and should be OK to use in an article title (though it's hard to think of a plausible title that would use it... maybe if some book or movie title incorporates the symbol). -- Curps 21:08, 24 August 2005 (UTC)

How about €2 commemorative coins? ;)Nightstallion (?) 12:48, 28 July 2006 (UTC)

Letterlike symbols[edit]

Default font   Unicode font
Letterlike Symbols[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+210x
U+211x
U+212x
U+213x
U+214x
Notes
1.^ As of Unicode version 7.0
 
Letterlike Symbols[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+210x
U+211x
U+212x
U+213x
U+214x
Notes
1.^ As of Unicode version 7.0

Miscellaneous symbols[edit]

Default font   Unicode font
Miscellaneous Symbols[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+260x
U+261x
U+262x
U+263x
U+264x
U+265x
U+266x
U+267x
U+268x
U+269x
U+26Ax
U+26Bx
U+26Cx
U+26Dx
U+26Ex
U+26Fx
Notes
1.^ As of Unicode version 7.0
 
Miscellaneous Symbols[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+260x
U+261x
U+262x
U+263x
U+264x
U+265x
U+266x
U+267x
U+268x
U+269x
U+26Ax
U+26Bx
U+26Cx
U+26Dx
U+26Ex
U+26Fx
Notes
1.^ As of Unicode version 7.0
  • MUSICAL SHARP SIGN (Unicode 266F), ♯. Needed for Sharp-P and Sharp-P-complete. Some textbooks use NUMBER SIGN (#) instead, and refer to the complexity class as Number-P. I prefer that symbol myself, but that’s not the point here. Needed for C sharp. Arbor 7 July 2005 07:18 (UTC)

Alphabetical presentation forms[edit]

Default font   Unicode font
Alphabetic Presentation Forms[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+FB0x
U+FB1x ﬞ 
U+FB2x שּׂ</