Combining Diacritical Marks

From Wikipedia, the free encyclopedia
Jump to: navigation, search
Combining Diacritical Marks
Range U+0300..U+036F
(112 code points)
Plane BMP
Scripts Inherited
Major alphabets IPA, UPA
Symbol sets accents
diacritics
Assigned 112 code points
Unused 0 reserved code points
Unicode version history
1.0.0 66 (+66)
1.0.1 68 (+2)
1.1 72 (+4)
3.0 82 (+10)
3.2 96 (+14)
4.0 107 (+11)
4.1 112 (+5)
Note: Two characters were moved from the Greek and Coptic block to the Combining Diacritical Marks block in version 1.0.1 during the process of unifying with ISO 10646.[1][2][3]

Combining Diacritical Marks is a Unicode block containing the most common combining characters. It also contains the character "Combining Grapheme Joiner", which prevents canonical reordering of combining characters, and despite the name, actually separates characters that would otherwise be considered a single grapheme in a given context.

Block[edit]

Combining Diacritical Marks[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+030x  ̀  ́  ̂  ̃  ̄  ̅  ̆  ̇  ̈  ̉  ̊  ̋  ̌  ̍  ̎  ̏
U+031x  ̐  ̑  ̒  ̓  ̔  ̕  ̖  ̗  ̘  ̙  ̚  ̛  ̜  ̝  ̞  ̟
U+032x  ̠  ̡  ̢  ̣  ̤  ̥  ̦  ̧  ̨  ̩  ̪  ̫  ̬  ̭  ̮  ̯
U+033x  ̰  ̱  ̲  ̳  ̴  ̵  ̶  ̷  ̸  ̹  ̺  ̻  ̼  ̽  ̾  ̿
U+034x  ̀  ́  ͂  ̓  ̈́  ͅ  ͆  ͇  ͈  ͉  ͊  ͋  ͌  ͍  ͎  CGJ 
U+035x  ͐  ͑  ͒  ͓  ͔  ͕  ͖  ͗  ͘  ͙  ͚  ͛  ͜  ͝  ͞  ͟
U+036x  ͠  ͡  ͢  ͣ  ͤ  ͥ  ͦ  ͧ  ͨ  ͩ  ͪ  ͫ  ͬ  ͭ  ͮ  ͯ
Notes
1.^ As of Unicode version 10.0

History[edit]

The following Unicode-related documents record the purpose and process of defining specific characters in the Combining Diacritical Marks block:

Version Final code points[a] Count L2 ID WG2 ID Document
1.0.0 U+0300..0341 66 (to be determined)
L2/16-178 Evans, Lorna; Keating, Patricia (2016-07-06), Representative glyph and annotation additions for U+033B 
L2/16-216 Anderson, Deborah; Whistler, Ken; McGowan, Rick; Pournader, Roozbeh; Glass, Andrew; Iancu, Laurențiu; Moore, Lisa (2016-07-30), Recommendations to UTC #148 August 2016 on Script Proposals 
1.0.1 U+0344..0345 2 (to be determined)
L2/07-068 Muller, Eric (2007-02-05), Representation of Greek mute iota 
L2/07-198 Hudson, John (2007-06-01), Representation of Greek mute iota (response to L2/07-068) 
L2/07-263 Anderson, Deborah (2007-08-07), Input on Greek Mute Iota 
1.1 U+0342..0343, 0360..0361 4 (to be determined)
3.0 U+0346..034E, 0362 10 L2/98-299 N1845 Everson, Michael (1998-09-08), Additional IPA “disturbed speech” characters for the UCS 
3.2 U+034F 1 L2/00-274 N2236 Davis, Mark (2000-08-09), Proposal for addition of COMBINING GRAPHEME JOINER 
L2/01-042 N2317 Everson, Michael (2001-01-19), On the use of JOINERS in ligation 
L2/05-094 Hallissy, Bob; Priest, Lorna (2005-04-18), Use of CGJ in Latin script diacritics 
L2/05-108R Moore, Lisa (2005-08-26), "C.7", UTC #103 Minutes 
U+0363..036F 13 L2/00-061 N2160 Küster, Mark (2000-02-02), Proposal concerning combining diacritics for medievalistics [for Middle High German] 
L2/00-330 N2266 Küster, Marc; Wojtovicz, Isabel (2000-09-14), Diacritics for medieval studies 
L2/01-050 N2253 Umamaheswaran, V. S. (2001-01-21), "7.17", Minutes of the SC2/WG2 meeting in Athens, September 2000 
4.0 U+0350..0357 8 L2/02-141 N2419 Everson, Michael; et al. (2002-03-20), Uralic Phonetic Alphabet characters for the UCS 
L2/02-192 Everson, Michael (2002-05-02), Everson's Reply on UPA 
N2442 Everson, Michael; Kolehmainen, Erkki I.; Ruppel, Klaas; Trosterud, Trond (2002-05-21), Justification for placing the Uralic Phonetic Alphabet in the BMP 
L2/02-291 Whistler, Ken (2002-05-31), WG2 report from Dublin 
L2/02-292 Whistler, Ken (2002-06-03), Early look at WG2 consent docket 
L2/02-166R2 Moore, Lisa (2002-08-09), "Scripts and New Characters - UPA", UTC #91 Minutes 
L2/02-253 Moore, Lisa (2002-10-21), UTC #92 Minutes 
U+035D..035F 3 L2/02-176 Whistler, Ken (2002-03-07), How to make "oo" with combining breve/macron over pair? 
L2/02-204 N2457 Additional Double Diacritics, 2002-05-10 
L2/02-367 Constable, Peter (2002-10-30), Double Combining Breve Below, Double Combining Macron 
L2/03-142R N2594 McGowan, Rick; Constable, Peter; Whistler, Ken (2003-07-01), Proposal to encode combining double breve below 
4.1 U+0358 1 L2/97-148 N1593 Everson, Michael (1997-05-27), Proposal to add Latin characters by Latinized Taiwanese languages 
L2/98-089 N1712R Comments on proposal to add Latin characters to Latinized Taiwanese languages, 1998-03-18 
L2/02-348 N2507 Draft of Proposal to add Latin characters required by Latinized Taiwanese Holo language to ISO/IEC 10646, 2002-03-31 
L2/03-317 N2626 Proposal on IPA Extensions & Combining Diacritical Marks for ISO/IEC 10646 in BMP, 2003-09-27 
L2/03-318 N2628 Proposal to add Combining Right Dot Above as required by Taiwanese Holo language Romanization to ISO/IEC 10646, 2003-09-29 
L2/03-339 N2646 Constable, Peter (2003-10-08), Comments on N2626, Proposal on IPA Extensions and Combining Diacritic Marks for ISO/IEC 10646 in BMP 
L2/03-369 N2669 Comments on “Combining dot above”, 2003-10-20 
L2/04-076 N2699 Tai, Kaihsu; Chen, Pochung; Tan-Tenn, Henry (2003-10-20), Response to WG2-N2669 
L2/03-372 N2673 Anderson, Deborah (2003-10-22), Comments on N2626, “Proposal on IPA Extensions & Combining Diacritic Marks for ISO 10646 in BMP” 
L2/04-107 N2713 Revised Proposal for encoding A Supplemented Set of IPA Combining Marks, Modifier Letters & Five-Degree Contour Tone Marks, 2004-03-20 
L2/04-200 N2770R Tai, Kaihsu; Chen, Pochung; Tan-Tenn, Henry (2004-05-29), Response to WG2-N2713 - Combining right dot above 
U+0359 1 L2/03-157 Pantelia, Maria (2003-05-19), Additional Beta Code Characters not in Unicode (WIP) 
L2/03-186R N2612-4 Pantelia, Maria (2003-06-11), Proposal to encode one additional combining diacritic in the UCS 
U+035A 1 L2/02-364 N2630 Glass, Andrew; Baums, Stefan; Salomon, Richard (2003-09-29), Proposal to add one combining diacritic to the UCS 
U+035B 1 L2/03-219 N2597 Proposal to encode one combining character in the UCS, 2003-06-25 
U+035C 1 L2/02-031 Anderson, Deborah (2002-01-21), TLG Miscellanea Proposal 
L2/02-033 Anderson, Deborah (2002-01-21), TLG Unicode Proposal (draft) 
L2/02-053 Anderson, Deborah (2002-02-04), Description of TLG Documents 
L2/02-273 Pantelia, Maria (2002-07-31), TLG Unicode Proposal 
L2/02-287 Pantelia, Maria (2002-08-09), Proposal Summary Form accompanying TLG Unicode Proposal (L2/02-273) 
L2/02-312R Pantelia, Maria (2002-11-07), Proposal to encode additional Greek editorial and punctuation characters in the UCS 
L2/03-324 N2642 Pantelia, Maria (2003-10-06), Proposal to encode additional Greek editorial and punctuation characters in the UCS 
  1. ^ Proposed code points and characters names may differ from final code points and names

See also[edit]

References[edit]

  1. ^ "Unicode 1.0.1 Addendum" (PDF). The Unicode Standard. 1992-11-03. Retrieved 2016-07-09. 
  2. ^ "Unicode character database". The Unicode Standard. Retrieved 2016-07-09. 
  3. ^ "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2016-07-09.