Template:Unicode version

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search
Version Date Book Corresponding ISO/IEC 10646 edition Scripts Characters
Total[tablenote 1] Notable additions
1.0.0[1] October 1991 ISBN 0-201-56788-1 (Vol. 1) 24 7,129[tablenote 2] Initial repertoire covers these scripts: Arabic, Armenian, Bengali, Bopomofo, Cyrillic, Devanagari, Georgian, Greek and Coptic, Gujarati, Gurmukhi, Hangul, Hebrew, Hiragana, Kannada, Katakana, Lao, Latin, Malayalam, Oriya, Tamil, Telugu, Thai, and Tibetan.[1]
1.0.1[2] June 1992 ISBN 0-201-60845-6 (Vol. 2) 25 28,327
(21,204 added;
6 removed)
The initial set of 20,902 CJK Unified Ideographs is defined.[2]
1.1[3] June 1993 ISO/IEC 10646-1:1993 24 34,168
(5,963 added;
89 removed;
33 reclassified
as control
characters)
4,306 more Hangul syllables added to original set of 2,350 characters. Tibetan removed.[3]
2.0[4] July 1996 ISBN 0-201-48345-9 ISO/IEC 10646-1:1993 plus Amendments 5, 6 and 7 25 38,885
(11,373 added;
6,656 removed)
Original set of Hangul syllables removed, and a new set of 11,172 Hangul syllables added at a new location. Tibetan added back in a new location and with a different character repertoire. Surrogate character mechanism defined, and Plane 15 and Plane 16 Private Use Areas allocated.[4]
2.1[5] May 1998 ISO/IEC 10646-1:1993 plus Amendments 5, 6 and 7, as well as two characters from Amendment 18 25 38,887
(2 added)
Euro sign and Object Replacement Character added.[5]
3.0 September 1999 ISBN 0-201-61633-5 ISO/IEC 10646-1:2000 38 49,194
(10,307 added)
Cherokee, Ethiopic, Khmer, Mongolian, Burmese, Ogham, Runic, Sinhala, Syriac, Thaana, Unified Canadian Aboriginal Syllabics, and Yi Syllables added, as well as a set of Braille patterns.[6]
3.1 March 2001 ISO/IEC 10646-1:2000

ISO/IEC 10646-2:2001

41 94,140
(44,946 added)
Deseret, Gothic and Old Italic added, as well as sets of symbols for Western music and Byzantine music, and 42,711 additional CJK Unified Ideographs.[7]
3.2 March 2002 ISO/IEC 10646-1:2000 plus Amendment 1

ISO/IEC 10646-2:2001

45 95,156
(1,016 added)
Philippine scripts Buhid, Hanunó'o, Tagalog, and Tagbanwa added.[8]
4.0 April 2003 ISBN 0-321-18578-1 ISO/IEC 10646:2003 52 96,382
(1,226 added)
Cypriot syllabary, Limbu, Linear B, Osmanya, Shavian, Tai Le, and Ugaritic added, as well as Hexagram symbols.[9]
4.1 March 2005 ISO/IEC 10646:2003 plus Amendment 1 59 97,655
(1,273 added)
Buginese, Glagolitic, Kharoshthi, New Tai Lue, Old Persian, Syloti Nagri, and Tifinagh added, and Coptic was disunified from Greek. Ancient Greek numbers and musical symbols were also added.[10]
5.0 July 2006 ISBN 0-321-48091-0 ISO/IEC 10646:2003 plus Amendments 1 and 2, as well as four characters from Amendment 3 64 99,024
(1,369 added)
Balinese, Cuneiform, N'Ko, Phags-pa, and Phoenician added.[11]
5.1 April 2008 ISO/IEC 10646:2003 plus Amendments 1, 2, 3 and 4 75 100,648
(1,624 added)
Carian, Cham, Kayah Li, Lepcha, Lycian, Lydian, Ol Chiki, Rejang, Saurashtra, Sundanese, and Vai added, as well as sets of symbols for the Phaistos Disc, Mahjong tiles, and Domino tiles. There were also important additions for Burmese, additions of letters and Scribal abbreviations used in medieval manuscripts, and the addition of Capital ẞ.[12]
5.2 October 2009 ISBN 978-1-936213-00-9 ISO/IEC 10646:2003 plus Amendments 1, 2, 3, 4, 5 and 6 90 107,296
(6,648 added)
Avestan, Bamum, Egyptian hieroglyphs (the Gardiner Set, comprising 1,071 characters), Imperial Aramaic, Inscriptional Pahlavi, Inscriptional Parthian, Javanese, Kaithi, Lisu, Meetei Mayek, Old South Arabian, Old Turkic, Samaritan, Tai Tham and Tai Viet added. 4,149 additional CJK Unified Ideographs (CJK-C), as well as extended Jamo for Old Hangul, and characters for Vedic Sanskrit.[13]
6.0 October 2010 ISBN 978-1-936213-01-6 ISO/IEC 10646:2010 plus the Indian rupee sign 93 109,384
(2,088 added)
Batak, Brahmi, Mandaic, playing card symbols, transport and map symbols, alchemical symbols, emoticons and emoji. 222 additional CJK Unified Ideographs (CJK-D) added.[14]
6.1 January 2012 ISBN 978-1-936213-02-3 ISO/IEC 10646:2012 100 110,116
(732 added)
Chakma, Meroitic cursive, Meroitic hieroglyphs, Miao, Sharada, Sora Sompeng, and Takri.[15]
6.2 September 2012 ISBN 978-1-936213-07-8 ISO/IEC 10646:2012 plus the Turkish lira sign 100 110,117
(1 added)
Turkish lira sign.[16]
6.3 September 2013 ISBN 978-1-936213-08-5 ISO/IEC 10646:2012 plus six characters 100 110,122
(5 added)
5 bidirectional formatting characters.[17]
7.0 June 2014 ISBN 978-1-936213-09-2 ISO/IEC 10646:2012 plus Amendments 1 and 2, as well as the Ruble sign 123 112,956
(2,834 added)
Bassa Vah, Caucasian Albanian, Duployan, Elbasan, Grantha, Khojki, Khudawadi, Linear A, Mahajani, Manichaean, Mende Kikakui, Modi, Mro, Nabataean, Old North Arabian, Old Permic, Pahawh Hmong, Palmyrene, Pau Cin Hau, Psalter Pahlavi, Siddham, Tirhuta, Warang Citi, and Dingbats.[18]
8.0 June 2015 ISBN 978-1-936213-10-8 ISO/IEC 10646:2014 plus Amendment 1, as well as the Lari sign, nine CJK unified ideographs, and 41 emoji characters[19] 129 120,672
(7,716 added)
Ahom, Anatolian hieroglyphs, Hatran, Multani, Old Hungarian, SignWriting, 5,771 CJK unified ideographs, a set of lowercase letters for Cherokee, and five emoji skin tone modifiers.[20]
9.0 June 2016 ISBN 978-1-936213-13-9 ISO/IEC 10646:2014 plus Amendments 1 and 2, as well as Adlam, Newa, Japanese TV symbols, and 74 emoji and symbols[21] 135 128,172
(7,500 added)
Adlam, Bhaiksuki, Marchen, Newa, Osage, Tangut, and 72 emoji.[22][23]
10.0 June 2017 ISBN 978-1-936213-16-0 ISO/IEC 10646:2017 plus 56 emoji characters, 285 hentaigana characters, and 3 Zanabazar Square characters[24] 139 136,690
(8,518 added)
Zanabazar Square, Soyombo, Masaram Gondi, Nüshu, hentaigana (non-standard hiragana), 7,494 CJK unified ideographs, 56 emoji, and bitcoin symbol.[25]
11.0 June 2018 ISBN 978-1-936213-19-1 ISO/IEC 10646:2017 plus Amendment 1, as well as 46 Mtavruli Georgian capital letters, 5 CJK unified ideographs, and 66 emoji characters.[26] 146 137,374
(684 added)
Dogra, Georgian Mtavruli capital letters, Gunjala Gondi, Hanifi Rohingya, Indic Siyaq Numbers, Makasar, Medefaidrin, Old Sogdian and Sogdian, Mayan numerals, 5 urgently needed CJK unified ideographs, symbols for xiangqi (Chinese chess) and star ratings, and 145 emoji.[27]
12.0 March 2019 ISBN 978-1-936213-22-1 ISO/IEC 10646:2017 plus Amendments 1 and 2, as well as 62 additional characters.[28] 150 137,928
(554 added)
Elymaic, Nandinagari, Nyiakeng Puachue Hmong, Wancho, Miao script additions for several Miao and Yi languages of China, hiragana and katakana small letters for writing archaic Japanese, Tamil historic fractions and symbols, Lao letters for Pali, Latin letters for Egyptological and Ugaritic transliteration, hieroglyph format controls, and 61 emoji.[29]
12.1 May 2019 ISBN 978-1-936213-25-2 150 137,929
(1 added)
Adds a single character at U+32FF for the square ligature form of the name of the Reiwa era.[30]
13.0[31] March 2020 ISBN 978-1-936213-26-9 ISO/IEC 10646:2020[32] 154 143,859
(5,930 added)
Chorasmian, Dives Akuru, Khitan small script, Yezidi, 4,969 CJK unified ideographs added (including 4,939 in Ext. G), Arabic script additions used to write Hausa, Wolof, and other languages in Africa and other additions used to write Hindko and Punjabi in Pakistan, Bopomofo additions used for Cantonese, Creative Commons license symbols, graphic characters for compatibility with teletext and home computer systems from the 1970s and 1980s, and 55 emoji.[31]
14.0[33] September 2021 ISBN 978-1-936213-29-0 159 144,697
(838 added)
Toto, Cypro-Minoan, Vithkuqi, Old Uyghur, Tangsa, Latin script additions at SMP blocks (Ext-F, Ext-G) for use in extended IPA, Arabic script additions for use in languages across Africa and in Iran, Pakistan, Malaysia, Indonesia, Java, and Bosnia, and to write honorifics, additions for Quranic use, other additions to support languages in North America, the Philippines, India, and Mongolia, addition of the Kyrgyzstani som currency symbol, support for Znamenny musical notation, and 37 emoji.[33]
  1. ^ The number of characters listed for each version of Unicode is the total number of graphic and format characters (i.e., excluding private-use characters, control characters, noncharacters and surrogate code points).
  2. ^ Not counting 'space' or 33 non-printing characters (7,163 total)[1]