Arabic (Unicode block)

From Wikipedia, the free encyclopedia
Jump to: navigation, search
Arabic
Range U+0600..U+06FF
(256 code points)
Plane BMP
Scripts Arabic (237 char.)
Common (6 char.)
Inherited (12 char.)
Major alphabets Arabic
Pashto
Persian
Urdu
Assigned 255 code points
Unused 1 reserved code points
1 deprecated
Source standards ISO 8859-6
Unicode version history
1.0.0 169 (+169)
1.1 194 (+25)
3.0 206 (+12)
3.2 208 (+2)
4.0 227 (+19)
4.1 235 (+8)
5.1 250 (+15)
6.0 252 (+2)
6.1 253 (+1)
6.3 254 (+1)
7.0 255 (+1)

Note: [1][2]

Arabic is a Unicode block, containing the standard letters and the most common diacritics of the Arabic script, and the Arabic-Indic digits.[3]

Block[edit]

Arabic[1][2][3]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+060x  ؀   ؁   ؂   ؃   ؄   ؅  ؆ ؇ ؈ ؉ ؊ ؋ ، ؍ ؎ ؏
U+061x ؐ ؑ ؒ ؓ ؔ ؕ ؖ ؗ ؘ ؙ ؚ ؛  ALM  ؞ ؟
U+062x ؠ ء آ أ ؤ إ ئ ا ب ة ت ث ج ح خ د
U+063x ذ ر ز س ش ص ض ط ظ ع غ ػ ؼ ؽ ؾ ؿ
U+064x ـ ف ق ك ل م ن ه و ى ي ً ٌ ٍ َ ُ
U+065x ِ ّ ْ ٓ ٔ ٕ ٖ ٗ ٘ ٙ ٚ ٛ ٜ ٝ ٞ ٟ
U+066x ٠ ١ ٢ ٣ ٤ ٥ ٦ ٧ ٨ ٩ ٪ ٫ ٬ ٭ ٮ ٯ
U+067x ٰ ٱ ٲ ٳ ٴ ٵ ٶ ٷ ٸ ٹ ٺ ٻ ټ ٽ پ ٿ
U+068x ڀ ځ ڂ ڃ ڄ څ چ ڇ ڈ ډ ڊ ڋ ڌ ڍ ڎ ڏ
U+069x ڐ ڑ ڒ ړ ڔ ڕ ږ ڗ ژ ڙ ښ ڛ ڜ ڝ ڞ ڟ
U+06Ax ڠ ڡ ڢ ڣ ڤ ڥ ڦ ڧ ڨ ک ڪ ګ ڬ ڭ ڮ گ
U+06Bx ڰ ڱ ڲ ڳ ڴ ڵ ڶ ڷ ڸ ڹ ں ڻ ڼ ڽ ھ ڿ
U+06Cx ۀ ہ ۂ ۃ ۄ ۅ ۆ ۇ ۈ ۉ ۊ ۋ ی ۍ ێ ۏ
U+06Dx ې ۑ ے ۓ ۔ ە ۖ ۗ ۘ ۙ ۚ ۛ ۜ  ۝  ۞ ۟
U+06Ex ۠ ۡ ۢ ۣ ۤ ۥ ۦ ۧ ۨ ۩ ۪ ۫ ۬ ۭ ۮ ۯ
U+06Fx ۰ ۱ ۲ ۳ ۴ ۵ ۶ ۷ ۸ ۹ ۺ ۻ ۼ ۽ ۾ ۿ
Notes
1.^ As of Unicode version 10.0
2.^ Grey area indicates non-assigned code point
3.^ Unicode code point U+0673 is deprecated as of Unicode version 6.0

History[edit]

The following Unicode-related documents record the purpose and process of defining specific characters in the Arabic block:

Version Final code points[a] Count L2 ID WG2 ID Document
1.0.0 U+060C, 061B, 061F, 0621..063A, 0640..0652, 0660..066C, 0670..06B7, 06BA..06BE, 06C0..06CE, 06D0..06D5, 06F0..06F9 169 (to be determined)
L2/01-270 Hosken, Martin (2001-06-19), How U+06D5 works in Uighur, Some technical information collected 
L2/04-290 Karlsson, Kent (2004-07-16), Updating the Arabic Shaping normative data 
L2/04-419 Davis, Mark (2004-11-18), ArabicShaping suggestion e-mail 
L2/09-146 Pournader, Roozbeh (2009-04-15), Moving dots and Arabic script shaping: Farsi Yeh's and Jawi Nya 
L2/10-045 Allawi, Adil (2010-01-27), Proposal for changes to ArabicShaping.txt to allow machine generation of Arabic fonts and glyphs 
L2/10-168 Mansour, Kamal (2010-05-04), Problems with the joining behavior of Arabic Letter Yeh Barree (U+06D2) 
L2/10-108 Moore, Lisa (2010-05-19), "B.13.2", UTC #123 / L2 #220 Minutes 
L2/11-092 Pournader, Roozbeh (2011-03-08), Changes to schematic names of Arabic letters 
L2/11-206 N4066 Proposing to Supplement with the Script and Character of Chaghatay Language, 2011-04-25 
N4067 Proposal to Encode Special Scripts and Characters in UCS for Uighur language, 2011-05-15 
L2/11-245 N4113 Aalto, Tero (2011-06-08), Ad hoc report on Uighur 
L2/12-063 N4218 Proposal to add a Named UCS Sequence Identifier UYGHUR LETTERS, 2012-02-02 
L2/12-101 N4231 Pournader, Roozbeh; Anderson, Deborah (2012-02-09), Comments on N4218 Proposal to add a Named UCS Sequence Identifier UYGHUR LETTERS 
L2/12-098 N4254 "RESOLUTION M59.05 (Named USIs for characters for Uyghur and Chaghatay)", WG2 Resolutions, 2012-02-17 
L2/12-381 Pournader, Roozbeh (2012-11-03), Initial and medial forms of Arabic Letter Noon Ghunna 
L2/12-343R2 Moore, Lisa (2012-12-04), "B.1.1.5", UTC #133 Minutes 
L2/13-119 Pournader, Roozbeh (2013-05-08), Dot positioning of U+06A3 Arabic Letter Feh with Dot Below 
N4463 Silamu, Wushour; Anderson, Deborah; Constable, Peter (2013-06-28), User Guidelines for Uyghur, Kazakh, Kyrgyz, and Chagatai 
L2/13-226 Milo, Thomas (2013-11-26), Arabic Amphibious Characters 
L2/14-109 Milo, Thomas (2014-05-01), Koranic and Classic orthography in Unicode and computer typography 
L2/14-136 Pournader, Roozbeh (2014-05-08), The right hehs for Arabic script orthographies of Sorani Kurdish and Uighur 
1.1 U+066D, 06D6..06ED 25 (to be determined)
L2/01-428 Kew, Jonathan (2001-11-01), Request for clarification regarding U+06DD ARABIC END OF AYAH and other Arabic enclosing marks 
L2/03-112 Pournader, Roozbeh (2003-03-05), New Arabic controls and Arabic joining 
L2/05-150 Freytag, Asmus (2005-05-05), Arabic errata 
L2/05-151 Milo, Thomas (2005-05-12), Annotations to the printing of the 1924 Azhar Qur'an 
L2/05-203 McGowan, Rick (2005-08-04), Public Review Issue #73: Representative Glyphs for Arabic Characters U+06DF, U+06E0, and U+06E1 
L2/05-231 Mansour, Kamal (2005-08-11), Regarding the proposed changes for the representative glyphs for 06DF, 06E0, and 06E1 
L2/06-324R2 Moore, Lisa (2006-11-29), "B.14.2", UTC #109 Minutes 
L2/09-358R Pournader, Roozbeh (2009-10-28), Discussion document for polishing Koranic support in Unicode 
L2/10-209 Pournader, Roozbeh (2010-06-07), Public Review Issue #171: Changing the properties of U+06DE from a combining mark to a spacing symbol 
3.0 U+0653..0655, 06B8..06B9, 06BF, 06CF, 06FA..06FE 12 (to be determined)
3.2 U+066E..066F 2 L2/00-354 Davis, Mark; Mansour, Kamal (2000-10-12), Proposal For Addition To Arabic repertoire 
L2/01-150 N2357 Proposal to encode two Arabic characters to the UCS, 2001-04-04 
4.0 U+0600..0602, 060D..060E, 0610..0614, 0656..0658 13 L2/00-135 Nelson, Paul; Farhan, Ashhar; Hisam, Arif; Hisam, Kashif; Clews, John (2000-04-07), Proposal to Add Urdu Epethit and Abbreviation Diacritics to the Arabic Block 
L2/01-303 Vikas, Om (2001-07-26), Letter from the Government from India on "Draft for Unicode Standard for Indian Scripts" 
L2/01-304 Feedback on Unicode Standard 3.0, 2001-08-02 
L2/01-305 McGowan, Rick (2001-08-08), Draft UTC Response to L2/01-304, "Feedback on Unicode Standard 3.0" 
L2/01-425 N2483 Kew, Jonathan (2001-11-01), Proposal to add Arabic-script honorifics and other marks 
L2/01-426 Kew, Jonathan (2001-11-01), Proposal to add Arabic-script honorifics and other marks, Appendix: Examples of usage 
L2/01-428 Kew, Jonathan (2001-11-01), Request for clarification regarding U+06DD ARABIC END OF AYAH and other Arabic enclosing marks 
L2/01-439 Milo, Tom (2001-11-02), Arabic Year-sign examples 
L2/01-430R McGowan, Rick (2001-11-20), UTC Response to L2/01-304, “Feedback on Unicode Standard 3.0” 
L2/02-061 N2482 Kew, Jonathan (2002-01-29), Bidi committee consensus on Arabic additions from L2/01-425 
L2/02-227 N2487 Proposal to add 16 Arabic characters, 2002-05-21 
L2/03-102 Vikas, Om (2003-03-04), Unicode Standard for Indic Scripts 
L2/03-101.10 Proposed Changes in Indic Scripts [Urdu, Sindhi, and Kashmiri document], 2003-03-04 
L2/03-112 Pournader, Roozbeh (2003-03-05), New Arabic controls and Arabic joining 
L2/04-196 N2653 Umamaheswaran, V. S. (2004-06-04), "a-3", Unconfirmed minutes of WG 2 meeting 44 
L2/06-332 Esfahbod, Behdad; Pournader, Roozbeh (2006-10-15), Proposal to change the Bidi category of five Arabic characters from AL to AN 
L2/06-372 Lata, Swaran (2006-11-04), Issues Pertinent to Kashmiri 
L2/06-324R2 Moore, Lisa (2006-11-29), "B.14.2", UTC #109 Minutes 
L2/15-183R Pournader, Roozbeh (2015-07-28), Candidate characters for Grapheme_Cluster_Break=Prepend 
U+0603, 060F, 0615 3 N2413 Proposal for Incorporation of Urdu in ISO/IEC 10646 and Unicode, 2002-01-23 
L2/02-005 Hussain, Sarmad; Afzal, Muhammad (2001-12-18), Urdu Computing Standards (Charts and Exhibits) 
L2/02-006, L2/02-006 N2413-1 Zia, Khaver (2002-01-10), Towards Unicode Standard for Urdu 
L2/02-003 N2413-2 Afzal, Muhammad; Hussain, Sarmad (2001-12-28), Urdu Computing Standards: Development of Urdu Zabta Takhti (UZT) 1.01 
L2/02-004 N2413-3 Hussain, Sarmad; Afzal, Muhammad (2001-12-28), Urdu Computing Standards: Urdu Zabta Takhti (UZT) 1.01 
L2/02-163 N2413-4 Proposal to add Marks and Digits in Arabic Code Block (for Urdu), 2002-04-30 
L2/02-011R Kew, Jonathan (2002-01-12), Comments on L2/02-006: Towards Unicode Standard for Urdu 
L2/02-197 Freytag, Asmus (2002-05-01), Urdu Feedback from Bidi Committee 
L2/02-166R2 Moore, Lisa (2002-08-09), UTC #91 Minutes 
L2/02-372 N2453 Umamaheswaran, V. S. (2002-10-30), "7.9", Unconfirmed minutes of WG 2 meeting 42 
L2/03-034 Nelson, Paul; Ross, Fiona; Holloway, Tim; Hudson, John (2003-02-10), Proposal to change character properties of ARABIC SIGN SAFHA (U+0603) 
L2/04-196 N2653 Umamaheswaran, V. S. (2004-06-04), "a-3", Unconfirmed minutes of WG 2 meeting 44 
U+06EE..06EF, 06FF 3 L2/01-427 N2481 Kew, Jonathan (2001-11-01), Proposal to add Parkari letters to Arabic block 
L2/02-227 N2487 Proposal to add 16 Arabic characters, 2002-05-21 
4.1 U+060B 1 N2523 Everson, Michael (2002-11-20), Proposal to encode the AFGHANI SIGN in the UCS 
L2/03-330 N2640 Everson, Michael (2003-10-01), Revised proposal to encode the AFGHANI SIGN in the UCS 
U+061E, 065A..065C 4 L2/98-274 Davis, Mark; Mansour, Kamal (1998-07-28), Proposed Arabic Script Additions for Minority Languages 
L2/98-409 Davis, Mark; Mansour, Kamal (1998-12-01), Proposal to add 25 Arabic characters to the BMP 
L2/02-021 Davis, Mark; Mansour, Kamal (2002-01-17), Proposal To Amend Arabic repertoire 
L2/03-154 Kew, Jonathan; Mansour, Kamal; Davis, Mark (2003-05-16), Proposal to encode productive Arabic-script modifier marks 
L2/03-168 Kew, Jonathan (2003-06-02), Proposal to encode Arabic-script letters for African languages 
L2/03-210 Kew, Jonathan (2003-06-12), Draft chart showing UTC #95 additions to Arabic blocks 
L2/03-223 N2598 Kew, Jonathan (2003-07-10), Proposal to encode additional Arabic-script characters 
U+0659 1 L2/03-133R N2581R2 Everson, Michael; Pournader, Roozbeh (2003-05-29), Proposal to encode the ARABIC ZWARAKAY in the UCS 
U+065D..065E 2 L2/04-025R N2723 Kew, Jonathan (2004-03-15), Proposal to encode Additional Arabic script characters 
5.1 U+0606..060A 5 L2/05-318 Lazrek, Azzeddine (2005-10-24), Proposals for Unicode Consortium [Arabic mathematical symbols] 
L2/05-320 Lazrek, Azzeddine (2005-07-10), Arabic Mathematical Diverse Symbols, Additional characters proposed to Unicode 
L2/06-125 N3086, N3086-1 Lazrek, Azzeddine (2006-03-30), Diverse Arabic Mathematical Symbols 
U+0616, 063B..063F 6 L2/06-345R N3180R Everson, Michael; Pournader, Roozbeh; Sarbar, Elnaz (2006-10-24), Proposal to encode eight Arabic characters for Persian and Azerbaijani in the UCS 
L2/07-221 Hallissy, Bob (2007-07-19), Shaping behavior of Arabic characters based on Farsi Yeh [2007.07.19] 
L2/07-225 Moore, Lisa (2007-08-21), "B.14.3.1", UTC #112 Minutes 
U+0617..061A 4 L2/06-358R N3185R Everson, Michael; Pournader, Roozbeh (2006-11-01), Proposal to encode four Qur'anic Arabic characters in the UCS 
6.0 U+0620, 065F 2 L2/98-274 Davis, Mark; Mansour, Kamal (1998-07-28), Proposed Arabic Script Additions for Minority Languages 
L2/98-409 Davis, Mark; Mansour, Kamal (1998-12-01), Proposal to add 25 Arabic characters to the BMP 
L2/02-021 Davis, Mark; Mansour, Kamal (2002-01-17), Proposal To Amend Arabic repertoire 
L2/09-406 N3686 Proposal to add one character in the Arabic block for representation of Kashmiri and annotation of existing characters, 2008-10-24 
L2/09-176 Aazim, Muzaffar; Mansour, Kamal; Pournader, Roozbeh (2009-04-30), Proposal to add two Kashmiri characters and one annotation to the Arabic block 
L2/09-215 Pournader, Roozbeh; Anderson, Deborah (2009-05-14), Proposal to add two Kashmiri characters 
L2/10-169 Lata, Swaran (2010-05-06), Comments on the Proposed Arabic Letter Kashmiri Yeh 
6.1 U+0604 1 L2/09-144R3 N3734 Pandey, Anshuman (2009-11-20), Proposal to Encode the Samvat Date Sign for Arabic 
6.3 U+061C 1 L2/03-159 Kew, Jonathan (2003-05-28), Proposal to encode Arabic triple dot punctuation mark 
L2/11-005 Allouche, Matitiahu; Mohie, Mohamed (2011-01-16), Proposal to encode an Arabic-Letter Mark (ALM) 
L2/11-016 Moore, Lisa (2011-02-15), "Scripts and Symbols — Arabic letter mark", UTC #126 / L2 #223 Minutes 
L2/11-278 Allouche, Matitiahu; Mohie, Mohamed (2011-07-17), Proposal to encode an Arabic-Letter Mark (ALM) 
L2/11-397 Edberg, Peter (2011-10-25), Proposed addition of AL MARK and LEVEL DIRECTION MARK (PRI #205 background) 
L2/11-398 Edberg, Peter (2011-10-25), Accumulated Feedback on PRI #205 (moderated) 
L2/11-330 N4181 Anderson, Deborah (2011-11-04), Proposed Additions to ISO/IEC 10646 
L2/11-353 Moore, Lisa (2011-11-30), "B.11.18", UTC #129 / L2 #226 Minutes 
L2/11-432R N4180 Allouche, Matitiahu; Mohie, Mohamed (2012-02-15), Proposal to encode the Arabic Letter Mark (ALM) 
L2/13-040 Pournader, Roozbeh; Lanin, Aharon (2013-01-29), Fasttracking Arabic Letter Mark (ALM) 
L2/13-011 Moore, Lisa (2013-02-04), UTC #134 Minutes 
L2/13-240 Davis, Mark (2013-12-12), Reconciling Script and Script_Extensions 
7.0 U+0605 1 L2/09-163R Pandey, Anshuman (2009-09-15), Proposal to Encode Coptic Numerals in ISO/IEC 10646 
L2/10-114 N3786 Pandey, Anshuman (2010-04-10), Towards an Encoding for Coptic Numbers in the UCS 
L2/10-206R N3843R Pandey, Anshuman (2010-06-21), Final Proposal to Encode Coptic Numbers 
L2/10-421R N3958R Pandey, Anshuman (2010-11-01), Request to Rename ‘Coptic Numbers’ to ‘Coptic Epact Numerals’ 
L2/11-062R N3990 Pandey, Anshuman (2011-02-14), Final Proposal to Encode Coptic Epact Numbers 
  1. ^ Proposed code points and characters names may differ from final code points and names

See also[edit]

References[edit]

  1. ^ "Unicode character database". The Unicode Standard. Retrieved 2016-07-09. 
  2. ^ "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2016-07-09. 
  3. ^ The Unicode Consortium. The Unicode Standard, Version 6.0.0, (Mountain View, CA: The Unicode Consortium, 2011. ISBN 978-1-936213-01-6), Chapter 8