Arabic script in Unicode

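As of Unicode 7.0, the Arabic script is contained in the following blocks:

  • Arabic (U+0600–U+06FF)
  • Arabic Supplement (U+0750–U+077F)
  • Arabic Extended-A (U+08A0–U+08FF)
  • Arabic Presentation Forms-A (U+FB50–U+FDFF)
  • Arabic Presentation Forms-B (U+FE70–U+FEFF)
  • Rumi Numeral Symbols (U+10E60–U+10E7F)
  • Arabic Mathematical Alphabetic Symbols (U+1EE00–U+1EEFF)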

The basic Arabic range encodes the standard letters, the most common diacritics and the Arabic-Indic digits, but not contextual forms; U+0621–U+0652 is based directly on ISO 8859-6. The Arabic Supplement range encodes letter variants mostly used for writing African (non-Arabic) languages. The Arabic Extended-A range encodes additional Qur'anic annotations and letter variants used for various non-Arabic languages. The Arabic Presentation Forms-A range encodes contextual forms and ligatures of letter variants needed for Persian, Urdu, Sindhi and Central Asian languages. The Arabic Presentation Forms-B range encodes spacing forms of Arabic diacritics and further contextual letter forms. The presentation forms exist only for compatibility with older standards and are not needed for encoding text.[3] The Arabic Mathematical Alphabetic Symbols block encodes characters used in Arabic mathematical expressions.
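
As a rough illustration of how these ranges partition the script, the sketch below maps a character to the block that contains it. The function name `arabic_block` and the table layout are assumptions made for this example; the ranges are the Unicode 7.0 block boundaries.

```python
# Block boundaries for the Arabic-script blocks as of Unicode 7.0.
ARABIC_BLOCKS = [
    (0x0600, 0x06FF, "Arabic"),
    (0x0750, 0x077F, "Arabic Supplement"),
    (0x08A0, 0x08FF, "Arabic Extended-A"),
    (0xFB50, 0xFDFF, "Arabic Presentation Forms-A"),
    (0xFE70, 0xFEFF, "Arabic Presentation Forms-B"),
    (0x10E60, 0x10E7F, "Rumi Numeral Symbols"),
    (0x1EE00, 0x1EEFF, "Arabic Mathematical Alphabetic Symbols"),
]


def arabic_block(ch):
    """Return the name of the Arabic-script block containing ch, or None."""
    cp = ord(ch)
    for first, last, name in ARABIC_BLOCKS:
        if first <= cp <= last:
            return name
    return None


print(arabic_block("\u0628"))  # BEH, a letter in the basic block
print(arabic_block("\uFE91"))  # BEH INITIAL FORM, a presentation form
print(arabic_block("A"))       # not in any Arabic-script block
```

A linear scan is enough for seven ranges; a production lookup would more likely rely on the block data shipped with the Unicode Character Database.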

Contextual forms

The table below lists the contextual forms of the basic alphabet used in Modern Standard Arabic. Each cell gives the code point followed by the glyph. Letters that do not connect to a following letter have no Middle or Beginning form (shown as -).

General Unicode  Isolated  End  Middle  Beginning  Name
U+0623 أ  U+FE83 أ  U+FE84 ـأ  -  -  ʾalif
U+0628 ب  U+FE8F ب  U+FE90 ـب  U+FE92 ـبـ  U+FE91 بـ  bāʾ
U+062A ت  U+FE95 ت  U+FE96 ـت  U+FE98 ـتـ  U+FE97 تـ  tāʾ
U+062B ث  U+FE99 ث  U+FE9A ـث  U+FE9C ـثـ  U+FE9B ثـ  ṯāʾ
U+062C ج  U+FE9D ج  U+FE9E ـج  U+FEA0 ـجـ  U+FE9F جـ  ǧīm
U+062D ح  U+FEA1 ح  U+FEA2 ـح  U+FEA4 ـحـ  U+FEA3 حـ  ḥāʾ
U+062E خ  U+FEA5 خ  U+FEA6 ـخ  U+FEA8 ـخـ  U+FEA7 خـ  ḫāʾ
U+062F د  U+FEA9 د  U+FEAA ـد  -  -  dāl
U+0630 ذ  U+FEAB ذ  U+FEAC ـذ  -  -  ḏāl
U+0631 ر  U+FEAD ر  U+FEAE ـر  -  -  rāʾ
U+0632 ز  U+FEAF ز  U+FEB0 ـز  -  -  zayn/zāy
U+0633 س  U+FEB1 س  U+FEB2 ـس  U+FEB4 ـسـ  U+FEB3 سـ  sīn
U+0634 ش  U+FEB5 ش  U+FEB6 ـش  U+FEB8 ـشـ  U+FEB7 شـ  šīn
U+0635 ص  U+FEB9 ص  U+FEBA ـص  U+FEBC ـصـ  U+FEBB صـ  ṣād
U+0636 ض  U+FEBD ض  U+FEBE ـض  U+FEC0 ـضـ  U+FEBF ضـ  ḍād
U+0637 ط  U+FEC1 ط  U+FEC2 ـط  U+FEC4 ـطـ  U+FEC3 طـ  ṭāʾ
U+0638 ظ  U+FEC5 ظ  U+FEC6 ـظ  U+FEC8 ـظـ  U+FEC7 ظـ  ẓāʾ
U+0639 ع  U+FEC9 ع  U+FECA ـع  U+FECC ـعـ  U+FECB عـ  ʿayn
U+063A غ  U+FECD غ  U+FECE ـغ  U+FED0 ـغـ  U+FECF غـ  ġayn
U+0641 ف  U+FED1 ف  U+FED2 ـف  U+FED4 ـفـ  U+FED3 فـ  fāʾ
U+0642 ق  U+FED5 ق  U+FED6 ـق  U+FED8 ـقـ  U+FED7 قـ  qāf
U+0643 ك  U+FED9 ك  U+FEDA ـك  U+FEDC ـكـ  U+FEDB كـ  kāf
U+0644 ل  U+FEDD ل  U+FEDE ـل  U+FEE0 ـلـ  U+FEDF لـ  lām
U+0645 م  U+FEE1 م  U+FEE2 ـم  U+FEE4 ـمـ  U+FEE3 مـ  mīm
U+0646 ن  U+FEE5 ن  U+FEE6 ـن  U+FEE8 ـنـ  U+FEE7 نـ  nūn
U+0647 ه  U+FEE9 ه  U+FEEA ـه  U+FEEC ـهـ  U+FEEB هـ  hāʾ
U+0648 و  U+FEED و  U+FEEE ـو  -  -  wāw
U+064A ي  U+FEF1 ي  U+FEF2 ـي  U+FEF4 ـيـ  U+FEF3 يـ  yāʾ
U+0622 آ  U+FE81 آ  U+FE82 ـآ  -  -  ʾalif maddah
U+0629 ة  U+FE93 ة  U+FE94 ـة  -  -  tāʾ marbūṭah
U+0649 ى  U+FEEF ى  U+FEF0 ـى  -  -  ʾalif maqṣūrah
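
The relationship between a presentation-form code point and its base letter is recorded in the Unicode Character Database as a tagged compatibility decomposition. The sketch below reads that tag with Python's standard `unicodedata` module; the helper name `contextual_form` is an assumption made for this example. The database uses the tags isolated, final, medial and initial, which correspond to the Isolated, End, Middle and Beginning columns.

```python
import unicodedata


def contextual_form(ch):
    """Return (form, base) for a presentation-form character, else None.

    The form tag comes from the compatibility decomposition in the
    Unicode Character Database, e.g. '<initial> 0628' for U+FE91.
    """
    fields = unicodedata.decomposition(ch).split()
    # No decomposition, or a canonical one without a '<tag>' prefix.
    if not fields or not fields[0].startswith("<"):
        return None
    form = fields[0].strip("<>")
    base = "".join(chr(int(cp, 16)) for cp in fields[1:])
    return form, base


# The four forms of BEH, in the column order Isolated, End, Middle, Beginning.
for cp in (0xFE8F, 0xFE90, 0xFE92, 0xFE91):
    form, base = contextual_form(chr(cp))
    print(f"U+{cp:04X} {form:8} U+{ord(base):04X}")
```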

Punctuation and ornaments

Only the Arabic question mark ⟨؟⟩ and the Arabic comma ⟨،⟩ are used in regular Arabic script typing. The Arabic comma is sometimes replaced by the comma used in Latin-based scripts, U+002C.

  • U+060C ، arabic comma
  • U+060D ؍ arabic date separator
  • U+060E ؎ arabic poetic verse sign
  • U+060F ؏ arabic sign misra
  • U+061F ؟ arabic question mark
  • U+066D ٭ arabic five pointed star
  • U+06DD ۝ arabic end of ayah
  • U+06DE ۞ arabic start of rub el hizb
  • U+06E9 ۩ arabic place of sajdah
  • U+FD3E ﴾ arabic ornate left parenthesis
  • U+FD3F ﴿ arabic ornate right parenthesis
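
Because the Latin comma is sometimes typed in place of the Arabic one, a text-cleanup step may want to restore the Arabic punctuation. The following is a minimal sketch under that assumption; the mapping table and the helper `to_arabic_punctuation` are hypothetical names, and the function replaces every Latin comma and question mark, including any inside numbers or embedded Latin text.

```python
# Hypothetical mapping for illustration: Latin mark -> Arabic counterpart.
LATIN_TO_ARABIC_PUNCT = str.maketrans({
    ",": "\u060C",  # ARABIC COMMA
    "?": "\u061F",  # ARABIC QUESTION MARK
})


def to_arabic_punctuation(text):
    """Replace the Latin comma and question mark with the Arabic ones."""
    return text.translate(LATIN_TO_ARABIC_PUNCT)


print(to_arabic_punctuation("\u0646\u0639\u0645, \u0644\u0627?"))
```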

Word ligatures

Arabic Presentation Forms-A has a few characters defined as "word ligatures" for terms frequently used in formulaic expressions in Arabic. They are rarely used outside professional liturgical typesetting, and the rial is normally written out in full rather than with the ligature.

  • U+FDF0 arabic ligature salla used as koranic stop sign isolated form (صلے)
  • U+FDF1 arabic ligature qala used as koranic stop sign isolated form (قلے)
  • U+FDF2 arabic ligature allah isolated form (الله)
  • U+FDF3 arabic ligature akbar isolated form (اكبر, as in the phrase الله أكبر Allāhu akbar)
  • U+FDF4 arabic ligature mohammad isolated form (محمد)
  • U+FDF5 arabic ligature salam isolated form (صلعم, the abbreviation for صلى الله عليه وسلم "peace be upon him")
  • U+FDF6 arabic ligature rasoul isolated form (رسول)
  • U+FDF7 arabic ligature alayhe isolated form (عليه)
  • U+FDF8 arabic ligature wasallam isolated form (وسلم)
  • U+FDF9 arabic ligature salla isolated form (صلى)
  • U+FDFA arabic ligature sallallahou alayhe wasallam (صلى الله عليه وسلم "peace be upon him")
  • U+FDFB arabic ligature jallajalalouhou (جل جلاله)
  • U+FDFC rial sign (ريال)
  • U+FDFD arabic ligature bismillah ar-rahman ar-raheem (بسم الله الرحمن الرحيم bism-i llāh-i r-raḥmān-i r-raḥīm)

Code blocks

Note: The National Arabic phonetic alphabets (NAPA) are taken from the following code blocks.

Arabic

Arabic[1][2][3]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+060x  ؀   ؁   ؂   ؃   ؄   ؅  ؆ ؇ ؈ ؉ ؊ ؋ ، ؍ ؎ ؏
U+061x ؐ ؑ ؒ ؓ ؔ ؕ ؖ ؗ ؘ ؙ ؚ ؛  ALM  ؞ ؟
U+062x ؠ ء آ أ ؤ إ ئ ا ب ة ت ث ج ح خ د
U+063x ذ ر ز س ش ص ض ط ظ ع غ ػ ؼ ؽ ؾ ؿ
U+064x ـ ف ق ك ل م ن ه و ى ي ً ٌ ٍ َ ُ
U+065x ِ ّ ْ ٓ ٔ ٕ ٖ ٗ ٘ ٙ ٚ ٛ ٜ ٝ ٞ ٟ
U+066x ٠ ١ ٢ ٣ ٤ ٥ ٦ ٧ ٨ ٩ ٪ ٫ ٬ ٭ ٮ ٯ
U+067x ٰ ٱ ٲ ٳ ٴ ٵ ٶ ٷ ٸ ٹ ٺ ٻ ټ ٽ پ ٿ
U+068x ڀ ځ ڂ ڃ ڄ څ چ ڇ ڈ ډ ڊ ڋ ڌ ڍ ڎ ڏ
U+069x ڐ ڑ ڒ ړ ڔ ڕ ږ ڗ ژ ڙ ښ ڛ ڜ ڝ ڞ ڟ
U+06Ax ڠ ڡ ڢ ڣ ڤ ڥ ڦ ڧ ڨ ک ڪ ګ ڬ ڭ ڮ گ
U+06Bx ڰ ڱ ڲ ڳ ڴ ڵ ڶ ڷ ڸ ڹ ں ڻ ڼ ڽ ھ ڿ
U+06Cx ۀ ہ ۂ ۃ ۄ ۅ ۆ ۇ ۈ ۉ ۊ ۋ ی ۍ ێ ۏ
U+06Dx ې ۑ ے ۓ ۔ ە ۖ ۗ ۘ ۙ ۚ ۛ ۜ  ۝  ۞ ۟
U+06Ex ۠ ۡ ۢ ۣ ۤ ۥ ۦ ۧ ۨ ۩ ۪ ۫ ۬ ۭ ۮ ۯ
U+06Fx ۰ ۱ ۲ ۳ ۴ ۵ ۶ ۷ ۸ ۹ ۺ ۻ ۼ ۽ ۾ ۿ
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points
3.^ Unicode code point U+0673 is deprecated as of Unicode version 6.0
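
A chart row like those above can be regenerated from code points alone. The sketch below is one way to do it with Python's standard `unicodedata` module; the helper name `chart_row` and the use of `.` for unassigned cells are assumptions made for this example. The output depends on the Unicode version bundled with the Python interpreter, so it may include characters assigned after Unicode 7.0.

```python
import unicodedata


def chart_row(row_start):
    """Format one 16-cell chart row; row_start must be a multiple of 16."""
    cells = []
    for cp in range(row_start, row_start + 16):
        ch = chr(cp)
        # Code points with no name in this Python's Unicode database are
        # shown as '.', standing in for the grey cells of the chart.
        cells.append(ch if unicodedata.name(ch, None) else ".")
    label = f"U+{row_start >> 4:03X}x"
    return label + " " + " ".join(cells)


print("       0 1 2 3 4 5 6 7 8 9 A B C D E F")
for start in range(0x0620, 0x0650, 16):
    print(chart_row(start))
```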

Arabic Supplement

Arabic Supplement[1]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+075x ݐ ݑ ݒ ݓ ݔ ݕ ݖ ݗ ݘ ݙ ݚ ݛ ݜ ݝ ݞ ݟ
U+076x ݠ ݡ ݢ ݣ ݤ ݥ ݦ ݧ ݨ ݩ ݪ ݫ ݬ ݭ ݮ ݯ
U+077x ݰ ݱ ݲ ݳ ݴ ݵ ݶ ݷ ݸ ݹ ݺ ݻ ݼ ݽ ݾ ݿ
Notes
1.^ As of Unicode version 7.0

Arabic Extended-A

Arabic Extended-A[1][2]
Official Unicode Consortium code chart (PDF)
(Rows U+08Ax to U+08Fx, covering U+08A0–U+08FF; the glyph cells are not preserved here.)
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points

Arabic Presentation Forms-A

Most characters in this block are ligatures that can be composed from characters in the previous charts; the exceptions are the bracket-like graphemes ﴾ ﴿. Some are ligatures of common liturgical phrases.

Arabic Presentation Forms-A[1][2][3]
Official Unicode Consortium code chart (PDF)
(Rows U+FB5x to U+FDFx, covering U+FB50–U+FDFF; the glyph cells are not preserved here.)
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points
3.^ Black areas indicate noncharacters - Unicode will never use these codepoints for encoding characters
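
The noncharacters mentioned in note 3 occupy U+FDD0–U+FDEF inside this block; the remaining noncharacters are the last two code points of each plane. A minimal sketch of a check for them follows; the helper name `is_noncharacter` is an assumption made for this example.

```python
def is_noncharacter(cp):
    """True if code point cp is one of Unicode's noncharacters."""
    # 32 noncharacters sit inside Arabic Presentation Forms-A.
    if 0xFDD0 <= cp <= 0xFDEF:
        return True
    # The last two code points of each of the 17 planes (U+xxFFFE, U+xxFFFF).
    return 0 <= cp <= 0x10FFFF and (cp & 0xFFFE) == 0xFFFE


# Count noncharacters over the whole code space: 32 + 2 * 17.
print(sum(is_noncharacter(cp) for cp in range(0x110000)))
```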

Arabic Presentation Forms-B

All characters in this block can be produced from the characters of the basic Arabic chart.
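
Since every character in this block maps back to basic Arabic characters, legacy text that stores presentation forms can be folded to the basic block. The sketch below does this only for the range U+FE70–U+FEFC, leaving all other characters untouched; the helper name `fold_forms_b` and the choice to normalize one character at a time are assumptions made for this example.

```python
import unicodedata


def fold_forms_b(text):
    """Replace Presentation Forms-B characters with their basic equivalents.

    Characters outside U+FE70..U+FEFC are left untouched, unlike a
    blanket NFKC pass over the whole string.
    """
    out = []
    for ch in text:
        if 0xFE70 <= ord(ch) <= 0xFEFC:
            out.append(unicodedata.normalize("NFKC", ch))
        else:
            out.append(ch)
    return "".join(out)


legacy = "\uFE91\uFE92\uFE90"  # BEH in initial, medial and final form
print([f"U+{ord(c):04X}" for c in fold_forms_b(legacy)])

lam_alef = "\uFEFB"            # LAM WITH ALEF ligature, isolated form
print([f"U+{ord(c):04X}" for c in fold_forms_b(lam_alef)])
```

After folding, a shaping engine chooses the contextual glyphs again at display time, which is why the presentation forms are not needed in stored text.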

Arabic Presentation Forms-B[1][2]
Official Unicode Consortium code chart (PDF)
(Rows U+FE7x to U+FEFx, covering U+FE70–U+FEFF; the glyph cells are not preserved here. The final code point, U+FEFF, is ZWNBSP.)
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points

Rumi Numeral Symbols

Rumi Numeral Symbols[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+10E6x 𐹠 𐹡 𐹢 𐹣 𐹤 𐹥 𐹦 𐹧 𐹨 𐹩 𐹪 𐹫 𐹬 𐹭 𐹮 𐹯
U+10E7x 𐹰 𐹱 𐹲 𐹳 𐹴 𐹵 𐹶 𐹷 𐹸 𐹹 𐹺 𐹻 𐹼 𐹽 𐹾
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points

Arabic Mathematical Alphabetic Symbols

Arabic Mathematical Alphabetic Symbols[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+1EE0x 𞸀 𞸁 𞸂 𞸃 𞸅 𞸆 𞸇 𞸈 𞸉 𞸊 𞸋 𞸌 𞸍 𞸎 𞸏
U+1EE1x 𞸐 𞸑 𞸒 𞸓 𞸔 𞸕 𞸖 𞸗 𞸘 𞸙 𞸚 𞸛 𞸜 𞸝 𞸞 𞸟
U+1EE2x 𞸡 𞸢 𞸤 𞸧 𞸩 𞸪 𞸫 𞸬 𞸭 𞸮 𞸯
U+1EE3x 𞸰 𞸱 𞸲 𞸴 𞸵 𞸶 𞸷 𞸹 𞸻
U+1EE4x 𞹂 𞹇 𞹉 𞹋 𞹍 𞹎 𞹏
U+1EE5x 𞹑 𞹒 𞹔 𞹗 𞹙 𞹛 𞹝 𞹟
U+1EE6x 𞹡 𞹢 𞹤 𞹧 𞹨 𞹩 𞹪 𞹬 𞹭 𞹮 𞹯
U+1EE7x 𞹰 𞹱 𞹲 𞹴 𞹵 𞹶 𞹷 𞹹 𞹺 𞹻 𞹼 𞹾
U+1EE8x 𞺀 𞺁 𞺂 𞺃 𞺄 𞺅 𞺆 𞺇 𞺈 𞺉 𞺋 𞺌 𞺍 𞺎 𞺏
U+1EE9x 𞺐 𞺑 𞺒 𞺓 𞺔 𞺕 𞺖 𞺗 𞺘 𞺙 𞺚 𞺛
U+1EEAx 𞺡 𞺢 𞺣 𞺥 𞺦 𞺧 𞺨 𞺩 𞺫 𞺬 𞺭 𞺮 𞺯
U+1EEBx 𞺰 𞺱 𞺲 𞺳 𞺴 𞺵 𞺶 𞺷 𞺸 𞺹 𞺺 𞺻
U+1EECx
U+1EEDx
U+1EEEx
U+1EEFx 𞻰 𞻱
Notes
1.^ As of Unicode version 7.0
2.^ Grey areas indicate non-assigned code points

References

  1. ^ Unicode v6.1 (UAX#41): Scripts
  2. ^ "Arabic Mathematical Alphabetic Symbols". 2012-02-01.
  3. ^ The Unicode Consortium. The Unicode Standard, Version 6.0.0 (Mountain View, CA: The Unicode Consortium, 2011), ISBN 978-1-936213-01-6, Chapter 8.

External links