Devanagari: Difference between revisions

Devanāgarī
देवनागरी
Devanāgarī ; देवनागरी;
	Rigveda manuscript in Devanāgarī (early 19th century)
Script type	Abugida
Time period	c. 1200–present
Direction	Left-to-right
Region	India and Nepal
Languages	Several Indo-Aryan languages, including Sanskrit, Hindi, Marathi, Nepali, Bihari, Bhili, Konkani, Bhojpuri, Newari and sometimes Sindhi and Kashmiri
Related scripts
Parent systems	Proto-Canaanite alphabet [a] Phoenician alphabet [a] Aramaic alphabet [a] Brāhmī Gupta Nāgarī Devanāgarī ; देवनागरी; ; ; ; ; ;
Child systems	Gujarati; Moḍī; Ranjana; Canadian Aboriginal Syllabics
Sister systems	Eastern Nāgarī
ISO 15924
ISO 15924	Deva (315), Devanagari (Nagari)
Unicode
Unicode alias	Devanagari
Unicode range	U+0900–U+097F
	[a] The Semitic origin of the Brahmic scripts is not universally agreed upon.
	This article contains phonetic transcriptions in the International Phonetic Alphabet (IPA). For an introductory guide on IPA symbols, see Help:IPA. For the distinction between [ ], / / and ⟨ ⟩, see IPA § Brackets and transcription delimiters.

Browse history interactively

← Previous edit Next edit →

Content deleted Content added

VisualWikitext

Inline

Revision as of 22:35, 26 March 2008

Template:IndicTextRight Devanāgarī (देवनागरी, Template:PronEng in English) is an abugida script. It is the main script used to write the Hindi, Marathi, and Nepali languages. Since the 19th century, it has become the most common script used to represent Sanskrit. Other languages using Devanagari (although not always as their only or principal script) include Sindhi, Bihari, Bhili, Marwari, Konkani, Bhojpuri, Pahari (Garhwali and Kumaoni), Santhali, Newari, Tharu, and Kashmiri. It is written and read from left to right.

Origins

Devanāgarī emerged around CE 1200 out of the Siddham script, gradually replacing the earlier, closely related Sharada script (which remained in parallel use in Kashmir). Both are immediate descendants of the Gupta script, ultimately deriving from the Brāhmī script attested from the 3rd century BCE; Nāgarī appeared in approx. the 8th century as an eastern variant of the Gupta script, contemporary to Sharada, its western variant. The descendants of Brahmi form the Brahmic family, including the alphabets employed for many other South and South-East Asian languages.

Sanskrit nāgarī is the feminine of nāgara "urban(e)", an adjectival vrddhi derivative from nagara "city"; the feminine form is used because of its original application to qualify the feminine noun lipi "script" ("urban(e) script", i.e. the script of the cultured). There were several varieties in use, one of which was distinguished by affixing deva "deity" to form a tatpurusha compound meaning the "urban(e) [script] of the deities (= gods)", i.e. "divine urban(e) [script]".

The widespread use of the name Devanāgarī is relatively recent; well into the twentieth century, and even today, simply Nāgarī was also in use for this same script. The rapid spread of the usage of Devanāgarī seems also to be connected with the almost exclusive use of this script in colonial times to publish works in Sanskrit, even though traditionally nearly all indigenous scripts had been employed for this language. This has led to the establishment of such a close connection between the script and Sanskrit that it is, erroneously, widely regarded as "the Sanskrit script" today.

Principles

As a Brahmic abugida, the fundamental principle of the Devanagari writing system is that each base consonantal character carries within it an inherent vowel a [ə]. That is, an unmarked consonant sign is assumed to represent that consonant plus the inherent vowel;^[1] e.g. क ka, कन kana, कनय kanaya, etc. Flowing from this core feature are a number of other features.

Consonant clusters lacking intervening vowels may be represented by physically joined and condensed ligatures or "conjuncts" (saṃyuktākṣara); e.g. कनय kanaya → क्नय knaya, कन्य kanya, क्न्य knya.
For postconsonantal vowels other than inherent a, the consonant symbol is applied with diacritics; e.g. क ka → के ke, कु ku, की kī, का kā.
- For non-postconsonantal vowels (initial and post-vocalic positions), there are full-formed characters. Thus ū is ू in कू kū but ऊ in ऊक ūka and कऊ kaū.
The diacritic ્, called the virāma or halanta, indicates cancellation of the inherent vowel; e.g. क्नय knaya → क्नय् knay.

Thus the basic unit is the graphic symbol or akṣara, with phonetic structures V or (C)(C)(C)(C)C(V). Finally, Devanagari is written from left to right, lacks distinct cases, and possesses a horizontal line running above the characters, linking them together.

Symbols

This article contains phonetic transcriptions in the International Phonetic Alphabet (IPA). For an introductory guide on IPA symbols, see Help:IPA. For the distinction between [ ], / / and ⟨ ⟩, see IPA § Brackets and transcription delimiters.

Devanagari, like nearly all Brahmi scripts, is ordered based on phonetic principles, considering both manner and place of articulation, as per Sanskrit and its grammatical tradition (cf. Vyakarana). Indeed, Devanagari as used for Sanskrit serves as the prototype for its application, with minor variations or additions, to other languages.^[2] The below two tables derive from Wikner (1996:13, 14, 73). This arrangement is usually referred to as the varṇamālā "garland of letters".^[3]

Vowels

	Independent form	Romanized	As diacritic with प	Independent form	Romanized	As diacritic with प
kaṇṭhya (Guttural)	अ	a	प	आ	ā	पा
tālavya (Palatal)	इ	i	पि	ई	ī	पी
oṣṭhya (Labial)	उ	u	पु	ऊ	ū	पू
mūrdhanya (Cerebral)	ऋ	‹ The template below (Deleted template) is being considered for deletion. See templates for discussion to help reach a consensus. › ṛ	पृ	ॠ	ṝ	पॄ
dantya (Dental)	ऌ	ḷ	पॢ	ॡ	ḹ	पॣ
kaṇṭhatālavya (Palato-Guttural)	ए	ē	पे	ऐ	ai	पै
kaṇṭhoṣṭhya (Labio-Guttural)	ओ	ō	पो	औ	au	पौ

Also organized with these vowels are two non-vowel diacritics, the anusvāra

ं ṃ and the visarga

ः

ḥ (recited as

अं

aṃ and

अः aḥ). In regard to Sanskrit, Masica (1991:146) notes of the anusvāra that "there is some controversy as to whether it represents a homorganic nasal consonant [...], a nasalized vowel, a nasalized semivowel, or all these according to context". The visarga represents post-vocalic voiceless glottal fricative [h], an allophone in Sanskrit of s, or less commonly r, usually in word-final position. Some traditions append an echo of the vowel after the breath;^[4] e.g. इः [ihⁱ]. Masica (1991:146) considers the visarga along with symbols ङ ṅa and ञ ña for the "largely predictable" velar and palatal nasals to be examples of "phonetic overkill in the system".

Another associated symbol is the candrabindu/anunāsika

ँ

. Salomon (2003:76–77) notes it as a "more emphatic form" of the anusvāra, "sometimes [...] used to mark a true [vowel] nasalization". In a New Indo-Aryan language such as Hindi the distinction is clear and formal: the candrabindu indicates vowel nasalization^[5] while the anusvār indicates a homorganic nasal consonant;^[6] e.g.

हँसी

[ɦə̃si] "laughter,

गंगा

[gəŋgɑ] "Ganges". However, when a syllable has a vowel sign above the top line that leaves no room for the candrabindu's candra ("moon") portion, then it is dispensed with in favour of the lone dot;^[7] e.g.

हूँ

[ɦũ] "am", but

हैं [ɦɛ̃] "are". Finally, some writers and typesetters dispense with the "moon" altogether, using the dot alone all of the time.^[8]

The avagraha

ऽ

(usually transliterated with an apostrophe) is a punctuational sign indicating the elision or coalescence of a vowel in Sanskrit as a result of sandhi; e.g.

एकोऽयम

ekoyam (< ekas + ayam) "this one". An original long vowel lost by coalescence is sometimes indicated by a double avagraha; e.g.

सदाऽऽत्मा

sadātmā (< sadā + ātmā) "always, the self".^[9] In Hindi, Snell (2000:77) states that its "main function is to show that a vowel is sustained in a cry or a shout"; e.g.

आईऽऽऽ!

āīīī!. In Magahi, Verma (2003:501) explains that it is used to mark the non-elision of word-final inherent a. Word-final a-elision is a modern orthographic convention/assumption, and Magahi, which has "quite a number of verbal forms [that] end in that inherent vowel", deals with it as such; e.g.

बइठऽ

baiṭha "sit" versus

*

बइठ baiṭh

The vowels ṝ, ḷ, and ḹ are Sanskrit-specific and not included in the varṇamālās of other languages. With the sound represented by ṛ having been lost as well, it now elicits pronunciations ranging from [ɾɪ] (Hindi) to [ɾu] (Marathi).
ḹ is not actually a primary phoneme of Sanskrit, but is usually included among the vowels in order to maintain the symmetry of short/long pairs.^[2]
There are non-regular formations of रु ru and रू rū.

Consonants

	sparśa (Stop)								anunāsika (Nasal)		antaḥstha (Semivowel)		ūṣman (Fricative)
Voicing →	aghoṣa				ghoṣa								aghoṣa		ghoṣa
Aspiration →	alpaprāṇa		mahāprāṇa		alpaprāṇa		mahāprāṇa		alpaprāṇa				mahāprāṇa
kaṇṭhya (Guttural)	क	ka	ख	kha	ग	ga	घ	gha	ङ	‹ The template below (Deleted template) is being considered for deletion. See templates for discussion to help reach a consensus. › ṅa					ह	‹ The template below (Deleted template) is being considered for deletion. See templates for discussion to help reach a consensus. › ha
tālavya (Palatal)	च	ca	छ	cha	ज	ja	झ	jha	ञ	ña	य	ya	श	śa
mūrdhanya (Cerebral)	ट	‹ The template below (Deleted template) is being considered for deletion. See templates for discussion to help reach a consensus. › ṭa	ठ	‹ The template below (Deleted template) is being considered for deletion. See templates for discussion to help reach a consensus. › ṭha	ड	‹ The template below (Deleted template) is being considered for deletion. See templates for discussion to help reach a consensus. › ḍa	ढ	‹ The template below (Deleted template) is being considered for deletion. See templates for discussion to help reach a consensus. › ḍha	ण	‹ The template below (Deleted template) is being considered for deletion. See templates for discussion to help reach a consensus. › ṇa	र	ra	ष	‹ The template below (Deleted template) is being considered for deletion. See templates for discussion to help reach a consensus. › ṣa
dantya (Dental)	त	ta	थ	tha	द	da	ध	dha	न	na	ल	la	स	sa
oṣṭhya (Labial)	प	pa	फ	pha	ब	ba	भ	bha	म	ma	व	va

Rounding this out where applicable is ळ ḷa, which represented the intervocalic lateral flap allophone of the voiced retroflex stop in Vedic Sanskrit, and which is a phoneme in languages such as Marathi and Rajasthani.
Beyond the Sanskritic set new shapes have rarely been formulated. Masica (1991:146) harvcoltxt error: no target: CITEREFMasica1991 (help) offers the following, "In any case, according to some, all possible sounds had already been described and provided for in this system, as Sanskrit was the original and perfect language. Hence it was difficult to provide for or even to conceive other sounds, unknown to the phoneticians of Sanskrit." Where foreign borrowings and internal developments did inevitably accrue and arise in New Indo-Aryan languages, they have been either ignored in writing, or dealt through means such as diacritics and ligatures (ignored in recitation).
- The most prolific diacritic has been the subscript nuqtā ़. Hindi uses it for the Persian sounds क़ qa, ख़ xa, ग़ ġa, ज़ za, and फ़ fa, and for the allophonic developments ड़ [[Retroflex flap|ṛ]]a and ढ़ ṛha.
- Sindhi's implosives are accommodated with underlining: ग [ɠə], ज [ʄə], ड [ɗə], ब [ɓə].
- Aspirated sonorants may be represented as conjuncts/ligatures with ह ha: म्ह mha, न्ह nha, ण्ह ṇha, व्ह vha, ल्ह lha, ळ्ह ḷha.
- Masica (1991:147) notes Marwari as using a special symbol for ḍa [ɗə] (while ड = [ɽə]).

Conjuncts

File:JanaSanskritSans ddhrya.png

The ddhrya-ligature (द्ध्र्य) of JanaSanskritSans.

You will only be able to see the ligatures if your system has a Unicode font installed that includes the required ligature glyphs (e.g. one of the TDIL fonts, see "external links" below).

As mentioned, successive consonants lacking a vowel in between them may physically join together as a 'conjunct' or ligature. The government of these clusters ranges from widely to narrowly applicable rules, with special exceptions within. While standardized for the most part, there are certain variations in clustering, of which the Unicode used on this page is just one scheme. The following are a number of rules —

24 out of the 36 consonants contain a vertical right stroke (ख, घ, ण etc.). As first or middle fragments/members of a cluster, they lose that stroke. e.g. त + व = त्व, ण + ढ = ण्ढ, स + थ = स्थ. श ś(a) appears as a different, simple ribbon-shaped fragment preceding व va, न na, च ca, ल la, and र ra, squishing down these second members. Thus श्व śva, श्न śna, श्च śca श्ल śla, and श्र śra.

र r(a) as a first member it takes the form of a curved upward dash above the final character or its ā-diacritic. e.g. र्व rva, र्वा rvā, र्स्प rspa, र्स्पा rspā. As a final member with ट ठ ड ढ ङ छ it is two lines below the character, pointed downwards and apart. Thus ट्र ठ्र ड्र ढ्र ङ्र छ्र. Elsewhere as a final member it is a diagonal stroke jutting leftwards and down. e.g. क्र ग्र भ्र. त ta is shifted up to make त्र tra.

As first members, remaining vertical stroke-less characters such as द d(a) and ह h(a) may have their second member, shrunken and minus its horizontal stroke, placed underneath. क k(a), छ ch(a), and फ ph(a) shorten their right hooks and join them directly to the following member.

Displayed then in the following table are all the viable symbols for the biconsonantal clusters of Sanskrit as listed in Masica (1991:161–162). Scroll your cursor over the conjuncts to reveal their romanizations.

	क	ख	ग	घ	च	छ	ज	झ	ञ	ट	ठ	ड	ढ	ण	त	थ	द	ध	न	प	फ	ब	भ	म	य	र	ल	व	श	ष	स	ह
क	kk													kṇ	kt	kth			kn					km	ky	kr	kl	kv		kṣ
ख																									khy
ग			gg				gj											gdh	gn					gm		gr	gl
घ																			ghn					ghm		ghr
च					cc	cch			jñ																cy
ज							jj	jjh	jñ															jm	jy	jr		jv
ड																										ḍr
ण										ṇṭ	ṇṭh	ṇḍ	ṇḍh	ṇṇ										ṇm	ṇy			ṇv
त	tk	tkh													tt	tth			tn	tp	tph			tm	ty	tr		tv			ts
थ																			thn						thy
द			dg	dgh													dd	ddh	dn			db	dbh	dm	dy	dr		dv
ध																			dhn					dhm	dhy	dhr		dhv
न															nt	nth	nd	ndh	nn					nm	ny			nv
प															pt				pn	pp	pph				py	pr	pl				ps
ब							bj										bd	bdh				bb				br
भ																										bhr
म																			mn	mp	mph	mb	mbh	mm	my	mr	ml
य																								yy
र	rk	rkh	rg	rgh	rc	rch	rj	rjh						rṇ	rt	rth	rd	rdh	rn	rp		rb	rbh	rm	ry		rl	rv	rś	rṣ	rs	rh
ल	lk		lg														ld			lp	lph	lb		lm	ly		ll	lv				lh
व																									vy	vr		vv
श					śc														śn					śm	śy	śr	śl	śv
ष	ṣk									ṣṭ	ṣṭh			ṣṇ						ṣp	ṣph			ṣm	ṣy			ṣv
स	sk	skh													st	sth			sn	sp	sph			sm	sy	sr		sv
ह														hṇ					hn					hm	hy	hr		hv

New Indo-Aryan languages may use the above forms for their Sanskrit loanwords (or otherwise).

Accent marks

The pitch accent of Vedic Sanskrit is written with various symbols depending on shakha. In the Rigveda, anudatta is written with a bar below the line (॒), svarita with a stroke above the line (॑) while udatta is unmarked.

Numerals

**Devanāgarī numerals**
०	१	२	३	४	५	६	७	८	९
0	1	2	3	4	5	6	7	8	9

Transliteration

There are several methods of transliteration from Devanāgarī into Roman scripts. The most widely used transliteration method is IAST. However, there are other transliteration options.

The following are the major transliteration methods for Devanāgarī:

ISO 15919

A standard transliteration convention was codified in the ISO 15919 standard of 2001. It uses diacritics to map the much larger set of Brahmic graphemes to the Latin script. See also Transliteration of Indic scripts: how to use ISO 15919. The Devanagari-specific portion is nearly identical to the academic standard for Sanskrit, IAST.

IAST

The International Alphabet of Sanskrit Transliteration (IAST) is the academic standard for the romanization of Sanskrit. IAST is the de-facto standard used in printed publications, like books and magazines, and with the wider availability of Unicode fonts, it is also increasingly used for electronic texts. It is based on a standard established by the Congress of Orientalists at Athens in 1912.

The National Library at Kolkata romanization, intended for the romanization of all Indic scripts, is an extension of IAST.

Harvard-Kyoto

Compared to IAST, Harvard-Kyoto looks much simpler. It does not contain all the diacritic marks that IAST contains. This makes typing in Harvard-Kyoto much easier than IAST. Harvard-Kyoto uses capital letters that can be difficult to read in the middle of words.

ITRANS

ITRANS is a lossless transliteration scheme of Devanāgarī into ASCII that is widely used on Usenet. It is an extension of the Harvard-Kyoto scheme. In ITRANS, the word Devanāgarī is written as "devanaagarii". ITRANS is associated with an application of the same name that enables typesetting in Indic scripts. The user inputs in Roman letters and the ITRANS pre-processor displays the Roman letters into Devanāgarī (or other Indic languages). The latest version of ITRANS is version 5.30 released in July, 2001.

ALA-LC Romanization

ALA-LC romanization is a transliteration scheme approved by the Library of Congress and the American Library Association, and widely used in North American libraries. Transliteration tables are based on languages, so there is a table for Hindi, one for Sanskrit and Prakrit, etc.

Encodings

ISCII

ISCII is a fixed-length 8-bit encoding. The lower 128 codepoints are plain ASCII, the upper 128 codepoints are ISCII-specific.

It has been designed for representing not only Devanāgarī, but also various other Indic scripts as well as a Latin-based script with diacritic marks used for transliteration of the Indic scripts.

ISCII has largely been superseded by Unicode, which has however attempted to preserve the ISCII layout for its Indic language blocks.

Devanāgarī in Unicode

The Unicode range for Devanāgarī is U+0900 .. U+097F. Grey areas indicate non-assigned code points.

Devanagari^[1] Official Unicode Consortium code chart (PDF)
	0	1	2	3	4	5	6	7	8	9	A	B	C	D	E	F
U+090x	ऀ	ँ	ं	ः	ऄ	अ	आ	इ	ई	उ	ऊ	ऋ	ऌ	ऍ	ऎ	ए
U+091x	ऐ	ऑ	ऒ	ओ	औ	क	ख	ग	घ	ङ	च	छ	ज	झ	ञ	ट
U+092x	ठ	ड	ढ	ण	त	थ	द	ध	न	ऩ	प	फ	ब	भ	म	य
U+093x	र	ऱ	ल	ळ	ऴ	व	श	ष	स	ह	ऺ	ऻ	़	ऽ	ा	ि
U+094x	ी	ु	ू	ृ	ॄ	ॅ	ॆ	े	ै	ॉ	ॊ	ो	ौ	्	ॎ	ॏ
U+095x	ॐ	॑	॒	॓	॔	ॕ	ॖ	ॗ	क़	ख़	ग़	ज़	ड़	ढ़	फ़	य़
U+096x	ॠ	ॡ	ॢ	ॣ	।	॥	०	१	२	३	४	५	६	७	८	९
U+097x	॰	ॱ	ॲ	ॳ	ॴ	ॵ	ॶ	ॷ	ॸ	ॹ	ॺ	ॻ	ॼ	ॽ	ॾ	ॿ
Notes 1.^ As of Unicode version 15.1

Devanāgarī Keyboard Layouts

Devanāgarī and Devanāgarī-QWERTY keyboard layouts for Mac OS X

The Mac OS X operating system supports convenient editing for the Devanāgarī script by insertion of appropriate Unicode characters with two different keyboard layouts available for use. To input Devanāgarī text, one goes to System Preferences → International → Input Menu and enables the keyboard layout that is to be used. The layout is the same as for INSCRIPT/KDE Linux:

INSCRIPT / KDE Linux

This is the India keyboard layout for Linux (variant 'deva')

Typewriter

Phonetic

See Bolnagri Home Page

Software

Apple Type Services for Unicode Imaging - Macintosh
HindiWriter - The Phonetic Hindi Writer with AutoWord lookup and Spellcheck for MS Word and OpenOffice.org for Windows.
Pango - open source (GNOME)
Uniscribe - Windows
WorldScript - Macintosh, replaced by the Apple Type Services for Unicode Imaging, mentioned above
Baraha - Devanāgarī Input using English Keyboard
Lipikaar - The indic script typing tool with support for Devanagari through a Windows desktop executable or Firefox Extension.

References

Bibliography

External links

Electronic typesetting

Fonts

Unicode Compliant Open Type Fonts including ligature glyphs (TDIL Data Centre)

Documentation

Tools and applications

List of Hindi Typing Tools
IndiX, Indian language support for Linux, a site by the Indian National Centre for Software Technology
Devanāgarī Tools: Wiki Sandbox, Devanagari Mail, Yahoo/Google Search & Devanāgarī Transliteration
EnTrans - Entrans is an online, collaborative translation tool

[1] Salomon (2003:70)

[Salomon2003p75-2] Salomon (2003:75)

[3] Salomon (2003:71)

[4] Wikner (1996:6)

[5] Snell (2000:44–45)

[6] Snell (2000:64)

[7] Snell (2000:45)

[8] Snell (2000:46)

[9] Salomon (2003:77)

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

@@ Line 456: / Line 456: @@
 ===INSCRIPT / KDE Linux ===
-[[Image:Devanagari INSCRIPT2.png|800px|none|INSCRIPT Keyboard Layout (Windows, Solaris, Java)]]
+<!-- Deleted image removed: [[Image:Devanagari INSCRIPT2.png|800px|none|INSCRIPT Keyboard Layout (Windows, Solaris, Java)]] -->
 This is the India keyboard layout for Linux (variant 'deva')

v t e Marathi
Grammar Phonology Literature Poetry Samyukta Maharashtra Movement Maharashtra State Marathi Wikipedia
Marathi dialects and Marathic languages	Maharastri Prakrit Andh Deccani East Indian (Mumbai dialect) Judeo-Marathi Katkari Maharashtrian Konkani Malvani Konkani Phudagi Samvedi Thanjavur Marathi Varhadi (Varhadi-Nagpuri) Varli
Marathi scripts	Balbodh style of Devanagari Braille Modi
Other	Marathi Cinema Marathi theatre Literature Marathi people (List)
Arts	Awards Sahitya Akademi Award Jnanpith Award Writers Poets National Film Award Maharashtra

v t e Writing systems
Index of language articles
Overview	Language History of writing History of the alphabet Graphemes Scripts in Unicode
Lists	Writing systems Languages by writing system / by first written account Ancient languages corpuses by size Undeciphered writing systems Creators of writing systems
Types	Abjads Abugidas Alphabets Featural Ideogrammic Logographic Numeral Phonogrammic Pictographic Semi-syllabaries Shorthand Syllabaries
Current examples	Arabic Canadian syllabics Chinese Devanagari Hangul Kana Latin Mongolian
Related topics	In Africa In Southeast Asia