Hindustani orthography

From Wikipedia, the free encyclopedia
Jump to: navigation, search

Hindustani (Standard Hindi and Urdu) has been written in several different scripts. Most Hindi texts are written in the Devanagari script, which is derived from the Brāhmī script of Ancient India. Most Urdu texts are written in the Urdu alphabet, which comes from the Persian alphabet. Hindustani has been written in both scripts. In recent years the Latin script has been used in these languages for technological or internationalization reasons.

Devanagari script[edit]

Main article: Devanagari script

The Devanagari script is an abugida, as written consonants have an inherent vowel, which in Standard Hindi is a schwa. In certain contexts, such as at the end of words, there is no vowel, a phenomenon called the schwa syncope.[1] Other vowels are written with a diacritic on the consonant letter. Devanagari is written from left to right, with a top-bar connecting the letters together.

ə ɪ ʊ ɛː ɔː
k x ɡ ɣ ɡʱ ŋ
t͡ʃ t͡ʃʰ d͡ʒ z d͡ʒʱ ɲ
ʈ ʈʰ ɖ ɽ ɖʱ ɽʱ ɳ
t̪ʰ d̪ʱ n
p f b m
j r l ʋ ʃ ʂ s h

क्ष kṣ is pronounced /kʃ/ and ज्ञ is /ɡj/.

Schwa deletion[edit]

The schwa (अ or 'ə', sometimes written 'a') implicit in each consonant of the Devanagri script is "obligatorily deleted" in Hindi at the end of words and in certain other contexts.[2] This phenomenon has been termed the "schwa syncope rule" or the "schwa deletion rule" of Hindi.[1][2] One formalization of this rule has been summarized as ə -> ø | VC_CV. In other words, when a vowel-preceded consonant is followed by a vowel-succeeded consonant, the schwa inherent in the first consonant is deleted.[1][3] However, this formalization is inexact and incomplete (i.e. sometimes deletes a schwa when it shouldn't or, at other times, fails to delete it when it should), and can yield errors. Schwa deletion is computationally important because it is essential to building text-to-speech software for Hindi.[3][4]

As a result of schwa syncope, the correct Hindi pronunciation of many words differs from that expected from a literal rendering of Devanagari. For instance, राम is Rām (incorrect: Rāma), रचना is Rachnā (incorrect: Rachanā), वेद is Véd (incorrect: Véda) and नमकीन is Namkeen (incorrect Namakeen).[3][4]

Persian script[edit]

Main article: Urdu alphabet

The Urdu alphabet is based on the Persian, which is an Arabic alphabet. Urdu is written from right to left, and most letters link together. This leads to variations in the form of a letter depending on its position in a word. Most vowels are omitted in generic texts, although they may be written for disambiguation or for pedagogical purposes. Urdu is primarily written in a calligraphic style of the script called Nasta'liq.

Letter Name of letter Transcription IPA
ا alif - -
ب be b /b/
پ pe p /p/
ت te t /t̪/
ٹ ṭe /ʈ/
ث se s /s/
ج jīm j /d͡ʒ/
چ che ch /t͡ʃ/
ح baṛī he h /h/
خ khe kh /x/
د dāl d /d̪/
ڈ ḍāl /ɖ/
ذ zāl dh /z/
ر re r /r/
ڑ ṛe /ɽ/
ز ze z /z/
ژ zhe zh /ʒ/
س sīn s /s/
ش shīn sh /ʃ/
ص su'ād /s/
ض zu'ād /z/
ط to'e t /t/
ظ zo'e /z/
ع ‘ain ' -
غ ghain gh /ɣ/
ف fe f /f/
ق qāf q /q/
ک kāf k /k/
گ gāf g /ɡ/
ل lām l /l/
م mīm m /m/
ن nūn n /n/
و vā'o v, o, or ū /ʋ/, /oː/, /ɔ/ or /uː/
ہ, ﮩ, ﮨ choṭī he h /h/
ھ do chashmī he h /ʰ/
ء hamza ' /ʔ/
ی ye y, i /j/ or /iː/
ے bari ye ai or e /ɛː/, or /eː/

Romanized Hindustani[edit]

The Latin alphabet has been used to write Hindustani for technological or internationalization reasons. Roman Urdu uses the basic Latin alphabet. It is most commonly used by young native speakers for technological applications, such as chat, emails and SMS.

ITRANS, ISCII, IAST, and Harvard-Kyoto romanization schemes have been employed primarily for usage by non-native speakers who are more familiar with the Latin alphabet.

See also: Roman Urdu

Braille script[edit]

Main articles: Hindi Braille and Urdu Braille

Three braille alphabets are used: Hindi and Urdu braille in India, based on Bharati braille conventions, and Urdu Braille in Pakistan, based on Persian Braille conventions. Hindi Braille is an alphabet with a not written in some environments, while for Urdu Braille in Pakistan, it seems that vowels may be optional as they are in print.

See also[edit]

References[edit]

  1. ^ a b c Tej K. Bhatia (1987), A history of the Hindi grammatical tradition: Hindi-Hindustani grammar, grammarians, history and problems, BRILL, ISBN 90-04-07924-6, ... Hindi literature fails as a reliable indicator of the actual pronunciation because it is written in the Devanagari script ... the schwa syncope rule which operates in Hindi ... 
  2. ^ a b Larry M. Hyman, Victoria Fromkin, Charles N. Li (1988 (Volume 1988, Part 2)), Language, speech, and mind, Taylor & Francis, ISBN 0-415-00311-3, ... The implicit /a/ is not read when the symbol appears in word-final position or in certain other contexts where it is obligatorily deleted (via the so-called schwa-deletion rule which plays a crucial role in Hindi word phonology ...  Check date values in: |date= (help)
  3. ^ a b c Monojit Choudhury, Anupam Basu and Sudeshna Sarkar (July 2004), "A Diachronic Approach for Schwa Deletion in Indo Aryan Languages", Proceedings of the Workshop of the ACL Special Interest Group on Computational Phonology (SIGPHON) (Association for Computations Linguistics), ... schwa deletion is an important issue for grapheme-to-phoneme conversion of IAL, which in turn is required for a good Text-to-Speech synthesizer ... 
  4. ^ a b Naim R. Tyson, Ila Nagar (2009 (12:15–25)), "Prosodic rules for schwa-deletion in hindi text-to-speech synthesis", International Journal of Speech Technology, ... Without the appropriate deletion of schwas, any speech output would sound unnatural. Since the orthographical representation of Devanagari gives little indication of deletion sites, modern TTS systems for Hindi implemented schwa deletion rules based on the segmental context where schwa appears ...  Check date values in: |date= (help)