Two scripts are currently used for the Tatar language: Arabic (in China) and Cyrillic (in Russia and Kazakhstan).

History of Tatar writing

Before 1928, the Tatar language was usually written in alphabets based on the Arabic script: the İske imlâ alphabet before 1920 and the Yaña imlâ alphabet in 1920–1927. Some letters, such as چ and پ, were borrowed from the Persian alphabet, and the letter ڭ (called nef or sağır kef) was borrowed from Chagatai. The writing system was inherited from Volga Bulgar.

The oldest work of Tatar literature (Qíssai Yosıf by Qol-Ğäli, written in the Old Tatar language) was created at the beginning of the 13th century. Until 1905, all literature was in Old Tatar, which was partly derived from the Bolgar language and is not intelligible to speakers of modern Tatar. From 1905, newspaper publishers started using modern Tatar. In 1918, the Arabic-based alphabet was revised: some new letters for Tatar sounds were added and some Arabic letters were removed. The Latin-based Jaꞑalif alphabet was in use between 1928 and 1939, and the Cyrillic-based alphabet has been used ever since.

Some scholars regard Institutiones linguae Turcicae libri quator ("The Basic Rules of the Turkic Language"), written in Latin by Hieronymus Megiser and printed in Leipzig in 1612, as the first printed Tatar book; it is the first example of a Turkic text printed in Arabic script.[1] Meanwhile, Megiser's Chorographia Tartariae,[2] published in 1611, describes a unique Tartarian alphabet and cites the Lord's Prayer in the Tartarian language, written in Latin script. The first Turkic-Tatar printed publication in Russia[3] appears to be Peter the Great's Manifest, printed in Arabic script and published in Astrakhan in 1722.

Printed books appeared en masse in 1801, when the first private printing house (the "Oriental typography") opened in Kazan.

The first attempt to publish a Tatar newspaper, which proved unsuccessful, was made in 1808, when I.I. Zapolsky, a professor of mathematics at Kazan University, proposed publishing a newspaper, "The Kazan News", in both Russian and Tatar. Zapolsky's untimely death in 1810 thwarted the project. The first successful attempt to publish a newspaper in Tatar came in 1905. On September 2, the first issue of the newspaper "Nur" was published in St. Petersburg by Gataulla Bayazitov. The second Tatar newspaper, "Kazan Muhbire", came into existence on October 29, 1905. Its publisher was Saidgirey Alkin, a member of the Kazan City Council.

The first Tatar typewriter was created in Tatarstan in the 1920s and used the Arabic-based alphabet.

In the 1930s, Turkey became a potential enemy of the Soviet Union. Even though the Turkish alphabet, introduced in 1928, was different from Jaꞑalif, Soviet officials saw the Latin script as a symbol of the outer, bourgeois world. This motivated the switch of all Turkic languages of the USSR to the Cyrillic script.

This was not the first project to introduce the Cyrillic script for the Tatar language. Since 1861, the Keräşens ethnic group had used Nikolay Ilminsky's alphabet, which was based on pre-1917 Russian orthography: it used fita and dotted I to spell Orthodox proper names, the additional Cyrillic letters Ӓ, Ӧ, Ӱ for Tatar vowels, and the ligature Ҥ for [ŋ]. This alphabet is related to the Mari alphabet, and was used because Christian Tatars couldn't use the Arabic script. By the 1930s, Ilminsky's alphabet was forgotten and could not be used due to its religious origin. In 1938, Professor M. Fazlullin introduced an adaptation of the Russian alphabet for the Tatar language, without any additional characters. Tatar sounds absent from Russian were to be represented with digraphs consisting of Russian letters and the letters Ъ and Ь.[5]

In 1939, Qorbangaliev and Ramazanov offered their own projects, which planned to use additional Cyrillic characters. The letters Ө, Ә, Ү, Һ were inherited from Jaꞑalif, but Җ and Ң were invented by analogy with Щ and Ц. ⟨Гъ⟩ and ⟨къ⟩ were suggested to designate [ʁ] and [q], spelled in Jaꞑalif as ⟨ƣ⟩ and ⟨q⟩ respectively. In Ramazanov's project, [w] (Jaꞑalif ⟨v⟩) was spelled as ⟨в⟩ before a vowel, and as ⟨у⟩ or ⟨ү⟩ at the end of a syllable. On 5 May 1939, the Presidium of the Supreme Soviet of the Tatar ASSR issued the decree "On switching Tatar writing from the Latin-based alphabet to an alphabet based on Russian glyphs", which opened with a declaration that the switch was enacted "in response to numerous requests by Tatar workers, kolkhozniks, and intelligentsia."[6] Tatar society objected to this project, and during a conference in July 1940, the Cyrillic alphabet was amended. The updated alphabet was accepted on 10 January 1941.

| Jaꞑalif | Proposed spelling (1939) | Accepted spelling (1940) | Meaning |
| --- | --- | --- | --- |
| ƣədət | гъәдәт | гадәт | "custom" |
| qar | къар | кар | "snow" |
| vaq | вакъ | вак | "small" |
| tav | тау | тау | "mountain" |
| dəv | дәү | дәү | "big" |

[q] and [ʁ] are allophones of /k/ and /ɡ/ in the environment of back vowels, and the accepted spelling doesn't explicitly distinguish between the allophones in each pair. When ⟨га/го/гу/гы/ка/ко/ку/кы⟩ is followed by a "soft syllable", containing one of the front vowels ⟨ә, е, ө, и, ү⟩ or the soft sign ⟨ь⟩, they are pronounced as [ʁæ/ʁɵ/ʁy/ʁe/qæ/qɵ/qy/qe], otherwise as [ʁɑ/ʁo/ʁu/ʁɤ/qɑ/qo/qu/qɤ]. ⟨гә/гө/гү/ге/кә/кө/кү/ке⟩ are pronounced as [ɡæ/ɡɵ/ɡy/ɡe/kæ/kɵ/ky/ke]. Similar rules apply to ⟨е, ю, я⟩ which could be pronounced as either [je, jy, jæ] or [jɤ, ju, jɑ]. The soft sign is not used to show palatalization as in Russian, but to show qualities of vowels where they are not determinable through vowel harmony. Unlike modern Russian, some words can end with ⟨гъ⟩, representing [ʁ] after a front vowel, as in ⟨балигъ⟩ [bɑliʁ] ("baligh").[5] In total, the Tatar Cyrillic script requires the Russian alphabet plus 6 extra letters: Әә, Өө, Үү, Җҗ, Ңң, Һһ. All Russian loanwords are written as in Russian and should be pronounced with Russian pronunciation.
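The reading rule for ⟨г⟩ and ⟨к⟩ described above can be sketched in Python. This is only an illustration of the rule as quoted, not a full grapheme-to-phoneme converter: the function names `read_pair` and `next_syllable_is_soft` are invented for this example, input is assumed to be lowercase, and "the following syllable" is approximated as "the letters up to the next vowel or soft sign", which is a simplifying assumption.

```python
# Readings copied from the rule above: back-vowel pairs have a "hard" and a
# "soft" reading; front-vowel pairs always use plain [ɡ] / [k].
HARD = {"га": "ʁɑ", "го": "ʁo", "гу": "ʁu", "гы": "ʁɤ",
        "ка": "qɑ", "ко": "qo", "ку": "qu", "кы": "qɤ"}
SOFT = {"га": "ʁæ", "го": "ʁɵ", "гу": "ʁy", "гы": "ʁe",
        "ка": "qæ", "ко": "qɵ", "ку": "qy", "кы": "qe"}
FRONT = {"гә": "ɡæ", "гө": "ɡɵ", "гү": "ɡy", "ге": "ɡe",
         "кә": "kæ", "кө": "kɵ", "кү": "ky", "ке": "ke"}

SOFT_MARKS = set("әеөиүь")      # front vowels plus the soft sign
OTHER_VOWELS = set("аоуыэюяё")  # assumption: these never make a syllable "soft"


def next_syllable_is_soft(rest: str) -> bool:
    """Approximate 'followed by a soft syllable' by scanning to the next vowel or ь."""
    for ch in rest:
        if ch in SOFT_MARKS:
            return True
        if ch in OTHER_VOWELS:
            return False
    return False  # nothing follows, so the hard reading applies


def read_pair(word: str, i: int) -> str:
    """Return the reading of the consonant+vowel pair starting at index i."""
    pair = word[i:i + 2]
    if pair in FRONT:
        return FRONT[pair]
    if pair in HARD:
        return SOFT[pair] if next_syllable_is_soft(word[i + 2:]) else HARD[pair]
    raise ValueError(f"not a г/к + vowel pair: {pair!r}")


if __name__ == "__main__":
    for word, index in [("канәгать", 0), ("канәгать", 4), ("карлыгач", 0), ("карлыгач", 5)]:
        print(word, index, read_pair(word, index))
```

With these tables, ⟨ка⟩ in ⟨канәгать⟩ is read with the soft value because ⟨нә⟩ follows, while ⟨ка⟩ in ⟨карлыгач⟩ keeps the hard value because the next vowel is ⟨ы⟩. The ambiguous letters ⟨ю, я⟩ are deliberately left out of the soft set, since the rule as stated does not say how they affect a preceding pair.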

The complexity of the orthographic rules led to discussions about amending the Tatar Cyrillic alphabet again; these included sessions in the Kazan branch of the Academy of Sciences of the Soviet Union (KFAN), conducted in January 1954 and in February–March 1959, which did not result in any specific proposal for a new alphabet. In 1972, Professor Nikolai Baskakov suggested adding three new letters to the Tatar Cyrillic alphabet, Қ, Ғ and Ў, for the sounds [q], [ʁ] and [w], to make Tatar spelling phonetic. On 18 May 1989, the Orthographic Commission formed by the KFAN published the new alphabet, which included Baskakov's three new letters, and the new spelling rules.[7] The new alphabetic order was as follows, with the new letters shown in brackets:

А Ә Б В [Ў] Г [Ғ] Д Е (Ё) Ж Җ З И Й К [Қ] Л М Н Ң О Ө П Р С Т У Ү Ф Х Һ Ц Ч Ш Щ Ъ Ы Ь Э Ю Я

| Transcription | Accepted spelling (1940) | Proposed spelling (1989) | New Latin spelling (1999) | Meaning |
| --- | --- | --- | --- | --- |
| [diqqæt] | дикъкать | диққәт | diqqət | "attention" |
| [qɑrlɤʁɑɕ] | карлыгач | қарлығач | qarlığaç | "swallow" |
| [qænæʁæt] | канәгать | қәнәғәт | qənəğət | "satisfied" |
| [jɤl] | ел | йыл | yıl | "year" |
| [jefæk] | ефәк | йефәк | yefək | "silk" |
| [jæm] | ямь | йәм | yəm | "charm" |
| [jynæleʃ] | юнәлеш | йүнәлеш | yünəleş | "direction" |
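In the examples above, each letter of the proposed 1989 Cyrillic spelling lines up with exactly one letter of the 1999 Latin spelling. A minimal Python sketch of that letter-by-letter conversion follows. The mapping is partial and covers only the lowercase letters that occur in these seven example words; the names `CYR1989_TO_LAT1999` and `to_latin_1999` are made up for the illustration.

```python
# Partial mapping, limited to the letters that occur in the example words
# above; it is not the full alphabet.
CYR1989_TO_LAT1999 = {
    "а": "a", "ә": "ə", "д": "d", "е": "e", "и": "i", "й": "y",
    "к": "k", "қ": "q", "ғ": "ğ", "л": "l", "м": "m", "н": "n",
    "р": "r", "т": "t", "ү": "ü", "ф": "f", "ч": "ç", "ш": "ş",
    "ы": "ı",
}


def to_latin_1999(word: str) -> str:
    """Convert a lowercase word in the 1989 Cyrillic proposal letter by letter."""
    # A KeyError here means the word uses a letter outside the partial mapping.
    return "".join(CYR1989_TO_LAT1999[ch] for ch in word)


if __name__ == "__main__":
    for word in ["диққәт", "қарлығач", "қәнәғәт", "йыл", "йефәк", "йәм", "йүнәлеш"]:
        print(word, "->", to_latin_1999(word))
```

No context rules are needed because the 1989 proposal writes [q], [ʁ] and the vowel after [j] explicitly. Converting the accepted 1940 spellings in the same way would not work, since a letter such as ⟨к⟩ there can stand for either [k] or [q].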

The spelling system of 1940 led to many homographs and near-homographs between Tatar and Russian with entirely different pronunciations, e.g. ⟨гарь⟩ [ʁær] "shame" and ⟨гарь⟩ [ɡarʲ] "cinder". This presented difficulties for pupils learning the spelling systems of the two languages simultaneously. One of the goals of the new spelling system was that the same sequence of letters would correspond to the same sounds, whether in a Russian word or in a Tatar word. Yet the amended orthography was never formally adopted, as popular opinion in the 1990s leaned towards switching to a Latin-based alphabet instead of changing the Cyrillic one. Thus, on 20 July 1994, the Supreme Council of the Republic of Tatarstan approved a gradual transition to a Latin-based script;[8] the urgency of such a transition was included in the resolution of the Second World Congress of the Tatars in 1997.[9] Recognizing the popular demand, on 15 September 1999, the State Council of the Republic of Tatarstan issued the decree "On restoring the Tatar alphabet based on Latin glyphs".[10] Despite the name of the decree, the new Latin alphabet was significantly different from Jaꞑalif, and its letters had a one-to-one correspondence with the proposed Cyrillic alphabet from 1989.[11] On 27 September 2000, the Cabinet of Ministers updated the new Latin alphabet, replacing the three uncommon characters inherited from Jaꞑalif (Ə, Ɵ, Ꞑ) with characters present in the Latin-1 encoding and in most computer fonts.[12]
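The 2000 update can be pictured as a character-for-character substitution. The Python sketch below assumes the replacement letters are Ä, Ö and Ñ, which are the forms dated 2000–2005 in the correspondence table; the names `UPDATE_2000`, `to_2000_spelling` and `fits_latin1` are hypothetical and exist only for this example.

```python
# Assumed replacements: Ə -> Ä, Ɵ -> Ö, Ꞑ -> Ñ (upper and lower case).
UPDATE_2000 = str.maketrans({
    "Ə": "Ä", "ə": "ä",
    "Ɵ": "Ö", "ɵ": "ö",
    "Ꞑ": "Ñ", "ꞑ": "ñ",
})


def to_2000_spelling(text: str) -> str:
    """Rewrite text in the 1999 Latin alphabet using the 2000 letter forms."""
    return text.translate(UPDATE_2000)


def fits_latin1(text: str) -> bool:
    """Report whether every character of text can be encoded as Latin-1."""
    try:
        text.encode("latin-1")
    except UnicodeEncodeError:
        return False
    return True


if __name__ == "__main__":
    old = "diqqət"
    new = to_2000_spelling(old)
    print(old, fits_latin1(old))
    print(new, fits_latin1(new))
```

The check only demonstrates the point for the three replaced characters. Other letters of the alphabet, such as ğ, ş and ı, are outside Latin-1, so a word like "qarlığaç" still fails `fits_latin1` after the substitution.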

Correspondence between alphabets

No. | Cyrillic alphabet (since 1940) | Fazlullin's proposal (1938) | Ilminsky's alphabet (1861) | Yaña imlâ | Jaꞑalif | New Latin alphabet | Notes
1 А а А а А а A a A a
2 Б б Б б Б б ب B ʙ B b
3 В в В в В в ۋ, و V v W w, V v [v] in Russian words, [w] in Tatar words
4 Г г Г г Г г ﮒ, ﻉ G g, Ƣ ƣ G g, Ğ ğ
5 Д д Д д Д д D d D d
6 Е е Е е Е е ئ E e, Je, Jь E e, ye, yı
7 Ё ё Е е يؤ Jo Yo only in Russian loanwords
8 Ж ж Ж ж Ж ж ژ Ƶ ƶ J j
9 З з З з З з Z z Z z
10 И и И и И и ئی I i İ i
11 Й й Й й Й й ي J j Y y
12 К к К к К к ﮎ, ق K k, Q q K k, Q q
13 Л л Л л Л л ل L l L l
14 М м М м М м م M m M m
15 Н н Н н Н н ن N n N n
16 О о О о О о ࢭئۇ O o O o
17 П п П п П п P p P p
18 Р р Р р Р р R r R r
19 С с С с С с S s S s
20 Т т Т т Т т ت T t T t
21 У у У у У у ࢭئو U u U u
22 Ф ф Ф ф Ф ф ف F f F f
23 Х х Х х Х х X x X x
24 Ц ц Ц ц Ц ц تس Ts Ts only in Russian loanwords
25 Ч ч Ч ч Ч ч C c Ç ç
26 Ш ш Ш ш Ш ш Ş ş Ş ş
27 Щ щ Щ щ Щ щ شچ Şc Şç only in Russian loanwords
28 Ъ ъ Ъ ъ Ъ ъ
29 Ы ы Ы ы Ы ы ࢭئ Ь ь I ı
30 Ь ь Ь ь Ь ь
31 Э э Э э Э э ئ E e E e
32 Ю ю Ю ю Ю ю يو Ju, Jy Yu, Yü
33 Я я Я я Я я يا Ja, Jə Ya, Yä
34 Ә ә Аъ аъ Ӓ ӓ (Я я) ﺋﻪ Ə ə Ə ə (1999), Ä ä (2000–2005)
35 Ө ө Оъ оъ Ӧ ӧ Ɵ ɵ Ɵ ɵ (1999), Ö ö (2000–2005)
36 Ү ү Уъ уъ Ӱ ӱ (Ю ю) Y y Ü ü
37 Җ җ Жъ жъ Ж ж Ç ç C c
38 Ң ң Нъ нъ Ҥ ҥ ڭ Ꞑ ꞑ Ꞑ ꞑ (1999), Ñ ñ (2000–2005)
39 Һ һ Хъ хъ Х х ه H h H h

Before the 1980s, in the listing of the alphabet, extra letters were placed after the Russian ones, as shown above. The Tatar Parliament changed the alphabetic order in January 1997 to the one shown below.[5]

Cyrillic version

The official Cyrillic version of the Tatar alphabet used in Tatarstan contains 39 letters:

А Ә Б В Г Д Е (Ё) Ж Җ З И Й К Л М Н Ң О Ө П Р С Т У Ү Ф Х Һ Ц Ч Ш Щ Ъ Ы Ь Э Ю Я
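Because the six additional letters sit in a different Unicode range from the basic Russian letters, sorting Tatar words by code point does not follow this alphabetic order. A minimal Python sketch of a sort key based on the order listed above follows. The names `TATAR_CYRILLIC_ORDER` and `tatar_sort_key` are illustrative only, and placing Ё directly after Е is an assumption based on its bracketed position in the list.

```python
# Lowercase letters in the alphabetic order listed above (39 letters).
TATAR_CYRILLIC_ORDER = "аәбвгдеёжҗзийклмнңоөпрстуүфхһцчшщъыьэюя"
RANK = {letter: position for position, letter in enumerate(TATAR_CYRILLIC_ORDER)}


def tatar_sort_key(word: str) -> list:
    """Build a key that compares words letter by letter in alphabet order."""
    # Characters outside the alphabet (digits, hyphens, ...) sort after all
    # letters and are ordered among themselves by code point.
    return [(RANK.get(ch, len(RANK)), ch) for ch in word.lower()]


if __name__ == "__main__":
    words = ["ямь", "хокуклары", "һәм", "үз", "тау"]
    print(sorted(words))                      # code point order
    print(sorted(words, key=tatar_sort_key))  # Tatar alphabet order
```

With the key, ⟨үз⟩ sorts before ⟨хокуклары⟩ and ⟨һәм⟩ before ⟨ямь⟩, as the alphabet requires, whereas plain code point sorting pushes both to the end. Real applications would more likely rely on a locale-aware collation library; the explicit table is used here only to make the ordering visible.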

Letter names and pronunciation

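Letters and symbols of the Tatar Cyrillic alphabet

| Cyrillic version | Common Turkic Alphabet | ISO-9 | Name | Pronunciation | Notes |
| --- | --- | --- | --- | --- | --- |
| А а | a | a | а /a/ | [a] | |
| Ә ә | ä | ä | ә /æ/ | [æ] | |
| Б б | b | b | бэ /be/ | [b] | |
| В в | w, v | v | вэ /we/ | [w]; [v] | |
| Г г | g, ğ | g | гэ /ɡe/ | [ɡ]; [ɣ] | |
| Д д | d | d | дэ /de/ | [d] | |
| Е е | e, ye, yı | e | йе /je/, йы /jɤ/ | [je]; [jɤ]; [e] | |
| Ё ё | yo | ë | йо /jo/ | [jo] | |
| Ж ж | j | ž | жэ /ʒe/ | [ʒ] | |
| Җ җ | c | ẓ̌ | җэ /ʑe/ | [ʑ] | |
| З з | z | z | зэ /ze/ | [z] | |
| И и | i | i | и /i/ | [i] | |
| Й й | y | j | кыска и /qɤsˈqɑ ˈi/ | | |
| К к | k, q | k | ка /qɑ/ | [k]; [q] | |
| Л л | l | l | эль /el/ | [l] | |
| М м | m | m | эм /em/ | [m] | |
| Н н | n | n | эн /en/ | [n] | |
| Ң ң | ñ | ņ | эң /eŋ/ | [ŋ] | |
| О о | o | o | о /o/ | [o] | |
| Ө ө | ö | ô | ө /ø/ | [ø] | |
| П п | p | p | пэ /pe/ | [p] | |
| Р р | r | r | эр /er/ | [r] | |
| С с | s | s | эс /es/ | [s] | |
| Т т | t | t | тэ /te/ | [t] | |
| У у | u, w | u | У /u/ | [u]; [w] | |
| Ү ү | ü, w | ù | Ү /y/ | [y]; [w] | |
| Ф ф | f | f | эф /ef/ | [f] | |
| Х х | x | h | ха /xa/ | [x] | |
| Һ һ | h | | һэ /he/ | [h] | |
| Ц ц | ts | c | цэ /tse/ | [t͡s] | |
| Ч ч | ç | č | чэ /ɕe/ | [ɕ] | |
| Ш ш | ş | š | ша /ʃa/ | [ʃ] | |
| Щ щ | şç | ŝ | ща /ʃɕa/ | [ʃɕ] | |
| Ъ ъ | ' | | калынлык билгесе /qɑlɤnˈlɤq bilɡeˈse/ | [ʔ] | калынлык һәм аеру билгесе ("hardness and separation sign") |
| Ы ы | ı | y | ы /ɤ/ | [ɤ] | |
| Ь ь | ' | | нечкәлек билгесе /neɕkæˈlek bilɡeˈse/ | [ʔ] | нечкәлек һәм аеру билгесе ("softness and separation sign") |
| Э э | e, ' | è | э /e/ | [e]; [ʔ] | |
| Ю ю | yu, yü | û | йу /ju/ | [ju]; [jy] | |
| Я я | ya, yä | â | йа /ja/ | [ja]; [jæ] | |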

Under Russian federal law, only Cyrillic alphabets may have official status in the regions of the Russian Federation. There is an ongoing confrontation over the adoption of the Latin script for the Tatar language.

Latin version

According to the decree "On restoring the Tatar alphabet based on Latin glyphs" from 1999, the new Latin alphabet would be in official use alongside the Cyrillic alphabet from 1 September 2001, and would become the sole alphabet in official use by 1 September 2011. Around the same time, the Republic of Karelia was pursuing official status for the Karelian language, which also uses a Latin-based alphabet.[13] The Russian State Duma perceived the latinization in the two republics as a form of language secessionism, and on 15 November 2002 it introduced an amendment into the law On the languages of the peoples of the Russian Federation stating that all official languages of the republics within the Russian Federation must use Cyrillic alphabets.[14]

The Republic of Tatarstan challenged the amendment in the Constitutional Court of Russia, arguing that the State Duma doesn't have authority over the language policies of the constituent republics.[15] On 16 November 2004, the Constitutional Court declined the appeal.[16] To comply with the court's decision, the decree "On restoring the Tatar alphabet based on Latin glyphs" was officially rescinded on 22 January 2005.[17]

On 24 December 2012, a new Tatarstani law clarified that the new Latin alphabet, as specified in 2000, should be used as the official romanization for the Tatar language. It also specified Yaña imlâ as the official system for transliteration into the Arabic script. According to this law, requests to Tatarstani authorities may use the Latin and Arabic scripts, but the authorities' answers would be written in Cyrillic, with an optional transliteration into the other alphabets.[18][19] As of 2020, Cyrillic remains the only official script in Tatarstan.

Zamanälif (Tatar for "modern alphabet") contains 34 letters. There are 10 vowels and 25 consonants. In addition to the ISO basic Latin alphabet, the following 9 letters are used: Çç, Ğğ, Şş, Ññ, Ää, Öö, Üü, Iı, İi.

A, Ä, B, C, Ç, D, E, F, G, Ğ, H, I, I, J, K, L, M, N, Ñ, O, Ö, P, Q, R, S, Ş, T, U, Ü, V, W, X, Y, Z.

Tatar vowels are: a/ä, o/ö, u/ü, ıy/i, ı/e.

The symbol ⟨'⟩ is used for the glottal stop (known as hämzä in Tatar).

Tatar writing is largely phonetic, meaning that the pronunciation of a word can usually be derived from its spelling. Exceptions are recent loanwords, such as "summit", and names.

Letter names and pronunciation

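| Position in alphabet | Latin character | Name in Latin | Name in Cyrillic | IPA |
| --- | --- | --- | --- | --- |
| 1 | A a | A | А | ɑ, ʌ |
| 2 | Ä ä | Ä, noqtalı A | Ә, нокталы А | æ, ə |
| 3 | B b | | Бэ | b |
| 4 | C c | | Җэ | ʑ |
| 5 | Ç ç | Çé | Чэ | ɕ, t͡ʃ |
| 6 | D d | | Дэ | d |
| 7 | E e | E | Э | e |
| 8 | F f | Éf | Эф | f |
| 9 | G g | | Ге | ɡ |
| 10 | Ğ ğ | Ğé | Гъэ | ɣ |
| 11 | H h | | Һэ | h |
| 12 | İ i | İ | И | i |
| 13 | I ı | I | Ы | ɨ |
| 14 | J j | | Жэ | ʒ, d͡ʒ |
| 15 | K k | | Ке | k |
| 16 | L l | El | Эль | l |
| 17 | M m | Ém | Эм | m |
| 18 | N n | Én | Эн | n |
| 19 | Ñ ñ | Éñ | Эң | ŋ |
| 20 | O o | O | О | o, oː |
| 21 | Ö ö | Ö, noqtalı O | Ө, нокталы О | œ |
| 22 | P p | | Пэ | p |
| 23 | Q q | Qu | Ку | q |
| 24 | R r | Ér | Эр | r |
| 25 | S s | És | Эс | s |
| 26 | Ş ş | Şa | Ша | ʃ |
| 27 | T t | | Тэ | t |
| 28 | U u | U | У | u |
| 29 | Ü ü | Ü, noqtalı U | Ү, нокталы У | ʏ |
| 30 | V v | | Вэ | v |
| 31 | W w | | Вэ (Уэ) | w |
| 32 | X x | Éx | Эх | x |
| 33 | Y y | | Йэ | j, ɪ |
| 34 | Z z | Zet | Зет | z |
| | ' | Hämzä | Һәмзә | ʔ |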

Arabic version

Sample of the scripts

Article 1 of the Universal Declaration of Human Rights:

İske imlâ: بارلق كشیلر دا آزاد هم اوز آبرويلري هم حقوقلری یاغیننن تینک بولیپ طوالر. آلرغا عقل هم وجدان برلگان هم بر-برسینا قراطا طوغاننرچا مناسبتتا بولرغا تییشلر.

Yaña imlâ: بارلئق كئشئلەر دە ئازات هەم ئوز ئابرویلارئ هەم حۇقوقلارئ یاعئننان تیڭ بولئپ توالار. ئالارعا ئاقئل هەم وۇجدان بیرئلگەن هەم بئر-بئرسئنە قاراتا توعاننارچا مۇناسەبەتتە بولئرعا تیئشلەر.

Jaꞑalif: Barlьq keşelər də azat həm yz aʙrujlarь həm xoquqlarь jaƣьnnan tiꞑ ʙulьp tualar. Alarƣa aqьl həm vɵçdan ʙirelgən həm ʙer-ʙersenə qarata tuƣannarca mɵnasəʙəttə ʙulьrƣa tieşlər.

Cyrillic: Барлык кешеләр дә азат һәм үз абруйлары һәм хокуклары ягыннан тиң булып туалар. Аларга акыл һәм вөҗдан бирелгән һәм бер-берсенә карата туганнарча мөнасәбәттә булырга тиешләр.

Zamanälif: Barlıq keşelär dä azat häm üz abruyları häm xoquqları yağınnan tiñ bulıp tualar. Alarğa aqıl häm wöcdan birelgän häm ber-bersenä qarata tuğannarça mönasäbättä bulırğa tieşlär.

English translation: All human beings are born free and equal in dignity and rights. They are endowed with reason and conscience and should act towards one another in a spirit of brotherhood.

