From Wikipedia, the free encyclopedia

Charles Morton's 1759 updated version of Edward Bernard's "Orbis eruditi",[1] comparing all known alphabets as of 1689

An alphabet is a standardized set of basic written graphemes (called letters) that represent the phonemes of certain spoken languages.[2] Not all writing systems represent language in this way; in a syllabary, each character represents a syllable, and logographic systems use characters to represent words, morphemes, or other semantic units.[3][4]

The first fully phonemic script, the Proto-Sinaitic script, which later developed into the Phoenician alphabet, is considered to be the first alphabet and is the ancestor of most modern alphabets, abjads, and abugidas, including Arabic, Cyrillic, Greek, Hebrew, Latin, and possibly Brahmic.[5][6] It was created by Semitic-speaking workers and slaves in the Sinai Peninsula, who selected a small number of hieroglyphs commonly seen in their Egyptian surroundings to describe the sounds, as opposed to the semantic values, of the Canaanite languages.[7][8] However, Peter T. Daniels distinguishes three types: an abugida, a set of graphemes that represent consonantal base letters that diacritics modify to represent vowels (as in Devanagari and other South Asian scripts); an abjad, in which letters predominantly or exclusively represent consonants (as in the original Phoenician, Hebrew or Arabic); and an alphabet, a set of graphemes that represent both consonants and vowels. In this narrow sense of the word, the first true alphabet was the Greek alphabet,[9][10] which was based on the earlier Phoenician abjad.

Of the dozens of alphabets in use today, the most popular is the Latin alphabet[11] (originally derived from the Greek alphabet), which is now used by many languages worldwide, often with the addition of extra letters or diacritical marks.

Alphabets are usually associated with a standard ordering of letters. This makes them useful for purposes of collation, specifically by allowing words to be sorted in alphabetical order. It also means that their letters can be used as an alternative method of "numbering" ordered items, in such contexts as numbered lists and number placements.
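The use of letters as an alternative way of "numbering" ordered items can be sketched in a few lines of Python. This is only an illustration: the function name letter_label is hypothetical, and the continuation past the last letter (z, aa, ab, ...) is one common convention assumed here, not something the sources above prescribe.

```python
import string


def letter_label(index: int, letters: str = string.ascii_lowercase) -> str:
    """Return the label for a 1-based position: a, b, ..., z, aa, ab, ...

    The wrap-around after the last letter is an assumed convention.
    """
    if index < 1:
        raise ValueError("index must be 1 or greater")
    base = len(letters)
    label = ""
    while index > 0:
        # Subtract 1 so that each "digit" runs over 1..base, not 0..base-1.
        index, remainder = divmod(index - 1, base)
        label = letters[remainder] + label
    return label


# Label the items of a short list the way a lettered list would.
for position, item in enumerate(["ox", "house", "camel"], start=1):
    print(f"{letter_label(position)}) {item}")
```

Passing a different string as letters (for example the Greek alphabet) would label items in that alphabet's own standard order.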


The English word alphabet came into Middle English from the Late Latin word alphabetum, which in turn originated in the Greek ἀλφάβητος (alphabētos), formed from the names of the first two Greek letters, alpha (α) and beta (β).[12] The names of these Greek letters came from the first two letters of the Phoenician alphabet: aleph, which also meant ox, and bet, which also meant house.[13]


Ancient Northeast African and Middle Eastern scripts

The history of the alphabet started in ancient Egypt. Egyptian writing had a set of some 24 hieroglyphs called uniliterals,[14] which are glyphs that represent a single sound.[15] These glyphs were used as pronunciation guides for logograms, to write grammatical inflections, and, later, to transcribe loan words and foreign names.[16]

In the Middle Bronze Age, an apparently "alphabetic" system known as the Proto-Sinaitic script appeared in Egyptian turquoise mines in the Sinai Peninsula, dated to circa the 15th century BCE and apparently left by Canaanite workers. In 1999, the American Egyptologists John and Deborah Darnell discovered an earlier version of this first alphabet at the Wadi el-Hol valley in Egypt. The script dated to circa 1800 BCE and shows evidence of having been adapted from specific forms of Egyptian hieroglyphs that could be dated to circa 2000 BCE, strongly suggesting that the first alphabet had developed about that time.[17] The appearances and names of its letters are believed to be based on Egyptian hieroglyphs.[5] This script had no characters representing vowels. Originally, it was probably a syllabary (a script in which characters represent syllables) from which the symbols that were not needed were removed. A separate alphabetic cuneiform script with 30 signs, including three that indicate the following vowel, was invented in Ugarit before the 15th century BCE. This script was not used after the destruction of Ugarit in 1178 BCE.[18]

A specimen of Proto-Sinaitic script, one of the earliest (if not the very first) phonemic scripts

The Proto-Sinaitic script eventually developed into the Phoenician alphabet, conventionally called "Proto-Canaanite" before circa 1050 BCE.[6] The oldest text in Phoenician script is an inscription on the sarcophagus of King Ahiram, circa 1000 BCE. This script is the parent script of all western alphabets. By the tenth century BCE, two other forms can be distinguished, Canaanite and Aramaic. The Aramaic gave rise to the Hebrew script.[19] The South Arabian alphabet, a sister script to the Phoenician alphabet, is the script from which the Ge'ez alphabet descended. Ge'ez is an abugida, a writing system in which consonant-vowel sequences are written as units, and was used around the Horn of Africa. Vowel-less alphabets are called abjads, currently exemplified by scripts such as Arabic, Hebrew, and Syriac. The omission of vowels was not always a satisfactory solution, particularly where sacred texts had to be preserved, so some "weak" consonants are used to indicate vowels. These letters have a dual function since they can also be used as pure consonants.[20][21]

The Proto-Sinaitic script and the Ugaritic script were the first scripts with a limited number of signs instead of using many different signs for words, in contrast to the other widely used writing systems at the time, Cuneiform, Egyptian hieroglyphs, and Linear B. The Phoenician script was probably the first phonemic script,[5][6] and it contained only about two dozen distinct letters, making it a script simple enough for traders to learn. Another advantage of Phoenician was that it could write different languages since it recorded words phonemically.[22]

The Phoenician script was spread across the Mediterranean by the Phoenicians.[6] The Greeks added vowels to it, and the resulting Greek alphabet became the ancestor of all alphabets in the West. The Greek alphabet was the first alphabet in which vowels have independent letter forms separate from those of consonants. The Greeks chose letters representing sounds that did not exist in Greek to represent vowels. By contrast, Linear B, the syllabic script used by the Mycenaean Greeks from the 16th century BCE, had 87 symbols, including five vowels. In its early years, there were many variants of the Greek alphabet, a situation that caused many different alphabets to evolve from it.[23]

European alphabets

The Greek alphabet, in Euboean form, was carried over by Greek colonists to the Italian peninsula circa 800–600 BCE, giving rise to many different alphabets used to write the Italic languages. One of these became the Latin alphabet, which spread across Europe as the Romans expanded their republic. After the fall of the Western Roman state, and later the Eastern Roman state, the alphabet survived in intellectual and religious works. It came to be used for the descendant languages of Latin (the Romance languages) and most of the other languages of western and central Europe.[24]

Some adaptations of the Latin alphabet have ligatures, in which two letters are combined into one, such as æ in Danish and Icelandic and Ȣ in Algonquian; borrowings from other alphabets, such as the thorn þ in Old English and Icelandic, which came from the Futhark runes;[25] and modified existing letters, such as the eth ð of Old English and Icelandic, which is a modified d. Other alphabets use only a subset of the Latin alphabet, such as Hawaiian, and Italian, which uses the letters j, k, x, y, and w only in foreign words.[26]

Another notable script is Elder Futhark, believed to have evolved out of one of the Old Italic alphabets. Elder Futhark gave rise to other alphabets known collectively as the Runic alphabets. The Runic alphabets were used for Germanic languages from 100 CE to the late Middle Ages, engraved mostly on stone and jewelry, although inscriptions on bone and wood occasionally appear. These alphabets have since been replaced with the Latin alphabet, except for decorative use, where the runes remained in use until the 20th century.[27]

A photo of the Old Hungarian script.

The Old Hungarian script was the writing system of the Hungarians. It was in use during the entire history of Hungary, albeit not as an official writing system. From the 19th century, it once again became more and more popular.[28]

The Glagolitic alphabet was the initial script of the liturgical language Old Church Slavonic and became, together with the Greek uncial script, the basis of the Cyrillic script. Cyrillic is one of the most widely used modern alphabetic scripts, and is notable for its use in Slavic languages and also for other languages within the former Soviet Union. Cyrillic alphabets include Serbian, Macedonian, Bulgarian, Russian, Belarusian, and Ukrainian. The Glagolitic alphabet is believed to have been created by Saints Cyril and Methodius, while the Cyrillic alphabet was invented by Clement of Ohrid, their disciple. They feature many letters that appear to have been borrowed from or influenced by Greek and Hebrew.[29]

Asian alphabets

Beyond the logographic Chinese writing, many phonetic scripts exist in Asia. The Arabic alphabet, Hebrew alphabet, Syriac alphabet, and other abjads of the Middle East are developments of the Aramaic alphabet.[30][31]

Most alphabetic scripts of India and Eastern Asia descend from the Brahmi script, believed to be a descendant of Aramaic.[32]


In Korea, Sejong the Great created the Hangul alphabet in 1443 CE.[33] Hangul is a unique alphabet in several respects: it is a featural alphabet, where the design of many of the letters comes from a sound's place of articulation (P looks like the widened mouth, and L looks like the tongue pulled in);[34] its creation was planned by the government of the day;[35] and it places individual letters in syllable clusters with equal dimensions, in the same way as Chinese characters. This arrangement allows for mixed-script writing (one syllable always takes up one type-space no matter how many letters are stacked into that one sound-block).[36]


Zhuyin (sometimes called Bopomofo) is a semi-syllabary that transcribes Mandarin phonetically in the Republic of China. After the later establishment of the People's Republic of China and its adoption of Hanyu Pinyin, the use of Zhuyin today is limited. However, it is still widely used in Taiwan, where the Republic of China governs. Zhuyin developed from a form of Chinese shorthand based on Chinese characters in the early 1900s and has elements of both an alphabet and a syllabary. Like an alphabet, the phonemes of syllable initials are represented by individual symbols, but like a syllabary, the phonemes of the syllable finals are not; each possible final (excluding the medial glide) has its own character. For example, luan is written as ㄌㄨㄢ (l-u-an), where the last symbol ㄢ represents the entire final -an. While Zhuyin is not a mainstream writing system, it is still often used in ways similar to a romanization system, for aiding pronunciation and as an input method for Chinese characters on computers and cellphones.[37]


European alphabets, especially Latin and Cyrillic, have been adapted for many languages of Asia. Arabic is also widely used, sometimes as an abjad (as with Urdu and Persian) and sometimes as a complete alphabet (as with Kurdish and Uyghur).[38][39]


Predominant national and selected regional or minority scripts (map; surviving legend entries: Hanzi [L], Kana [S] / Kanji [L])

The term "alphabet" is used by linguists and paleographers in both a wide and a narrow sense. In the broader sense, an alphabet is a segmental script at the phoneme level; that is, it has separate glyphs for individual sounds and not for larger units such as syllables or words. In the narrower sense, some scholars distinguish "true" alphabets from two other types of segmental script, abjads and abugidas. These three differ in how they treat vowels. Abjads have letters for consonants and leave most vowels unexpressed. Abugidas are also consonant-based but indicate vowels with diacritics, a systematic graphic modification of the consonants. In alphabets in the narrow sense, on the other hand, consonants and vowels are written as independent letters.[40] The earliest known alphabet in the wider sense is the Wadi el-Hol script, believed to be an abjad. Its successor, Phoenician, is the ancestor of modern alphabets, including Arabic, Greek, Latin (via the Old Italic alphabet), Cyrillic (via the Greek alphabet), and Hebrew (via Aramaic).[41][42]

Examples of present-day abjads are the Arabic and Hebrew scripts;[43] true alphabets include Latin, Cyrillic, and Korean hangul; and abugidas are used to write Tigrinya, Amharic, Hindi, and Thai. The Canadian Aboriginal syllabics are also an abugida rather than a syllabary, as their name would imply, because each glyph stands for a consonant and is modified by rotation to represent the following vowel (in a true syllabary, each consonant-vowel combination is represented by a separate glyph).[44]

All three types may be augmented with syllabic glyphs. Ugaritic, for example, is basically an abjad but has syllabic letters for /ʔa, ʔi, ʔu/[45][46] (these are the only times that vowels are indicated). Coptic has a letter for /ti/.[47] Devanagari is typically an abugida augmented with dedicated letters for initial vowels, though some traditions use अ as a zero consonant as the graphic base for such vowels.[48][49]

The boundaries between the three types of segmental scripts are not always clear-cut. For example, Sorani Kurdish is written in the Arabic script, which when used for other languages is an abjad.[50] In Kurdish, writing the vowels is mandatory, and whole letters are used, so the script is a true alphabet. Other languages may use a Semitic abjad with forced vowel diacritics, effectively making them abugidas. On the other hand, the Phagspa script of the Mongol Empire was based closely on the Tibetan abugida, but vowel marks are written after the preceding consonant rather than as diacritic marks, although short a is not written, as in the Indic abugidas. In the Ge'ez abugida, the original source of the term "abugida" and now used for Amharic and Tigrinya, the vowel marks have been assimilated into their consonants. The modifications are no longer systematic and must be learned as a syllabary rather than as a segmental script. Even more extreme, the Pahlavi abjad eventually became logographic.[51]

Thus the primary categorisation of alphabets reflects how they treat vowels. For tonal languages, further classification can be based on their treatment of tone, though names do not yet exist to distinguish the various types. Some alphabets disregard tone entirely, especially when it does not carry a heavy functional load,[52] as in Somali and many other languages of Africa and the Americas.[53] Most commonly, tones are indicated with diacritics, much as vowels are treated in abugidas; this is the case for Vietnamese (a true alphabet) and Thai (an abugida). In Thai, the tone is determined primarily by a consonant, with diacritics for disambiguation.[54] In the Pollard script, an abugida, vowels are indicated by diacritics, and the placement of the diacritic relative to the consonant is modified to indicate the tone.[39] More rarely, a script may have separate letters for tones, as is the case for Hmong and Zhuang.[55] For many scripts, regardless of whether letters or diacritics are used, the most common tone is not marked, just as the most common vowel is not marked in Indic abugidas. In Zhuyin, not only is one of the tones unmarked, but there is also a diacritic to indicate a lack of tone, like the virama of Indic scripts.[56]

Sizes of alphabets

The number of letters in an alphabet can be small. The Book Pahlavi script, an abjad, had only twelve letters at one point and may have had even fewer.[57] Today the Rotokas alphabet has only twelve letters (the Hawaiian alphabet is sometimes claimed to be as small, but it consists of 13 letters, including the ʻokina, or 18 if the five long vowels are counted).[58] While Rotokas has a small alphabet because it has few phonemes to represent (just eleven),[59] Book Pahlavi was small because many letters had been conflated; that is, the graphic distinctions had been lost over time.[60] In later Pahlavi papyri, up to half of the remaining graphic distinctions of these twelve letters were lost, and the script could no longer be read as a sequence of letters at all. Instead, each word had to be learned as a whole; that is, the words had become logograms, as in Egyptian Demotic. Moreover, the spellings of some words were heterograms; that is, those spellings did not reflect the pronunciation of those words in Pahlavi but instead reflected their Aramaic equivalents used as logograms (as English e.g. is read 'for example' but abbreviates Latin exempli gratia).[61]

A Venn diagram showing the Greek (left), Cyrillic (bottom) and Latin (right) alphabets, which share many of the same letters, although they have different pronunciations


The largest segmental script is probably Devanagari. When written in Devanagari, Vedic Sanskrit has an alphabet of 53 letters, including the visarga mark for final aspiration and special letters for kṣ and jñ. However, one of the letters is theoretical and not used. The Hindi alphabet must represent both Sanskrit and modern vocabulary, and so has been expanded to 58 with the khutma letters (letters with a dot added) to represent sounds from Persian and English.[48]


The largest known abjad is Sindhi, with 52 letters (28 letters of the Arabic abjad and 24 additional letters) for 62 phonemes (16 vowel sounds and 46 consonant sounds); vowels are not fully distinguished in writing.[62] On the smaller side is Sogdian, an abjad used around modern-day Kazakhstan, Tajikistan, Pakistan, and Xinjiang, with 20 letters: 17 consonants and 3 matres lectionis.[63]


The largest alphabets in the narrow sense include Abkhaz and Kabardian (for Cyrillic), with 62 and 60 letters respectively, and Slovak (for the Latin script), with 46. However, these scripts either count di- and tri-graphs as separate letters, as Spanish did with ch and ll until recently, or use diacritics like Slovak č.[64][65][66] The Georgian alphabet is an alphabetic writing system. The modern Georgian alphabet has 33 letters.[67] The original Georgian alphabet had 38 letters, but five letters were removed in the 19th century by Ilia Chavchavadze.[68]

The Armenian alphabet is an alphabetical writing system used to write the Armenian language. It was created in 405 CE and originally contained 36 letters. Two more letters, օ (o) and ֆ (f), were added in the Middle Ages. During the 1920s orthography reform, a new letter և (capital ԵՎ), which had previously been a ligature of ե+ւ, was added. The letter Ւ ւ was discarded and reintroduced as part of a new letter ՈՒ ու (which was a digraph before).[69]


Syllabaries typically contain 50 to 100 glyphs. The Cherokee syllabary has 85 glyphs,[70] while Hiragana and Katakana, syllabaries used in Japan, each have 46 base characters. However, certain marks can be added to modify the characters, creating 25 new sounds for both.[71] Glyphs of logographic systems typically number from the many hundreds into the thousands. Thus a simple count of the number of distinct symbols is an important clue to the nature of an unknown script. Mandarin Chinese, which is written with a logographic script, can require from 7,000 up to 13,053 different glyphs depending on whether simplified Chinese or traditional Chinese is used.[72]
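The idea that a symbol count hints at the type of an unknown script can be sketched as a rough heuristic. The function names and the exact cut-offs below (under 50, 50 to 100, 300 and above) are assumptions chosen to match the typical ranges quoted in this section; they are not established thresholds, and large segmental scripts such as Abkhaz (62 letters) would be misjudged by such a crude rule.

```python
def count_distinct_symbols(text: str) -> int:
    """Count distinct letter-like characters, ignoring case, spaces, and punctuation."""
    return len({ch.lower() for ch in text if ch.isalpha()})


def guess_script_type(distinct_symbols: int) -> str:
    """Guess the kind of script from its symbol inventory size.

    The cut-offs are illustrative assumptions, not established thresholds.
    """
    if distinct_symbols < 50:
        return "segmental (alphabet, abjad, or abugida)"
    if distinct_symbols <= 100:
        return "syllabary"
    if distinct_symbols < 300:
        return "unclear"
    return "logographic"


sample = "The quick brown fox jumps over the lazy dog"
print(count_distinct_symbols(sample))                      # size of the observed inventory
print(guess_script_type(count_distinct_symbols(sample)))   # heuristic guess
print(guess_script_type(85))                               # e.g. the Cherokee glyph count
```

In practice the count is only meaningful when the sample is large enough to contain most of the script's symbols.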

Alphabetical order

Alphabets often come to be associated with a standard ordering of their letters, which is used for collation, namely for listing words and other items in alphabetical order.[citation needed]

The basic ordering of the Latin alphabet (A B C D E F G H I J K L M N O P Q R S T U V W X Y Z), which derives from the Northwest Semitic "Abgad" order,[73] is well established, although languages using this alphabet have different conventions for their treatment of modified letters (such as the French é, à, and ô) and certain combinations of letters (multigraphs). In French, these are not considered to be additional letters for collation. However, in Icelandic, the accented letters such as á, í, and ö are considered distinct letters representing different vowel sounds from sounds represented by their unaccented counterparts. In Spanish, ñ is considered a separate letter, but accented vowels such as á and é are not. The ll and ch were also formerly considered single letters and sorted separately after l and c, but in 1994 the tenth congress of the Association of Spanish Language Academies changed the collating order so that ll came to be sorted between lk and lm in the dictionary and ch came to be sorted between cg and ci; those digraphs were still formally designated as letters, but in 2010 the Real Academia Española changed it so they are no longer considered letters at all.[74][75]
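The Spanish conventions described above can be illustrated with a small sort-key sketch. The names TRADITIONAL, MODERN, and collation_key are hypothetical, and the code is a simplification that assumes lowercase comparison and treats accented vowels as their base letters; real collation libraries handle many more cases.

```python
# Pre-1994 order: ch and ll count as letters placed after c and l; ñ follows n.
TRADITIONAL = ["a", "b", "c", "ch", "d", "e", "f", "g", "h", "i", "j", "k",
               "l", "ll", "m", "n", "ñ", "o", "p", "q", "r", "s", "t", "u",
               "v", "w", "x", "y", "z"]
# Later order: the digraphs are sorted as plain letter sequences.
MODERN = [letter for letter in TRADITIONAL if len(letter) == 1]

# Accented vowels are not separate letters, so fold them onto the base vowel.
FOLD_ACCENTS = str.maketrans("áéíóúü", "aeiouu")


def collation_key(word: str, alphabet: list[str]) -> list[int]:
    """Turn a word into a list of letter ranks under the given alphabet."""
    rank = {letter: position for position, letter in enumerate(alphabet)}
    word = word.lower().translate(FOLD_ACCENTS)
    key = []
    i = 0
    while i < len(word):
        pair = word[i:i + 2]
        if len(pair) == 2 and pair in rank:   # a digraph counted as one letter
            key.append(rank[pair])
            i += 2
        else:                                 # single letter; unknown symbols sort last
            key.append(rank.get(word[i], len(alphabet)))
            i += 1
    return key


words = ["cuna", "chico", "cita", "llama", "luz", "lobo"]
print(sorted(words, key=lambda w: collation_key(w, TRADITIONAL)))
print(sorted(words, key=lambda w: collation_key(w, MODERN)))
```

Under the traditional alphabet chico sorts after cuna and llama after luz, while under the later convention both move forward to the positions their individual letters imply.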

In German, words starting with sch- (which spells the German phoneme /ʃ/) are inserted between words with initial sca- and sci- (all incidentally loanwords) instead of appearing after the initial sz, as they would if sch were a single letter. This contrasts with several languages such as Albanian, in which dh-, ë-, gj-, ll-, nj-, rr-, th-, xh-, and zh-, which all represent phonemes and are considered separate single letters, follow the letters d, e, g, l, n, r, t, x, and z, respectively, as well as with Hungarian and Welsh. Further, German words with an umlaut are collated ignoring the umlaut, in contrast to Turkish, which adopted the graphemes ö and ü, and where a word like tüfek comes after tuz in the dictionary. An exception is the German telephone directory, where umlauts are sorted like ä=ae since names such as Jäger also appear with the spelling Jaeger and are not distinguished in the spoken language.[76]
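The two German conventions can be contrasted with a pair of sort keys. This is a simplified sketch: the names dictionary_key and phonebook_key are hypothetical, and treating ß as ss in both keys is an added assumption not stated above.

```python
# Dictionary convention: the umlaut is ignored (ä sorts like a).
DICTIONARY = str.maketrans({"ä": "a", "ö": "o", "ü": "u", "ß": "ss"})
# Telephone-directory convention: the umlaut is expanded (ä sorts like ae).
PHONEBOOK = str.maketrans({"ä": "ae", "ö": "oe", "ü": "ue", "ß": "ss"})


def dictionary_key(word: str) -> str:
    """Sort key that ignores umlauts."""
    return word.lower().translate(DICTIONARY)


def phonebook_key(word: str) -> str:
    """Sort key that treats umlauts as vowel + e."""
    return word.lower().translate(PHONEBOOK)


names = ["Jahn", "Jäger", "Jaeger", "Jadwiga"]
print(sorted(names, key=dictionary_key))
print(sorted(names, key=phonebook_key))
```

With the directory key, Jäger and Jaeger receive the same key and therefore end up next to each other, which is the practical point of the ä=ae rule.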

The Danish and Norwegian alphabets end with æøå,[77][78] whereas the Swedish alphabet conventionally puts åäö at the end. However, æ phonetically corresponds with ä, as ø does with ö.[79]

It is unknown whether the earliest alphabets had a defined sequence. Some alphabets today, such as the Hanuno'o script, are learned one letter at a time, in no particular order, and are not used for collation where a definite order is required.[80] However, a dozen Ugaritic tablets from the fourteenth century BCE preserve the alphabet in two sequences. One, the ABCDE order later used in Phoenician, has continued with minor changes in Hebrew, Greek, Armenian, Gothic, Cyrillic, and Latin; the other, HMĦLQ, was used in southern Arabia and is preserved today in Ethiopic.[81] Both orders have therefore been stable for at least 3000 years.[82]

Runic used an unrelated Futhark sequence, which was simplified later on.[83] Arabic usually uses its own sequence, although it retains the traditional abjadi order for numbers.[84]

The Brahmic family of alphabets used in India uses a unique order based on phonology: the letters are arranged according to how and where the sounds are produced in the mouth. This organization is present in Southeast Asia, Tibet, Korean hangul, and even Japanese kana, which is not an alphabet.[85]

Names of letters

The Phoenician letter names, in which each letter got associated with a word that begins with that sound (acrophony), continue to be used to varying degrees in Samaritan, Aramaic, Syriac, Hebrew, Greek, and Arabic.[86][87][88][89]

Acrophony was abandoned in Latin, which referred to the letters by adding a vowel (usually "e," sometimes "a" or "u") before or after the consonant. The two exceptions were Y and Z, which were borrowed from the Greek alphabet rather than Etruscan. They were known as Y Graeca "Greek Y" and zeta (from Greek); this discrepancy was inherited by many European languages, as in the term zed for Z in all forms of English other than American English.[90] Over time names sometimes shifted or were added, as in double U for W, or "double V" in French, the English name for Y, and American zee for Z. Comparing the letter names in English and French gives a clear reflection of the Great Vowel Shift: A, B, C, and D are pronounced /eɪ, biː, siː, diː/ in today's English, but in contemporary French they are /a, be, se, de/.[91][unreliable source?] The French names (from which the English names are derived) preserve the qualities of the English vowels before the Great Vowel Shift. By contrast, the names of F, L, M, N, and S (/ɛf, ɛl, ɛm, ɛn, ɛs/) remain the same in both languages because "short" vowels were largely unaffected by the Shift.[92]

In Cyrillic, originally, acrophony was present using Slavic words.[93][unreliable source?] However, this got abandoned in favor of a system similar to Latin.[94][unreliable source?]

Letters of the Armenian alphabet also have distinct letter names.[95][unreliable source]

Orthography and pronunciation

When an alphabet is adopted or developed to represent a given language, an orthography generally comes into being, providing rules for spelling words, following the principle on which alphabets are based. These rules will map letters of the alphabet to the phonemes of the spoken language.[96] In a perfectly phonemic orthography, there would be a consistent one-to-one correspondence between the letters and the phonemes so that a writer could predict the spelling of a word given its pronunciation, and a speaker would always know the pronunciation of a word given its spelling. However, this ideal is rarely achieved in practice. Some languages, such as Spanish and Finnish, come close to it; others, such as English, deviate from it to a much larger degree.[97]

The pronunciation of a language often evolves independently of its writing system. Writing systems have also been borrowed for languages they were not designed for. As a result, the degree to which letters of an alphabet correspond to phonemes of a language varies.[98]

Languages may fail to achieve a one-to-one correspondence between letters and sounds in any of several ways:

  • A language may represent a given phoneme by combinations of letters rather than just a single letter. Two-letter combinations are called digraphs, and three-letter groups are called trigraphs. German uses the tetragraphs (four letters) "tsch" for the phoneme [tʃ] and (in a few borrowed words) "dsch" for [dʒ].[99] Kabardian also uses a tetragraph for one of its phonemes, namely "кхъу."[100] Multi-letter combinations representing one sound occur in several instances in Hungarian as well (where, for instance, cs stands for [tʃ], sz for [s], zs for [ʒ], dzs for [dʒ]).[101]
  • A language may represent the same phoneme with two or more different letters or combinations of letters. An example is modern Greek, which may write the phoneme [i] in six different ways: ⟨ι⟩, ⟨η⟩, ⟨υ⟩, ⟨ει⟩, ⟨οι⟩, and ⟨υι⟩.[citation needed]
  • A language may spell some words with unpronounced letters that exist for historical or other reasons. For example, the spelling of the Thai word for "beer" [เบียร์] retains a letter for the final consonant "r" present in the English word it borrows but silences it.[102]
  • Pronunciation of individual words may change according to the presence of surrounding words in a sentence (sandhi).[citation needed]
  • Different dialects of a language may use different phonemes for the same word.[103]
  • A language may use different sets of symbols or rules for distinct vocabulary items. Examples are the Japanese hiragana and katakana syllabaries, and the rules in English for spelling words from Latin and Greek as opposed to those for the original Germanic vocabulary.[citation needed]
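The first point in the list above, that one phoneme may be spelled with several letters, can be illustrated by splitting a written word into spelling units. The sketch below uses only the Hungarian and German combinations mentioned in the list; the names HUNGARIAN_MULTIGRAPHS and segment are hypothetical, and the greedy longest-match strategy is an assumed simplification that can split a word wrongly where two separate letters merely happen to meet.

```python
# Multi-letter spellings and the sounds they stand for, taken from the list above.
HUNGARIAN_MULTIGRAPHS = {"cs": "tʃ", "sz": "s", "zs": "ʒ", "dzs": "dʒ"}
GERMAN_MULTIGRAPHS = {"tsch": "tʃ", "dsch": "dʒ"}


def segment(word: str, multigraphs: dict[str, str]) -> list[str]:
    """Split a word into spelling units, preferring the longest known multigraph."""
    word = word.lower()
    longest = max((len(unit) for unit in multigraphs), default=1)
    units = []
    i = 0
    while i < len(word):
        # Try the longest candidate first so that "dzs" wins over "zs".
        for size in range(longest, 0, -1):
            chunk = word[i:i + size]
            if size == 1 or chunk in multigraphs:
                units.append(chunk)
                i += len(chunk)
                break
    return units


print(segment("dzsem", HUNGARIAN_MULTIGRAPHS))
print(segment("deutsch", GERMAN_MULTIGRAPHS))
```

Counting the resulting units rather than the raw letters gives a figure closer to the number of sounds in the word, which is one way to see how far a spelling departs from one letter per phoneme.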

National languages sometimes elect to address the problem of dialects by associating the alphabet with the national standard. Some national languages like Finnish, Armenian, Turkish, Russian, Serbo-Croatian (Serbian, Croatian and Bosnian) and Bulgarian have a very regular spelling system with a nearly one-to-one correspondence between letters and phonemes. Strictly speaking, these national languages lack a word corresponding to the verb "to spell" (meaning to split a word into its letters), the closest match being a verb meaning to split a word into its syllables.[citation needed] Similarly, the Italian verb corresponding to 'spell (out),' compitare, is unknown to many Italians because spelling is usually trivial, as Italian spelling is highly phonemic.[citation needed] In standard Spanish, one can tell the pronunciation of a word from its spelling, but not vice versa, as phonemes sometimes can be represented in more than one way, but a given letter is consistently pronounced.[104] French, with its silent letters and its heavy use of nasal vowels and elision, may seem to lack much correspondence between spelling and pronunciation, but its rules on pronunciation, though complex, are actually consistent and predictable with a fair degree of accuracy.[105]

At the other extreme are languages such as English, where pronunciations mostly have to be memorized as they do not correspond to the spelling consistently. For English, this is partly because the Great Vowel Shift occurred after the orthography got established and because English has acquired a large number of loanwords at different times, retaining their original spelling at varying levels.[106] Even English has general, albeit complex, rules that predict pronunciation from spelling. Rules like this are usually successful. However, rules to predict spelling from pronunciation have a higher failure rate.[107]

Sometimes, countries have the written language undergo a spelling reform to realign the writing with the contemporary spoken language. These can range from simple spelling changes and word forms to switching the entire writing system. For example, Turkey switched from the Arabic alphabet to a Latin-based Turkish alphabet,[108] and Kazakh changed from an Arabic script to a Cyrillic script under the Soviet Union's influence and, in 2021, began a transition to the Latin alphabet, as Turkish had done.[109][110] The Cyrillic script used to be official in Uzbekistan and Turkmenistan before both switched to the Latin alphabet; Uzbekistan is also reforming its alphabet to use diacritics in place of the letters marked by apostrophes and the letters that are digraphs.[111][112]

The standard system of symbols used by linguists to represent sounds in any language, independently of orthography, is called the International Phonetic Alphabet.

See also


  1. ^ Jeans, Edwin (1860). A Catalogue of Books, in all Branches of Literature, both Ancient & Modern ... on sale at E. Jeans's, bookseller ... Norwich. J. Fletcher. p. 33.
  2. ^ Pulgram, Ernst (1951). "Phoneme and Grapheme: A Parallel". WORD. 7 (1): 15–20. doi:10.1080/00437956.1951.11659389. ISSN 0043-7956.
  3. ^ Daniels & Bright 1996, p. 4
  4. ^ Taylor, Insup (1980), Kolers, Paul A.; Wrolstad, Merald E.; Bouma, Herman (eds.), "The Korean writing system: An alphabet? A syllabary? a logography?", Processing of Visible Language, Boston, MA: Springer US, pp. 67–82, doi:10.1007/978-1-4684-1068-6_5, ISBN 978-1-4684-1070-9, retrieved 19 June 2021
  5. ^ a b c Coulmas 1989, pp. 140–141
  6. ^ a b c d Daniels & Bright 1996, pp. 92–96
  7. ^ Goldwasser, O. (2012). "The Miners that Invented the Alphabet - a Response to Christopher Rollston". Journal of Ancient Egyptian Interconnections. 4 (3): 9–22. doi:10.2458/azu_jaei_v04i3_goldwasser.
  8. ^ Goldwasser, O. (2010). "How the Alphabet was Born from Hieroglyphs". Biblical Archaeology Review. 36 (2): 40–53.
  9. ^ Coulmas, Florian (1996). The Blackwell Encyclopedia of Writing Systems. Oxford: Blackwell Publishing. ISBN 978-0-631-21481-6.
  10. ^ Millard 1986, p. 396
  11. ^ Haarmann 2004, p. 96
  12. ^ "alphabet".
  13. ^ "Alphabet | Definition, History, & Facts | Britannica". Retrieved 4 January 2023.
  14. ^ Lynn, Bernadette (8 April 2004). "The Development of the Western Alphabet". h2g2. BBC. Retrieved 4 August 2008.
  15. ^ "Uniliteral Signs". Retrieved 24 January 2023.
  16. ^ Daniels & Bright 1996, pp. 74–75
  17. ^ Darnell, J. C.; Dobbs-Allsopp, F. W.; Lundberg, Marilyn J.; McCarter, P. Kyle; Zuckerman, Bruce; Manassa, Colleen (2005). "Two Early Alphabetic Inscriptions from the Wadi el-Ḥôl: New Evidence for the Origin of the Alphabet from the Western Desert of Egypt". The Annual of the American Schools of Oriental Research. 59: 63, 65, 67–71, 73–113, 115–124. JSTOR 3768583.
  18. ^ Ugaritic Writing online
  19. ^ Coulmas 1989, p. 142
  20. ^ Coulmas 1989, p. 147
  21. ^ "Matres lectionis | orthography | Britannica". Retrieved 20 January 2023.
  22. ^ Hock, Hans; Joseph, Brian (22 July 2019). Language History, Language Change, and Language Relationship: An Introduction to Historical and Comparative Linguistics (3rd ed.). Mouton De Gruyter. p. 85. ISBN 978-3110609691.
  23. ^ Ventris, Michael; Chadwick, John (2015). Documents in Mycenaean Greek: Three Hundred Selected Tablets from Knossos, Pylos and Mycenae with Commentary and Vocabulary (Reprinted ed.). Cambridge University Press. p. 60. ISBN 978-1107503410.
  24. ^ Jeffery, L. H.; Johnston, A. W. (10 May 1990). The Local Scripts of Archaic Greece: A Study of the Origin of the Greek Alphabet and Its Development from the Eighth to the Fifth Centuries B.C. (Oxford Monographs on Classical Archaeology) (Revised ed.). Clarendon Press. ISBN 978-0198140610.
  25. ^ Knight, Sirona (2008). Runes. New York: Sterling. ISBN 978-1-4027-6006-8. OCLC 213301655.
  26. ^ Robustelli, Cecilia; Maiden, Martin (4 February 2014). A Reference Grammar of Modern Italian. Routledge Reference Grammars (2nd ed.). Routledge (published 25 May 2007). ISBN 978-0340913390.
  27. ^ Stifter, David (2010), "Lepontische Studien: Lexicon Leponticum und die Funktion von san im Lepontischen", in Stüber, Karin; et al. (eds.), Akten des 5. Deutschsprachigen Keltologensymposiums. Zürich, 7.–10. September 2009, Wien.
  28. ^ Maxwell, Alexander (2004). "Contemporary Hungarian Rune-Writing Ideological Linguistic Nationalism within a Homogenous Nation" (PDF). Anthropos.
  29. ^ "Glagolitic alphabet | Britannica". Retrieved 30 November 2022.
  30. ^ "Aramaic Alphabet | PDF | Languages Of Asia | Writing". Scribd. Retrieved 4 January 2023.
  31. ^ Blau, Joshua (2010). Phonology and morphology of Biblical Hebrew : an introduction. Winona Lake, Ind.: Eisenbrauns. ISBN 978-1-57506-601-1. OCLC 759160098.
  32. ^ "Brāhmī | writing system | Britannica". Retrieved 4 January 2023.
  33. ^ "上親制諺文二十八字...是謂訓民正音 (His majesty created 28 characters himself... It is Hunminjeongeum (original name for Hangul))", 《세종실록 (The Annals of the Choson Dynasty : Sejong)》 25년 12월.
  34. ^ Hitkari, Cherry (6 October 2021). "Alphabet's Epitome: The Invention of Hangul and its Contribution to the Korean Society". Retrieved 30 November 2022.
  35. ^ "Hangul | Alphabet Chart & Pronunciation | Britannica". Retrieved 30 November 2022.
  36. ^ Paul A. Kolers; Merald Ernest Wrolstad; Herman Bouma (1980). Processing of visible language 2. New York. ISBN 0-306-40576-8. OCLC 7099393.
  37. ^ "The Definition of the Bopomofo Chinese Phonetic System". ThoughtCo. Retrieved 30 November 2022.
  38. ^ Thackston, W.M. (2006), "—Sorani Kurdish— A Reference Grammar with Selected Readings", Harvard Faculty of Arts & Sciences, Harvard University, retrieved 10 June 2021
  39. ^ a b Zhou, Minglang (24 October 2012). Multilingualism in China: The Politics of Writing Reforms for Minority Languages. Mouton de Gruyter.
  40. ^ For critics of the abjad-abugida-alphabet distinction, see Reinhard G. Lehmann: "27-30-22-26. How Many Letters Needs an Alphabet? The Case of Semitic", in: The idea of writing: Writing across borders; edited by Alex de Voogt and Joachim Friedrich Quack, Leiden: Brill 2012, p. 11-52, esp p. 22-27
  41. ^ "Sinaitic inscriptions | Alphabet, Meaning, & Decipherment | Britannica". Retrieved 30 November 2022.
  42. ^ Thamis. "The Phoenician Alphabet & Language". World History Encyclopedia. Retrieved 30 November 2022.
  43. ^ Lipiński, Edward (1975). Studies in Aramaic inscriptions and onomastics. [Leuven]: Leuven University Press. ISBN 90-6186-019-9. OCLC 2005521.
  44. ^ Bernard Comrie, 2005, "Writing Systems", in Haspelmath et al. eds, The World Atlas of Language Structures (p 568 ff). Also Robert Bringhurst, 2004, The solid form of language: an essay on writing and meaning.
  45. ^ Florian Coulmas, 1991, The writing systems of the world
  46. ^ Schniedewind, William M. (2007). A primer on Ugaritic : language, culture, and literature. Joel H. Hunt. New York: Cambridge University Press. ISBN 978-0-511-34933-1. OCLC 647687091.
  47. ^ "КОПТСКОЕ ПИСЬМО • Большая российская энциклопедия - электронная версия". Retrieved 30 November 2022.
  48. ^ a b Dhanesh Jain; George Cardona (2007). The Indo-Aryan languages. London: Routledge. ISBN 978-1-135-79711-9. OCLC 648298147.
  49. ^ "A Practical Sanskrit Introductory by Charles Wikner". Retrieved 30 November 2022.
  50. ^ Thackston, W. M. (2022). Sorani Kurdish — A Reference Grammar with Selected Readings. Independently Published. ISBN 979-8837159206.
  51. ^ Nyberg, Henrik (1964). A Manual of Pahlavi: Glossary (in German). Harrassowitz (published 31 December 1974). ISBN 978-3447015806.
  52. ^ Alphonsa, Alice Celin; Bhanja, Chuya China; Laskar, Azharuddin; Laskar, Rabul Hussain (July 2017). "Spectral feature based automatic tonal and non-tonal language classification". 2017 International Conference on Intelligent Computing, Instrumentation and Control Technologies (ICICICT): 1271–1276. doi:10.1109/ICICICT1.2017.8342752. ISBN 978-1-5090-6106-8. S2CID 5060391.
  53. ^ Galaal, Muuse Haaji Ismaaʻiil; Andrzejewski, Bogumił W. (1956). Hikmaad Soomaali. Oxford University Press.
  54. ^ B., Alisscia (20 March 2021). Thai-English Picture Book: Thai Consonants, Vowels, 4 Tone Marks, Numbers and Activity Book for Kids | Thai Language Learning. Amazon Digital Services LLC (published 20 March 2021). ISBN 9798725525847.
  55. ^ Clark, Marybeth (2000), Deixis and anaphora and prelinguistic universals, Oceanic Linguistics Special Publications, vol. 29, pp. 46–61
  56. ^ "Devanagari - an overview | ScienceDirect Topics". Retrieved 30 November 2022.
  57. ^ Asher, Ronald (2005). Brown, Keith (ed.). Encyclopedia of Language and Linguistics. Elsevier Science. ISBN 9780080547848.
  58. ^ Group, United Language. "Some Little Known Facts About the Hawaiian Language". Retrieved 24 January 2023.
  59. ^ Robinson, Stuart (June 2006). The Phoneme Inventory of the Aita Dialect of Rotokas. Vol. 45. University of Hawai'i Press. pp. 206–209.
  60. ^ Utas, Bo (2013). From Old to New Persian : collected essays. Carina Jahani, Mehrdad Fallahzadeh. Wiesbaden. ISBN 978-3-89500-970-9. OCLC 856903580.
  61. ^ Henning, Walter B. (1958), Altiranisch. Handbuch der Orientalistik. Erste Abteilung, vol. Band IV: Iranistik. Erster Abschnitt. Linguistik, Leiden-Köln: Brill
  62. ^ International Phonetic Association (1999). Handbook of the International Phonetic Association : a guide to the use of the International Phonetic Alphabet. Cambridge, U.K.: Cambridge University Press. ISBN 0-521-65236-7. OCLC 40305532.
  63. ^ Qarīb, Badr al-Zamān (1995). Sogdian dictionary : Sogdian-Persian-English (Chāp-i avval ed.). Tehran: Farhangan Publications. ISBN 964-5558-06-9. OCLC 34145239.
  64. ^ Unicode (10 May 2008). "Document ISO/IEC JTC1/SC2/WG2 N3435R" (PDF). Unicode. Retrieved 1 December 2022.
  65. ^ Kuipers, Aert (1960). Phoneme and Morpheme in Kabardian (Eastern Adyghe). Mouton.
  66. ^ Hanulíková, Adriana; Hamann, Silke (December 2010). "Slovak". Journal of the International Phonetic Association. 40 (3): 373–378. doi:10.1017/S0025100310000162. ISSN 0025-1003. S2CID 232347779.
  67. ^ Unicode Standard, V. 6.3. U10A0, p. 3
  68. ^ Matthias Hüning; Ulrike Vogl; Olivier Moliner (2012). Standard languages and multilingualism in European history. Amsterdam: John Benjamins Pub. Co. p. 299. ISBN 978-90-272-7391-8. OCLC 793996608.
  69. ^ "Armenian alphabet | writing system | Britannica". Retrieved 25 January 2023.
  70. ^ Handbook of North American Indians. William C. Sturtevant. Washington: Smithsonian Institution. 1978. ISBN 978-1-944466-53-4. OCLC 13240086.
  71. ^ "Kana | Japanese writing | Britannica". Retrieved 25 January 2023.
  72. ^ Sonnad, Nikhil (18 December 2015). "The long, incredibly tortuous, and fascinating process of creating a Chinese font". Quartz. Retrieved 27 January 2023.
  73. ^ Reinhard G. Lehmann: "27-30-22-26. How Many Letters Needs an Alphabet? The Case of Semitic", in: The idea of writing: Writing across borders; edited by Alex de Voogt and Joachim Friedrich Quack, Leiden: Brill 2012, p. 11-52
  74. ^ Real Academia Española. Exclusión de «ch» y «ll» del abecedario.
  75. ^ "La 'i griega' se llamará 'ye'". Cuba Debate. 2010-11-05. Retrieved 12 December 2010.
  76. ^ DIN 5007-1:2005-08 FILING OF CHARACTER STRINGS - PART 1: GENERAL RULES FOR PROCESSING (ABC RULES) (in German). German Institute for Standardisation (Deutsches Institut für Normung). 2005.
  77. ^ WAGmob (25 December 2013). Learn Danish (Alphabet and Numbers). WAGmob.
  78. ^ WAGmob (2 January 2014). Learn Norwegian (Alphabet and Numbers). WAGmob.
  79. ^ Holmes, Philip (2003). Swedish : a comprehensive grammar. Ian Hinchliffe (2nd ed.). London: Routledge. ISBN 9780415278836. OCLC 52269425.
  80. ^ Conklin, Harold C. (2007). Fine description : ethnographic and linguistic essays. Joel Corneal Kuipers, Ray McDermott. New Haven, Conn.: Yale University Southeast Asia Studies. pp. 320–342. ISBN 978-0-938692-85-0. OCLC 131239101.
  81. ^ Millard 1986, p. 395
  82. ^ "ScriptSource - Ethiopic (Geʻez)". Retrieved 14 December 2022.
  83. ^ Elliott, Ralph Warren Victor (1980). Runes, an introduction. Manchester, Eng.: Manchester Univ. Press. p. 14. ISBN 0-7190-0787-9. OCLC 7088245.
  84. ^ "ترتيب المداخل والبطاقات في القوائم والفهارس الموضوعية - منتديات اليسير للمكتبات وتقنية المعلومات". Retrieved 2 December 2022.
  85. ^ Frellesvig, Bjarke (2010). A history of the Japanese language. Cambridge: Cambridge University Press. pp. 177–178. ISBN 978-0-511-93242-7. OCLC 695989981.
  86. ^ "World Wide Words: Acrophony". World Wide Words. Retrieved 13 December 2022.
  87. ^ "The Samaritan Script". The Samaritans. Retrieved 13 December 2022. Notice the "Names of the Letters" Section.
  88. ^ MacLeod, Ewan (2015). Learn The Aramaic Alphabet. pp. 3–4.
  89. ^ "Arabic alphabet, ABC - Names in Arabic". 3 February 2013. Retrieved 13 December 2022.
  90. ^ Sampson, Geoffrey (1985). Writing systems : a linguistic introduction. Stanford, Calif.: Stanford University Press. ISBN 0-8047-1254-9. OCLC 12745931.
  91. ^ "French Alphabet & Pronunciation". Retrieved 13 December 2022.
  92. ^ "The Great Vowel Shift". Retrieved 13 December 2022. Note how it says short vowels are similar between Middle and Modern English.
  93. ^ "The Cyrillic Alphabet: Origins". Retrieved 13 December 2022.
  94. ^ "Russian Alphabet - (Cyrillic Alphabet) - Letter Names". Retrieved 13 December 2022.
  95. ^ "Armenian Alphabet". Retrieved 13 December 2022.
  96. ^ Seidenberg, Mark (1992). Frost, Ram; Katz, Leonard (eds.). Beyond Orthographic Depth in Reading: Equitable Division of Labor. Advances in Psychology. ISBN 9780444891402.
  97. ^ Nordlund, Taru (2012). "Standardization of Finnish Orthography: From Reformists to National Awakeners". Walter de Gruyter: 351–372. doi:10.1515/9783110288179.351. ISBN 9783110288179. S2CID 156286003.
  98. ^ Rogers, Henry (1 January 1999). "Sociolinguistic factors in borrowed writing systems". Toronto Working Papers in Linguistics. 17. ISSN 1718-3510.
  99. ^ Reindl, Donald (2005). The Effects of Historical German-Slovene Language Contact on the Slovene Language (Digitized ed.). Indiana University, Department of Slavic Languages and Literature. p. 90.
  100. ^ Dictionaries, An International Encyclopedia of Lexicography. Vol. 3rd. Walter De Gruyter. 1991.
  101. ^ Berecz, Ágoston (2020). Empty signs, historical imaginaries : the entangled nationalization of names and naming in a late Habsburg borderland. New York. p. 211. ISBN 978-1-78920-635-7. OCLC 1135915948.
  102. ^ Allyn, Eric; Chaiyana, Samorn (1995). The Bua Luang What You See is what You Say Thai Phrase Handbook Contemporary Thai-language Phrases in Context, WYSIWYS Easier-to-read Transliteration System. Bua Luang Publishing Company. ISBN 9780942777048. Note in the pronunciation guide next to "เบียร์" it has it being said as, "Bia"
  103. ^ Gasser, Michael (10 April 2021). "4.5: English Accents". Social Sci LibreTexts. Retrieved 15 December 2022.
  104. ^ "Spanish Pronunciation: The Ultimate Guide | The Mimic Meth". The Mimic Method. 17 January 2017. Retrieved 13 December 2022.
  105. ^ Rochester, Myrna Bell (2009). Easy French step-by-step : master high-frequency grammar for French proficiency--fast!. New York: McGraw Hill. ISBN 978-0-07-164221-7. OCLC 303676798.
  106. ^ Denham, Kristin E. (2010). Linguistics for everyone : an introduction. Anne C. Lobeck. Boston, MA: Wadsworth/ Cengage Learning. ISBN 978-1-4130-1589-8. OCLC 432689138.
  107. ^ Linstead, Stephen (11 December 2014). "English spellings don't match the sounds they are supposed to represent. It's time to change | Mind your language". the Guardian. Retrieved 13 December 2022.
  108. ^ Zürcher, Erik Jan (2004). Turkey : a modern history (3rd ed.). London: I.B. Tauris. pp. 188–189. ISBN 1-4175-5697-8. OCLC 56987767.
  109. ^ "Нұрсұлтан Назарбаев. Болашаққа бағдар: рухани жаңғыру". 28 June 2017. Archived from the original on 28 June 2017. Retrieved 13 December 2022.
  110. ^ О переводе алфавита казахского языка с кириллицы на латинскую графику [On the change of the alphabet of the Kazakh language from the Cyrillic to the Latin script] (in Russian). President of the Republic of Kazakhstan. 26 October 2017. Archived from the original on 27 October 2017. Retrieved 26 October 2017.
  111. ^ "ÖZBEK ALIFBOSI". Retrieved 13 December 2022.
  112. ^ "Uzbekistan Aims For Full Transition To Latin-Based Alphabet By 2023". RadioFreeEurope/RadioLiberty. Retrieved 13 December 2022.


Further reading

  • Josephine Quinn, "Alphabet Politics" (review of Silvia Ferrara, The Greatest Invention: A History of the World in Nine Mysterious Scripts, translated from the Italian by Todd Portnowitz, Farrar, Straus and Giroux, 2022, 289 pp.; and Johanna Drucker, Inventing the Alphabet: The Origins of Letters from Antiquity to the Present, University of Chicago Press, 2022, 380 pp.), The New York Review of Books, vol. LXX, no. 1 (19 January 2023), pp. 6, 8, 10.

External links