Mainland Southeast Asia linguistic area
The Mainland Southeast Asia (MSEA) linguistic area stretches from Thailand to China and is home to speakers of languages of the Sino-Tibetan, Hmong–Mien (or Miao–Yao), Tai–Kadai, Austronesian (represented by Chamic) and Austroasiatic families. Neighbouring languages across these families, though presumed unrelated, often have similar typological features, which are believed to have spread by diffusion. James Matisoff referred to this area as the Sinosphere, contrasted with the "Indosphere".
Language distribution 
The Austroasiatic languages include Vietnamese and Khmer, as well as many other languages spoken in scattered pockets as far afield as Malaya and eastern India. Most linguists believe that Austroasiatic languages once ranged continuously across Southeast Asia and that their scattered distribution today is the result of the subsequent migration of speakers of other language groups from southern China.
Chinese civilization and the Chinese languages spread from their home in the North China Plain into the Yangtze valley and then into southern China during the first millennium BC and first millennium AD. Indigenous groups in these areas either became Chinese, retreated to the hill country, or migrated to the south. Thus the Tai–Kadai languages, today including Thai, Lao and Shan, were originally spoken in southern China, where the greatest diversity within the family is still found, and possibly as far north as the Yangtze valley. With the exception of Zhuang, most of the Tai–Kadai languages still remaining in China are spoken in isolated upland areas. Similarly, the Miao–Yao or Hmong–Mien languages may originally have been spoken in the middle Yangtze. Today they are scattered across isolated hill regions of southern China. Many of their speakers migrated to Southeast Asia in the 18th and 19th centuries, after the suppression of a series of revolts in Guizhou.
The upland regions of the interior of the area, as well as the plains of Burma, are home to speakers of other Sino-Tibetan languages, the Tibeto-Burman languages. The Austronesian languages, spoken across the Pacific and Indian Oceans, are represented in MSEA by the divergent Chamic group.
Syllable structure 
A characteristic of MSEA languages is a particular syllable structure involving monosyllabic morphemes, lexical tone, a fairly large inventory of consonants, including phonemic aspiration, limited clusters at the beginning of a syllable, and plentiful vowel contrasts. Final consonants are typically highly restricted, often limited to glides and nasals or unreleased stops at the same points of articulation, with no clusters and no voice distinction. Languages in the northern part of the area generally have fewer vowel and final contrasts but more initial contrasts.
Most MSEA languages tend to have monosyllabic morphemes, though there are exceptions. Some polysyllabic morphemes exist even in Old Chinese and Vietnamese, often as loan words from other languages. A related syllable structure found in some languages, such as the Mon–Khmer languages, is the sesquisyllable, consisting of a stressed syllable with approximately the above structure, preceded by an unstressed "minor" syllable consisting only of a consonant and a neutral vowel /ə/. This structure is present in many conservative Mon–Khmer languages such as Khmer (Cambodian), as well as in Burmese, and is reconstructed for the older stages of a number of Sino-Tibetan languages.
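As a rough illustration, the syllable shape described in this section can be written as a small data structure. This is only a sketch: the class and field names, the coda inventory and the sample forms are hypothetical and do not describe any particular language.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical coda inventory: glides, nasals, and stops at the same
# places of articulation. Real inventories differ from language to language.
ALLOWED_CODAS = {"", "j", "w", "m", "n", "ŋ", "p", "t", "k"}


@dataclass(frozen=True)
class Syllable:
    onset: str                         # initial consonant or limited cluster
    vowel: str
    coda: str = ""                     # at most one final consonant, no clusters
    tone: Optional[str] = None         # None for non-tonal languages
    minor_onset: Optional[str] = None  # consonant of an unstressed minor syllable

    def __post_init__(self):
        # Finals are highly restricted, so reject anything outside the inventory.
        if self.coda not in ALLOWED_CODAS:
            raise ValueError(f"coda {self.coda!r} is not permitted")

    @property
    def is_sesquisyllabic(self) -> bool:
        return self.minor_onset is not None

    def render(self) -> str:
        # A minor syllable is just a consonant plus the neutral vowel /ə/.
        minor = f"{self.minor_onset}ə" if self.minor_onset else ""
        return f"{minor}{self.onset}{self.vowel}{self.coda}"
```

For example, `Syllable("t", "a", "k", minor_onset="k").render()` gives the made-up sesquisyllable `kətak`, while a coda cluster such as `"st"` raises `ValueError`.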
Tone systems 
Phonemic tone is one of the best-known characteristics of Southeast Asian languages. The tone systems of Middle Chinese, Proto-Miao–Yao, proto-Tai and early Vietnamese all display a three-way tonal contrast in syllables lacking stop endings. In traditional analyses, syllables ending in stops have been treated as a fourth or "checked tone", because their distribution parallels that of syllables with nasal codas. Moreover, the earliest strata of loans display a regular correspondence between tonal categories in the different languages:
|Vietnamese||Proto-Tai||Proto-Miao–Yao||Middle Chinese||suggested origin|
|*A (ngang-huyền)||*A||*A||平 píng "level"||-|
|*B (sắc-nặng)||*C||*B||上 shǎng "rising"||*-ʔ|
|*C (hỏi-ngã)||*B||*C||去 qù "departing"||*-h < *-s|
The incidence of these tones in Chinese, Tai and Miao–Yao words follows a similar 2:1:1 ratio. Thus rhyme dictionaries such as the Qieyun divide the level tone between two volumes while covering each of the other tones in a single volume. Vietnamese has a different distribution, with tone B four times more common than tone C.
It was long believed that tone was an invariant feature of languages, suggesting that these groups must be related. However, the class of tonal languages cut across groups of languages with shared basic vocabulary. In 1954 André-Georges Haudricourt solved this paradox by demonstrating that Vietnamese tones corresponded to certain final consonants in other (atonal) Austroasiatic languages. He thus argued that the Austroasiatic proto-language had been atonal, and that the development of tone in Vietnamese had been conditioned by these consonants, which had subsequently disappeared, a process now known as tonogenesis. Haudricourt further proposed that tone in the other languages had a similar origin. Other scholars have since uncovered transcriptional and other evidence for these consonants in early forms of Chinese, and many linguists now believe that Old Chinese was atonal. A smaller amount of similar evidence has been found for proto-Tai. Moreover, since the realization of tone categories as pitch contours varies so widely between languages, the correspondence observed in early loans suggests that the conditioning consonants were still present at the time of borrowing.
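The conditioning described here can be sketched as a toy function that assigns a tone category from a syllable's final consonant, following the "suggested origin" column of the table above. This is a simplified illustration, not a reconstruction: the function name, the stop inventory and the input forms are hypothetical, the letters A, B and C follow the Vietnamese column, and the retention of final stops in the checked category is taken from the description of checked syllables.

```python
# Hypothetical set of final stops; real inventories vary by language.
FINAL_STOPS = ("p", "t", "k")


def tone_category(pre_tonal: str) -> tuple[str, str]:
    """Return (syllable after loss of the conditioning consonant, tone category).

    Category labels follow the Vietnamese column of the table:
      B  from final *-ʔ
      C  from final *-h, itself from earlier *-s
      A  no conditioning consonant
    Stop-final syllables keep their stop and fall into the "checked" category.
    """
    if pre_tonal.endswith("ʔ"):
        return pre_tonal[:-1], "B"
    if pre_tonal.endswith(("h", "s")):
        return pre_tonal[:-1], "C"
    if pre_tonal.endswith(FINAL_STOPS):
        return pre_tonal, "checked"
    return pre_tonal, "A"
```

With made-up inputs, `tone_category("maʔ")` returns `("ma", "B")` and `tone_category("mas")` returns `("ma", "C")`: two syllables once distinguished by a final consonant end up distinguished by tone alone.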
Loss of voicing with tone or register split 
A characteristic sound change (a phonemic split) occurred in most Southeast Asian languages around 1000 AD. First, syllables with voiced initial consonants came to be pronounced with a lower pitch than those with unvoiced initials. In most of these languages, with a few exceptions such as Wu Chinese, the voicing distinction subsequently disappeared, and the pitch contour became distinctive. In tonal languages, each of the tones split into two "registers", yielding a typical pattern of six tones in unchecked syllables and two in checked ones. Pinghua and Yue Chinese, as well as neighbouring Tai languages, have further tone splits in checked syllables, while many other Chinese varieties, including Mandarin Chinese, have merged some tonal categories.
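The arithmetic of the split (three unchecked tones times two registers, plus one checked category times two registers) can be checked with a short sketch. The register labels "upper" and "lower", the voiced-to-voiceless pairs and the sample syllables are assumptions made for illustration only.

```python
from itertools import product

UNCHECKED_TONES = ("A", "B", "C")  # three-way contrast without stop endings
CHECKED_TONES = ("checked",)       # stop-final syllables
REGISTERS = ("upper", "lower")     # hypothetical labels; "lower" = former voiced initial

# Hypothetical voiced -> voiceless pairs used to model the loss of voicing.
DEVOICE = {"b": "p", "d": "t", "g": "k"}


def split(initial: str, rest: str, tone: str) -> tuple[str, str]:
    """Apply the register split, then the loss of voicing, to one syllable."""
    register = "lower" if initial in DEVOICE else "upper"
    new_initial = DEVOICE.get(initial, initial)
    return new_initial + rest, f"{tone}-{register}"


def tone_inventory(tones: tuple[str, ...]) -> list[str]:
    """Every tone category splits into one tone per register."""
    return [f"{t}-{r}" for t, r in product(tones, REGISTERS)]


print(len(tone_inventory(UNCHECKED_TONES)))  # 6
print(len(tone_inventory(CHECKED_TONES)))    # 2
```

In this model `split("b", "a", "A")` and `split("p", "a", "A")` both yield the segments `pa`, but with the tones `A-lower` and `A-upper` respectively, so the old voicing contrast survives only as a tonal one.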
Many non-tonal languages instead developed a register split, with voiced consonants producing breathy-voiced vowels and unvoiced consonants producing normally voiced vowels. Often, the breathy-voiced vowels subsequently went through additional, complex changes (e.g. diphthongization). Examples of languages affected this way are Mon and Khmer (Cambodian). Breathy voicing has since been lost in standard Khmer, although the vowel changes triggered by it still remain.
Many of these languages have subsequently developed some voiced obstruents. The most common such sounds are /b/ and /d/ (often pronounced with some implosion), which result from former preglottalized /ʔb/ and /ʔd/, which were common phonemes in many Asian languages and which behaved like voiceless obstruents. In addition, Vietnamese developed voiced fricatives through a different process: in words consisting of an initial, unstressed minor syllable followed by a stressed major syllable, the medial stop at the beginning of the major syllable turned into a voiced fricative, and the minor syllable was then lost.
Morphology and syntax 
Most MSEA languages are of the isolating type, with mostly mono-morphemic words, no inflection and little affixation. Nouns are derived by compounding; for example, Mandarin Chinese is rich in polysyllabic words. Grammatical relations are typically signalled by word order, particles and coverbs or prepositions. Modality is expressed using sentence-final particles. The usual word order in MSEA languages is subject–verb–object. Chinese, Bai and Karen are thought to have changed to this order from the subject–object–verb order retained by most other Sino-Tibetan languages. The order of constituents within a noun phrase varies: noun–modifier order is usual in Tai languages and Miao, while in Chinese varieties and Yao most modifiers are placed before the noun. Topic-comment organization is also common.
MSEA languages typically have well-developed systems of numeral classifiers. The Bengali language, spoken just to the west of Southeast Asia, also has numeral classifiers, even though it is an Indo-European language that does not share the other MSEA features. Bengali also lacks gender, unlike most Indo-European languages.
References
- Enfield (2005), pp. 182–184.
- Matisoff (1991), p. 486.
- Sidwell & Blench (2011), pp. 339–340.
- Ramsey (1987), p. 233.
- Ramsey (1987), pp. 278–279.
- Enfield (2005), pp. 186–187.
- Enfield (2005), p. 186.
- Downer (1963).
- Luo (2008), p. 11.
- Norman (1988), p. 56.
- Ballard (1985), p. 171.
- Gedney (1989).
- Ratliff (2002).
- Norman (1988), p. 53.
- Enfield (2005), pp. 192–193.
- Enfield (2005), pp. 187–190.
- Ramsey (1987), p. 280.
- Enfield (2005), pp. 189–190.
- Enfield (2005), p. 189.
Works cited
- Ballard, W.L. (1985), "Aspects of the Linguistic History of South China", Asian Perspectives 24 (2): 163–185.
- Downer, G.B. (1963), "Chinese, Thai, and Miao-Yao", in Shorto, H.L., Linguistic Comparison in South East Asia and the Pacific, School of Oriental and African Studies, University of London, pp. 133–139.
- Enfield, N.J. (2005), "Areal Linguistics and Mainland Southeast Asia", Annual Review of Anthropology 34 (1): 181–206, doi:10.1146/annurev.anthro.34.081804.120406.
- Gedney, William J. (1989), "Speculations on early Tai tones", in Gedney, William J.; Bickner, Robert J., Selected Papers on Comparative Tai Studies, Center for South and Southeast Asian Studies, University of Michigan, pp. 207–228, ISBN 978-0-89148-037-2.
- Luo, Yongxian (2008), "Sino-Tai and Tai–Kadai: Another Look", in Diller, Anthony; Edmondson, Jerold A.; Luo, Yongxian, The Tai–Kadai Languages, Routledge Language Family Series, Psychology Press, pp. 9–28, ISBN 978-0-7007-1457-5.
- Matisoff, James A. (1991), "Sino-Tibetan Linguistics: Present State and Future Prospects", Annual Review of Anthropology 20: 469–504, doi:10.1146/annurev.an.20.100191.002345, JSTOR 2155809.
- Norman, Jerry (1988), Chinese, Cambridge University Press, ISBN 978-0-521-29653-3.
- Ramsey, S. Robert (1987), The Languages of China, Princeton University Press, ISBN 978-0-691-01468-5.
- Ratliff, Martha (2002), "Timing Tonogenesis: Evidence from Borrowing", Proceedings of the Annual Meeting of the Berkeley Linguistics Society 28 (2): 29–41.
- Sidwell, Paul; Blench, Roger (2011), "The Austroasiatic Urheimat: the Southeastern Riverine Hypothesis", in Enfield, N.J., Dynamics of Human Diversity: The Case of Mainland Southeast Asia, Canberra: Pacific Linguistics, pp. 317–345, ISBN 978-0-85883-638-9.