|Region||Great Nicobar Island|
|≈ 400 (2004)|
Partially because the native peoples of the Andaman and Nicobar Islands are protected from outside researchers, Shompen is poorly described, with much of the data from the 19th century, and the little 20th–21st century data of poor quality. However, Roger Blench and Paul Sidwell demonstrate that it is an Austroasiatic language, though they suggest that it might constitute a distinct branch of that family.
It was a century before more data became available. 70 words were published in 1995, and then in 2003 substantial new data was published, the most extensive so far. However, Blench and Sidwell (2011) note that the 2003 book is at least partially plagiarized, the authors show little sign of understanding the material, which is full of anomalies and inconsistencies, and that the data appears to have been taken from an earlier source or sources, perhaps from the colonial era. Van Driem (2008) found it too difficult to work with, but Blench and Sidwell made an attempt at analyzing and retranscribing the data, based on comparisons of Malay loan words and identifiable cognates with other Mon–Khmer languages. They conclude that the data in the 1995 and 2003 publications come from either the same language or two closely related languages.
Although traditionally lumped in with the Nicobarese languages, which form a branch of the Austroasiatic (Mon–Khmer) language family, there was little evidence to support this assumption during the 20th century. Man (1886) notes that there are very few Shompen words which "bear any resemblance" to Nicobarese, and also that "in most instances" words differ between the two Shompen groups he worked with. For example, the word for "back (of the body)" is given as gikau, tamnōi, and hokōa in different sources; "to bathe" as pu(g)oihoɔp and hōhōm; and "head" as koi and fiāu. In some of these cases, this may be a matter of borrowed versus native vocabulary, as koi appears to be Nicobarese, but it also suggests that Shompen is not a single language.
Based on the 1997 data, however, van Driem concluded that Shompen was a Nicobarese language.
Blench and Sidwell note many cognates with both Nicobarese and with North Aslian in the 2003 data, including many words found only in Nicobarese or only in North Aslian (or sometimes North and Central Aslian), and also note that Shompen shares historical phonological developments with North Aslian. Given the likelihood of borrowing from Nicobarese, this suggests that Shompen might be a North Aslian or at least Aslian language, or perhaps a third branch of a Southern Mon–Khmer family alongside Aslian and Nicobarese.
It is not clear if the following description applies to all varieties of Shompen, or how phonemic it is where it does apply.
The consonants are attested as follows:
Many Mon–Khmer roots with final nasal stops, *m *n *ŋ, appear in Shompen with voiced oral stops [b d ɡ]. This resembles Aslian, and especially North Aslian, where historical final nasals have become prestopped or fully oral. However, whereas in North Aslian nasals conflated with oral stops, in Shompen oral stops appear to have first been lost, only to be reacquired as nasals became oral. There are also, however, numerous words which retain final nasal stops. It is not clear if borrowing from Nicobarese is enough to explain all of these—Shompen could have been partially relexified under the influence of Nicobarese, or consultants might have given Nicobarese words during elicitation—though it was clearly the case for some.
Other historical sound changes are word-final *r and *l as [w], *r before a vowel as [j], loss of final *h and *s, and the breaking of Mon–Khmer long vowels into diphthongs.
- Shompen at Ethnologue (17th ed., 2013)
- Nordhoff, Sebastian; Hammarström, Harald; Forkel, Robert; Haspelmath, Martin, eds. (2013). "Shompen". Glottolog 2.2. Leipzig: Max Planck Institute for Evolutionary Anthropology.
- De Roëpstorff, 1875. Vocabulary of dialects spoken in the Nicobar and Andaman islands. 2nd ed. Calcutta.
- EH Man, 1886. "A Brief Account of the Nicobar Islanders, with Special Reference to the Inland Tribe of Great Nicobar." The Journal of the Anthropological Institute of Great Britain and Ireland, 15:428–451.
- EH Man, 1889. A dictionary of the Central Nicobarese language. London: W.H. Allen.
- Rathinasabapathy Elangaiyan et al., 1995. Shompen–Hindi Bilingual Primer Śompen Bhāratī 1. Port Blair and Mysore.
- Subhash Chandra Chattopadhyay & Asok Kumar Mukhopadhyay, 2003. The Language of the Shompen of Great Nicobar: a preliminary appraisal. Kolkata: Anthropological Survey of India.
- for example, [a] is transcribed as short 〈a〉 but schwa [ə] as long 〈ā〉, the opposite of normal conventions in India or elsewhere
- Roger Blench & Paul Sidwell, 2011. "Is Shom Pen a Distinct Branch?" In Sophana Srichampa and Paul Sidwell, eds. Austroasiatic Studies: Papers from ICAAL 4. Canberra: Pacific Linguistics. (ICAAL, ms)
- George van Driem, 2008. "The Shompen of Great Nicobar Island: New linguistic and genetic data, and the Austroasiatic homeland revisited." Mother Tongue, 13:227–247.