|Region||Great Nicobar Island|
Partially because the native peoples of the Andaman and Nicobar Islands are protected from outside researchers, Shompen is poorly described, with much of the data from the 19th century, and the little 20th–21st century data of poor quality. However, Roger Blench and Paul Sidwell demonstrate that it is an Austroasiatic language, though they suggest that it might constitute a distinct branch of that family.
- Dakade (10 km northeast of Pulo-babi, a Nicobarese village on the western coast of Great Nicobar; 15 persons and 4 huts)
- Puithey (16 km southeast of Pulo-babi)
- Tataiya (inhabited by the Dogmar River Shompen group, who had moved from Tataiya to Pulo-kunyi between 1960 and 1977)
It was a century before more data became available. 70 words were published in 1995, and then in 2003 substantial new data was published, the most extensive so far. However, Blench and Sidwell (2011) note that the 2003 book is at least partially plagiarized, the authors show little sign of understanding the material, which is full of anomalies and inconsistencies, and that the data appears to have been taken from an earlier source or sources, perhaps from the colonial era. Van Driem (2008) found it too difficult to work with, but Blench and Sidwell made an attempt at analyzing and retranscribing the data, based on comparisons of Malay loanwords and identifiable cognates with other Austroasiatic languages. They conclude that the data in the 1995 and 2003 publications come from either the same language or two closely related languages.
Although traditionally lumped in with the Nicobarese languages, which form a branch of the Austroasiatic language family, there was little evidence to support this assumption during the 20th century. Man (1886) notes that there are very few Shompen words that "bear any resemblance" to Nicobarese, and also that "in most instances" words differ between the two Shompen groups he worked with. For example, the word for "back (of the body)" is given as gikau, tamnōi, and hokōa in different sources; "to bathe" as pu(g)oihoɔp and hōhōm; and "head" as koi and fiāu. In some of these cases, this may be a matter of borrowed versus native vocabulary, as koi appears to be Nicobarese, but it also suggests that Shompen is not a single language.
Based on the 1997 data, however, van Driem concluded that Shompen was a Nicobarese language.
Blench and Sidwell note many cognates with both Nicobarese and with Jahaic in the 2003 data, including many words found only in Nicobarese or only in Jahaic (or sometimes also in Senoic), and also note that Shompen shares historical phonological developments with Jahaic. Given the likelihood of borrowing from Nicobarese, this suggests that Shompen might be a Jahaic or at least an Aslian language, or perhaps a third branch of a Southern Austroasiatic family alongside Aslian and Nicobarese.
It is not clear if the following description applies to all varieties of Shompen, or how phonemic it is where it does apply.
The consonants are attested as follows:
Many Austroasiatic roots with final nasal stops, *m *n *ŋ, appear in Shompen with voiced oral stops [b d ɡ]. This resembles Aslian, and especially Jahaic, where historical final nasals have become prestopped or fully oral. However, whereas in Jahaic nasals conflated with oral stops, in Shompen oral stops appear to have been lost first, only to be reacquired as nasals became oral. There are also, however, numerous words that retain final nasal stops. It is not clear if borrowing from Nicobarese is enough to explain all of these—Shompen could have been partially relexified under the influence of Nicobarese, or consultants might have given Nicobarese words during elicitation—though it was clearly the case for some.
Other historical sound changes are word-final *r and *l as [w], *r before a vowel as [j], loss of final *h and *s, and the breaking of Austroasiatic long vowels into diphthongs.
- Shompen at Ethnologue (18th ed., 2015)
- Hammarström, Harald; Forkel, Robert; Haspelmath, Martin; Bank, Sebastian, eds. (2016). "Shom Peng". Glottolog 2.7. Jena: Max Planck Institute for the Science of Human History.
- Lal, Parmanand. 1977. Great Nicobar Island: study in human ecology. Calcutta: Anthropological Survey of India, Govt. of India.
- De Roëpstorff, 1875. Vocabulary of dialects spoken in the Nicobar and Andaman islands. 2nd ed. Calcutta.
- EH Man, 1886. "A Brief Account of the Nicobar Islanders, with Special Reference to the Inland Tribe of Great Nicobar." The Journal of the Anthropological Institute of Great Britain and Ireland, 15:428–451.
- EH Man, 1889. A dictionary of the Central Nicobarese language. London: W.H. Allen.
- Rathinasabapathy Elangaiyan et al., 1995. Shompen–Hindi Bilingual Primer Śompen Bhāratī 1. Port Blair and Mysore.
- Subhash Chandra Chattopadhyay & Asok Kumar Mukhopadhyay, 2003. The Language of the Shompen of Great Nicobar: a preliminary appraisal. Kolkata: Anthropological Survey of India.
- for example, [a] is transcribed as short ⟨a⟩ but schwa [ə] as long ⟨ā⟩, the opposite of normal conventions in India or elsewhere
- Roger Blench & Paul Sidwell, 2011. "Is Shom Pen a Distinct Branch?" In Sophana Srichampa and Paul Sidwell, eds. Austroasiatic Studies: Papers from ICAAL 4. Canberra: Pacific Linguistics. (ICAAL, ms)
- George van Driem, 2008. "The Shompen of Great Nicobar Island: New linguistic and genetic data, and the Austroasiatic homeland revisited." Mother Tongue, 13:227–247.