Sambahsa or Sambahsa-Mundialect is an international auxiliary language (IAL) devised by French Dr. Olivier Simon. Among IALs it is categorized as a worldlang. It is based on the Proto Indo-European language (PIE), with a highly simplified grammar. The language was first released on the Internet in July 2007; prior to that, the creator claims to have worked on it for eight years. According to one of the rare academic studies addressing recent auxiliary languages, "Sambahsa has an extensive vocabulary and a large amount of learning and reference material".
The first part of the name of the language, Sambahsa, is taken from two Malay words, sama and bahsa which mean 'same' and 'language' respectively. Mundialect, on the other hand, is a result of combining two Romance words, mondial (worldwide) and dialect (dialect).
Sambahsa tries to preserve the original spellings of words as much as possible and this makes its orthography complex, though still kept regular. There are four grammatical cases: nominative, accusative, dative and genitive.
- 1 Phonology
- 2 Declensions
- 3 Conjugation
- 4 Wordstock
- 5 Sample phrases
- 6 Literary works translated into Sambahsa
- 7 Movies with Sambahsa subtitles
- 8 References
- 9 External links
Sambahsa's phonology  has little to do with Proto-Indo-European phonology, though the majority of its vocabulary comes from PIE. The changes from PIE are not regular, since the creator of Sambahsa has tried to avoid homophones, which would have become common after the elimination of some PIE sounds like laryngeals or some aspirated consonants. However, any person proficient with Proto-Indo-European roots will easily recognize them when they appear in Sambahsa. Unlike some auxlangs like Esperanto, Sambahsa does not use the "one letter = one sound" principle, nor diacritics, but instead relies on a regular and complex system that combines the 26 letters of the basic Latin alphabet. This system was chosen to preserve the recognizability of words taken from West-European languages, where orthography plays a key role. For example, according to the rules of Sambahsa, bureau is pronounced as in French, and point as in English.
Sambahsa has nine vowels (not counting the lengthened form of these vowels), two semi-vowels (IPA: [j] and [w]) and twenty consonants. To help language learners, and because IPA symbols cannot be written with all keyboards, a special simpler system has been developed, called Sambahsa Phonetic Transcription, or SPT.
Compared to other conlangs, Sambahsa words are short, often as short as English words, and highly consonantic. This latter point is in accordance with the PIE background of Sambahsa, where roots have often a consonant-vocal-consonant structure.
Likewise, Sambahsa's accentuation rules are complex but regular, and tend to follow what is often found in German or Italian. This predictability implies that all words with the same orthography are pronounced and stressed the same way as each other. Thus, for example, while German Präsident and Italian presidente are stressed on the "ent" syllable, Sambahsa president is stressed on the "i", since president can also mean "they preside", and a final "ent" never bears the stress. This regularity of accentuation can be compared with English "president" and "to preside", two words that bear the stress on different syllables, though they share the same origin.
In Sambahsa, declensions are only compulsory for pronouns. The declensions of these pronouns (demonstrative/interrogative & relative/personal) are mostly parallel, and often show similarities with their Proto-Indo-European ancestors. Thus, in all Sambahsa declensions, the neuter nominative and accusative are identical, as it was the case in PIE. There are identical forms for the relative and interrogative pronouns, as well for the third person pronoun and the definitive article ("the" in English).
Sambahsa has two numbers (singular and plural; the dual number of PIE has not been preserved) and four grammatical genders : masculine, feminine, neutral and "undetermined". This last gender, which is an innovation from PIE, is used when a noun of uncertain or unknown gender is referred to, and, in the plural, for groups containing elements of different genders. The creator of Sambahsa introduced this non-PIE element to avoid the "gender" dispute found in Esperanto.
Gender is attributed in Sambahsa according to the "true nature" of the noun referred to, as English speakers do with he, she and it.
Sambahsa has four grammatical cases: nominative, accusative, dative and genitive; however, their attribution tries to be as logical as possible, and not arbitrary as in many modern Indo-European languages. The nominative is the case of the Subject, and the form under which words are given in dictionaries. Except for verbs describing a movement or a position (where the required prepositions ought to be used), all transitive verbs must introduce the accusative case in the first place, before an eventual dative case. However, the dependent clause of indirect speech is considered as a direct object, leading to verbs introducing an indirect object, even if there is no visible direct object.
- Is mi antwehrdt od is ne gwehmsiet cras = "He answers (to) me that he won't come tomorrow"
- Is ne mi hat antwohrden = "He hasn't answered (to) me"
- Som yakin od is ghehdsiet kwehre to = "I'm sure that he'll be able to do that".
- Som yakin eysen (genitive plural) imkans = "I'm sure of his abilities".
For substantives and adjectives, there are declined "free endings" (i.e. non-compulsory) used most often in literary context for euphonics or poetry. This system is inspired from the euphonic endings found in the Standard Arabic Language.
In Sambahsa, all verbs are regular, except ses (to be), habe (to have), and woide (to know, in the meaning of French savoir or German wissen). Sambahsa verbs are indicated in dictionaries not under their infinitive form, but their bare stem, because the whole conjugation can be deduced from the form of this stem. The main tenses of Sambahsa are present and past, but many other tenses can be obtained through the use of affixes or auxiliary verbs. Sambahsa uses the following endings, which are close to those found in many Indo-European languages.
|Person||Present and other tenses||Past tense only|
|1° person singular||-o, -m (if the verb ends with a stressed vocalic sound) or nothing (if the last vowel of the verb is unstressed)||-im|
|2° person singular||-s||-(i)st(a)|
|3° person singular||-t||-it|
|1° person plural||-m(o)s||-am|
|2° person plural||-t(e)||-at|
|3° person plural||-e(nt) ("-nt" is compulsory if the verb ends with a stressed vocalic sound)||-(ee)r|
Sambahsa is surely[original research?] unique among auxlangs because of its use of a predictable ablaut system for the past tense and passive past participles. For example, eh within a verbal stem turns to oh. Other verbs that cannot use ablaut can drop their nasal infix, or use an improved version of the De Wahl's rules. Finally, the remaining verbs simply add the past tense endings, which are optional for verbs of the categories described above.
Because of its rather huge vocabulary for an auxlang (as of August 2014, the full Sambahsa-English dictionary contained more than 15000 entries), it is difficult to assess the share of each language in Sambahsa's sprawling wordstock. However, the main layers are (either reconstructed or extrapolated) Indo-European vocabulary, Greco-Roman scientific and technical vocabulary (which is not discussed below, as it is more or less comparable to what is found in English) and multiple sources extending from Western Europe up to Eastern Asia.
The core of Sambahsa's vocabulary is undoubtedly of Indo-European origin. Only a few Sambahsa words can be traced back to pre-Indo-European times (as kamwns, chamois, cf. Basque language : "ahuntz"). Many basic Sambahsa are thus very close to their reconstructed Indo-European counterparts. See (Sambahsa / Proto-Indo-European) : eghi / *H₁eghis (hedgehog), ghelgh / *ghelghe- (gland), pehk / *pek (to comb), skand / *skand (to jump), peungst / *pn̥kʷsti- (fist), wobhel / *wobhel- (weevil), gwah / *gweH₂ (to go), tox / *tòksom ("yew wood" in Sambahsa; "yew" in PIE), treb / *trêbs (dwelling), oit / *H₁òitos (oath), poti / *potis (Sir, lord). But less attested Indo-European vocabulary is found in Sambahsa too. For example, the common Sambahsa word for "person" is anghen, as in semanghen = "someone, somebody", and can be derived from PIE ?*H₂enH₁ǵh, only found in Old Armenian anjn (person) and Old Norse angi (smell). And motic (hoe) may be a cognate of Old Church Slavonic motyka and English "mattock".
Further development from the Indo-European background
Though Sambahsa, like any other conlang, has derivation rules, it sometimes uses backformation too. For example, the relation between Lithuanian bendras (companion), Old Greek pentheros (father-in-law) and Sanskrit bandhu- (companion) is uncertain; however Sambahsa "reconstructs" this root as behndwr from behnd 'to bind'. PIE has *dhéǵhom 'earth' and *dhinéǵh- (with nasal infix) 'to shape, to make pottery'; accordingly, Sambahsa has (di)ghom and dinegh, but the latter can be understood as "to put earth on" if we refer to yug (yoke) and yuneg (to join), both from PIE *yugom and *yunég-.
The Sambahsa word for 'ice pellet' is kersnit; it rests on the word kersen 'frozen snow', itself from Old Norse hjarn, Lithuanian šarma (frost) and Russian serën. But the suffix -it was abstracted from PIE words like *sepit 'grain of wheat' and *H₂elbit 'grain of barley'; thus kersnit can be understood as 'a grain of frozen snow'.
Words common to different language families
A characteristic of Sambahsa is to include words found in different language families, while the most famous auxiliary languages tend to limit themselves to a compilation of Romance vocabulary with some borrowings from the Germanic languages. For example:
- schkaf (cupboard) has cognates both in Germanic and Slavic languages: Russian Шкаф, Polish Szafa, Ukrainian Шафа, Danish Skab, Icelandic Skàpur, Franconian dialect Schaaf and Swedish Skåp.
- Graf (count, as a nobility title) is a German word from Greek "grapheùs" that has been borrowed into many languages including Azerbaijani Qraf, Bulgarian Граф, Czech Hrabě, Danish Greve, Estonian Krahv, Croatian Grof, Hungarian Gróf, Finnish Kreivi, Lithuanian Grafas, Icelandic Greifi and Russian Граф.
- Bicair (mug) is found in German Becher and many other Germanic languages. It comes from Low Latin bicarium and is at the origin of Hungarian Pohár, Italian Bicchiere and Romanian Pahar, all meaning "glass".
- Sambahsa saray means "big hall, palace" and has the same Turkish and Persian origin as English Seraglio but with a meaning closer to its etymology and to Russian сарай (barn).
The Balkan sprachbund
Though they belong to different language families, the languages spoken in South-East European share a number of common grammatical features and of loanwords due to their historical background. That's why Sambahsa includes words from this region.
- Sambahsa schut = "hornless" corresponds to Romanian Șut, Bulgarian/Serbo-Croatian šut; also Albanian shut ‘hornless’.
- Sambahsa potire = "pitcher" comes from Old Greek ποτήρ, like Serbo-Craotian путир, Russian потир, Romanian and Albanian potir.
- Sambahsa keramide = "coating" comes from Greek κεραμίδα, which has given, among others, Romanian cărămidă (brick) or Arabic قرميدة = qirmîda(t) = "tile".
Words from Arabic and Parsi
A significant part of Sambahsa's vocabulary comes from Arabic language and Persian. Both languages have extensively provided loanwords to a lexical continuum ranging from the Atlantic Ocean to Indonesia because, respectively, of the spread of Islam and the brilliance of the former Persian civilization. Sambahsa learning materials often call this stratum "Muslim".
- Sambahsa amlak (assets) comes from Arabic أملاك and is found in Turkish emlak (estate) and Persian املاک.
- Sambahsa zina (adultery) comes from Arabic زنا and is found in Persian and many other languages spoken by a majority of Muslims.
- Sambahsa adarb (merlon) comes from Spanish Adarve and Portuguese Adarve from Arabic درب and ultimately Persian در which has its origin in PIE *dhwer just like Sambahsa dwer = "door".
Classical Chinese has heavily influenced the wordstock of neighbouring languages, mostly Japanese, Korean and Vietnamese. As a result, Sambahsa incorporates some "Sinitic" vocabulary, but the phonetic differences between these various languages can be high.
- Sambahsa kjingyow (goldfish) correspond to 金魚, which is read jīnyú in Mandarin Pinyin and kingyo in Japanese.
- Sambahsa geong (fortified palace) corresponds to the Han character 城 read chéng in Mandarin Pinyin, jō in Japanese Goon reading, seong in Korean, and thành in Vietnamese.
Not all Sambahsa "Sinitic" words come from Classical Chinese. The Min Nan language of Southern China provided loanwords to some South-East Asian languages, and some of these borrowings are, in turn, found in Sambahsa.
- Sambahsa pangsit (wonton) is an Indonesian word from Min Nan pian sit, while Mandarin Chinese (Pinyin) has húndùn
- Likewise, Sambahsa loteng (attic) comes from Min Nan lauteng through Indonesian Loteng.
|Kam leitte yu?||How are you?|
|Bahte yu Sambahsa?||Do you speak Sambahsa?|
|No, ne bahm Sambahsa.||No, I don't speak Sambahsa.|
|Marba!||Pleased to meet you!|
Literary works translated into Sambahsa
- The Songs of Bilitis by Pierre Louÿs : Ia Songvs as Bilitis
- Demian by Hermann Hesse : Demian
- The Stranger by Albert Camus : Is Gospoti
- The Little Prince by Antoine de Saint-Exupéry : Is Lytil Prince
- The Gospel of Matthew : Id Euanghelio sekwent Matyah
- Alice's Adventures in Wonderland : Ia Aventures as Alice in Daumsenland published by Evertype
Movies with Sambahsa subtitles
- Revelations (a fan-made movie based on Star Wars): Revelations
- The Hunt for Gollum (a fan-made prequel to the Lord of the Rings) : Sayd po Gollum
- Born of Hope (a fan-made prequel to the Lord of the Rings) : Gnaht Speh
- Home (a French movie by Yann Arthus-Bertrand about environmental threats) : Ghom
- Kaydara (a fan-made movie based on The Matrix) : Kaydara
- Dr. Olivier Simon (2010). "The Official Website of Sambahsa". Retrieved 2011-02-18.
- C.Quilès, of the Dnghu Project, called it "a modern Proto-Indo-European language with an easier verbal and nominal inflection, borrowed [non-translated] IE vocabulary : http://carlosquiles.com/indo-european-language-blog/2008/06/artificial-and-natural-languages/
- Mithridates (2009-05-14). "Why You Should Keep an Eye on Sambahsa". Retrieved 2011-02-18.
- "The Representation of Korean and Other Altaic Languages in Artificial International Auxiliary Languages" in Journal of Universal Language, March 2012, p.153, by Alan Reed Libert.
- A. L. N. Kramer, Willie Koen (1993). Tuttle's Concise Indonesian dictionary: English-Indonesian, Indonesian-English. Charles E. Tuttle Company, Inc. of Rutland, Vermont & Tokyo, Japan. ISBN 0-8048-1864-9.
- Merriam-Webster (2002). Merriam-Webster's French-English dictionary. ISBN 978-0-87779-917-7.
- A full analysis of Sambahsa (written in Esperanto) has been made by S.Auclair in La Riverego n°104, pp. 11-16, http://www.esperanto.qc.ca/files/riverego/Riverego-104.pdf
- Dave MacLeod (2010). "Foreword to the Sambahsa Grammar in English". Retrieved 2011-02-02.
- "The strange quest for a universal "Earth Standard" language" by Esther Inglis-Arkell, 08-17-2012 : http://io9.com/5935563/the-strange-quest-for-a-universal-earth-standard-language
- However, different versions of pronunciation of "r" are admitted, and the "ng" sound (as in English "sing") could be counted as a new sound, distinct from the conjunction of [n] + [g].
- See this link on a French-speaking forum : http://aphil.forumn.net/t844p15-analyse-phonotactique-kotava-esperanto-uropi-et-autres?highlight=analyse+phon%E9tique
- Emile Benveniste, Origine de la formation des noms en Indo-Européen: http://books.google.fr/books?id=OD4IAQAAIAAJ&q=emile+benveniste&dq=emile+benveniste&ei=sttrTfm0EtCC4QbOiajfCQ&sa=X&oi=book_result&ct=result&resnum=6&ved=0CEcQ6AEwBTgK
- R.S.P. Beekes, Comparative Indo-European Linguistics, J.Benjamins.Pub., p.195
- To the exception of the nominative singular masculine, as in Latin, where the relative pronoun is qui, and the interrogative form is quis.
- But the genitive form serves only for the definitive article, while the possessive pronouns have special forms (otherwise, confusions could have arisen).
- Under certain circumstances, the preposition bi can merge with the definite article in its dative form.
- They can be compared to the data provided in Indo-European Linguistics : an introduction by J.Clackson, Cambridge University Press, 2007, pp. 127 & 128.
- J.P Mallory & D.Q. Adams, Encyclopedia of Indo-European Culture, Fitzroy Dearborn Publishers, p.196
- ibidem, p.287
- ibidem, p.639
- See "ciut" in http://en.wikipedia.org/wiki/List_of_Romanian_words_of_possible_Dacian_origin