The Pangloss Collection is a digital library whose objective is to store and facilitate access to audio recordings in endangered languages of the world. Developed by the LACITO centre of CNRS in Paris, the collection provides free online access to documents of connected, spontaneous speech, in otherwise poorly documented languages of all continents.
A sound archive with synchronized transcriptions
For the science of linguistics, language is first and foremost spoken language. The medium of spoken language is sound. The Pangloss collection gives access to original recordings simultaneously with transcriptions and translations, as a resource for further research. After being recorded in its cultural context, texts have been transcribed in collaboration with native speakers.
A structured, open architecture
The archived data is structured in accordance with the latest data-processing standards, as open architecture, in an open format, and may be downloaded under a Creative Commons license. The software used to prepare and disseminate it is open-source. The Pangloss collection is a member of the OLAC network of archival repositories.
- Michailovsky, Boyd, Martine Mazaudon, Alexis Michaud, Séverine Guillaume, Alexandre François & Evangelia Adamou. 2014. Documenting and researching endangered languages: the Pangloss Collection. Language Documentation & Conservation 8, pp. 119-135.
- See the list of all Pangloss resources on the “CoCoON” homepage (retrieved 3 March 2017).
- Number of “Pangloss” entries on LACITO’s language map (retrieved 2 Apr 2016).
- Homepage of the Pangloss Collection
- Sample text from the collection: a story in the Chang Naga language of north-eastern India, presented in bilingual format.
- Access to the Pangloss collection through LACITO’s interactive language map
- Access to the Pangloss collection through the OLAC search interface.
- Access to the Pangloss collection through the CoCoON search interface.