Pangloss Collection
The Pangloss Collection is an open archive developed since 1994. It contributes to the documentation and study of the world’s languages by providing free access to documents of connected, spontaneous speech (as well as word lists), mostly in endangered or under-resourced languages, recorded in their cultural context and transcribed in consultation with native speakers. The Pangloss Collection hosts media files (audio and video recordings), text annotations, and metadata. The annotations consist of transcription, free translation, and (optional) word or morpheme glosses. Annotations are time-aligned with the recordings. A web interface makes these annotations accessible online in an interlinear display format, in synchrony with the audio, using any standard browser. The structure of the XML documents makes them accessible to searching and indexing. Long-term preservation is guaranteed through the CoCoON respository (Collection de Corpus Audio Numériques).