• Dirk Schmidt deposited A Speech Corpus of Dharamsala Tibetan in the group Group logo of History of Linguistics and Language StudyHistory of Linguistics and Language Study on Humanities Commons 4 years ago

    In 2016-2017, a 10-person team worked for 3 months with the goal of creating a multi-use, balanced corpus, similar to the Brown Corpus (BROWN Corpus search online). The speech section of the completed Nanhai Corpus—named for its sponsors, the Nanhai Nunnery of Taiwan—is a 289,497 word corpus of collected, transcribed, and word-split natural speech of local Dharamsala Tibetan (རྡ་ས་ཁུལ་གྱི་སྐད།).