    In 2016-2017, a 10-person team worked for three months to create a multi-use, balanced corpus similar to the Brown Corpus (searchable online). The speech section of the completed Nanhai Corpus (named for its sponsors, the Nanhai Nunnery of Taiwan) is a 289,497-word corpus of collected, transcribed, and word-segmented natural speech in the local Tibetan of Dharamsala (རྡ་ས་ཁུལ་གྱི་སྐད།).