Lexis and tradition: variation in the vocabulary of Sanskrit Mahāyāna literature - datasets
Description
Lexical datasets containing annotated concordances of words pertaining to the conceptual domains of language and conceptualisation in Buddhist Sanskrit literature. The smaller dataset contains linguistic annotations; the larger contains only metadata. The concordances were extracted from the segmented Sanskrit corpus (DOI: 10.5281/zenodo.3526665).
These datasets were created as part of the project 'Lexis and Tradition: variation in the vocabulary of Sanskrit Mahāyāna literature', funded by the British Academy through a Newton International Fellowship (NF161436) and hosted at the Department of Theology and Religious Studies at King's College London under the supervision of Prof. Henrietta Kate Crosby.
Dr. Bruno Galasek-Hul and Luis Quiñones have assisted me with semantic annotations thanks to funding from the Mangalam Research Center.
The repository also contains R scripts for text clustering on the basis of the lexical data provided.
The annotated dataset can be explored interactively at:
- https://ligeialugli.shinyapps.io/VisualDictionaryOfBuddhistSanskrit/
- https://ligeialugli.shinyapps.io/VisualThesaurusOfBuddhistSanskrit/
Versions 1.1 and 1.2 are updates that correct some mistakes in the metadata and add a few new lemmata.