Published October 20, 2019 | Version 1.2
Dataset Open

Lexis and tradition: variation in the vocabulary of Sanskrit Mahāyāna literature - datasets

  • 1. King's College London

Description

Lexical datasets containing annotated concordances of words pertaining to the conceptual domains of language and conceptualisation in Buddhist Sanskrit Literature. The smaller dataset contains linguistic annotations, the larger only metadata. The concordances have been taken from the segmented Sanskrit corpus 10.5281/zenodo.3526665.

These datasets have been created as part of the project 'Lexis and Tradition: variation in the vocabulary of Sanskrit Mahāyāna literature', funded by the British Academy through a Newton International Fellowship (NF161436) and hosted at the Department of Theology and Religious Studies at King's College London under the supervision of Prof. Henrietta Kate Crosby. 

Dr. Bruno Galasek-Hul and Luis Quiñones have assisted me with semantic annotations thanks to funding from the Mangalam Research Center.

The repository also contains R scripts for text clustering on the basis of the lexical data provided.

the annotated dataset can be interactively explored at:

  1.  https://ligeialugli.shinyapps.io/VisualDictionaryOfBuddhistSanskrit/
  2.  https://ligeialugli.shinyapps.io/VisualThesaurusOfBuddhistSanskrit/

Updated versions 1.1-1.2 correct some mistakes in metadata and add a few new lemmata.

 

Files

Lugli2019_LexisAndTraditionAnnotatedData.csv

Files (10.4 MB)

Name Size Download all
md5:5cb3c5d634cea2092c14207c0865e48e
26.6 kB Download
md5:6c4fb1fcfc6bea0ee269bf12ce1bb86f
18.9 kB Download
md5:2ed068639186b1d2219f931225e6f761
2.1 MB Preview Download
md5:d025c7146365aa21e2619015d0e9e36b
8.2 MB Preview Download