Software Open Access
{ "inLanguage": { "alternateName": "eng", "@type": "Language", "name": "English" }, "description": "<p>This folder contains R code for a rule-based Buddhist Sanskrit Segmenter and Lemmatiser, as well as data necessary to use and evaluate the Segmenter and explanatory materials.</p>\n\n<p>The segmenter has been tested on 639 sentences from 13 Buddhist text (9 s\u016btras, 4 \u015b\u0101stra) and has been evaluated as achieving 97% accuracy.</p>\n\n<p>The code and materials contained in this folder have been developed as part of a Newton International Fellowship at King's College London, funded by the British Academy (NF161436)</p>\n\n<p> </p>\n\n<p><strong>Contents</strong></p>\n\n<p>R code for segmentation, lemmatisation and evaluation (includes instructions to run code)</p>\n\n<p>powerpoint presentation with background and explanation of project</p>\n\n<p>Wordlists and Wordlists documentation</p>\n\n<p>ngrams and stems frequency tables necessary for segmentation</p>\n\n<p>gold standard set of manually segmented and stemmed sentences for evaluation</p>\n\n<p>set of raw sentences for evaluation</p>\n\n<p>evaluation of Krisha et al. seq2seq segmenter on Buddhist sentences for reference purposes</p>\n\n<p> </p>\n\n<p>This segmenter has been used to prepare the Sanskrit Corpus at DOI 10.5281/zenodo.3457822</p>", "license": "https://creativecommons.org/licenses/by/4.0/legalcode", "creator": [ { "affiliation": "King's College London", "@id": "https://orcid.org/0000-0003-0473-4290", "@type": "Person", "name": "Ligeia Lugli" } ], "url": "https://zenodo.org/record/3459219", "datePublished": "2019-09-24", "version": "1", "keywords": [ "Buddhist Sanskrit", "Natural Language Processing" ], "@context": "https://schema.org/", "identifier": "https://doi.org/10.5281/zenodo.3459219", "@id": "https://doi.org/10.5281/zenodo.3459219", "@type": "SoftwareSourceCode", "name": "Buddhist Sanskrit Segmenter" }
All versions | This version | |
---|---|---|
Views | 111 | 33 |
Downloads | 557 | 463 |
Data volume | 577.2 MB | 383.3 MB |
Unique views | 101 | 30 |
Unique downloads | 448 | 406 |