Brenner, Simon
2020-11-15
<p>We introduce a novel dataset of Subjective Assessments of Legibility in Ancient Manuscript Images (SALAMI) to serve as a ground truth for the development of quantitative evaluation metrics in the field of digital text restoration.<br>
This dataset consists of 250 images of 50 manuscript regions with corresponding spatial maps of mean legibility and uncertainty, which are based on a study conducted with 20 experts of philology and paleography.</p>
<p><strong>Description of files:</strong></p>
<ul>
<li>images
<ul>
<li>input - rated test images</li>
<li>mean_score_maps - spatial maps of mean legibility</li>
<li>std_maps - spatial maps of uncertainty (standard deviation of legibility)</li>
</ul>
</li>
<li>src
<ul>
<li>images.json - definition of source images contained in the dataset</li>
<li>users.json - list of participants with their respective properties</li>
<li>assessments.json - the main data generated by our experiments.</li>
<li>salami_proc.py - contains python functions to process the .json files named above</li>
<li>salami_proc_usage.py - uses the functions from salami_proc.py to reproduce the output images and statistical results described in the accompanying paper</li>
<li>salami_llm.R - documents the linear mixed models analysis performed in R</li>
</ul>
</li>
</ul>
<p> </p>
<p> </p>
https://doi.org/10.5281/zenodo.4270352
oai:zenodo.org:4270352
eng
Zenodo
https://zenodo.org/communities/cvl
https://doi.org/10.5281/zenodo.3751713
info:eu-repo/semantics/openAccess
Creative Commons Attribution 4.0 International
https://creativecommons.org/licenses/by/4.0/legalcode
image quality assessment
legibility
historic manuscripts
subjective ratings
SALAMI - Subjective Assessments of Legibility in Ancient Manuscript Images
info:eu-repo/semantics/other