Published January 2, 2024 | Version v. 2.1
Technical note Open

TibSchol HTR tools

Description

In the framework of the project TibSchol – “The Dawn of Tibetan Buddhist Scholasticism (11th-13th c.)” (https://www.oeaw.ac.at/projects/tibschol)[1] – hosted at the Institute for the Cultural and Intellectual History of Asia of the Austrian Academy of Sciences, a baseline model for layout analysis (LA) and two handwritten text recognition (HTR) models, for dpe tshugs and ’bru tsha Tibetan cursive scripts respectively, have been trained on the platform Transkribus (https://readcoop.eu/transkribus/) to produce machine-readable e-texts from handwritten documents. In addition, Python scripts have been created to pre-process images in order to improve the HTR results. This document presents these tools and explains how the LA and HTR models, which have been made public on the Transkribus platform, can be used by anyone.


[1] The project TibSchol has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No 101001002). These tools are being shared within the project team’s responsibility. TibSchol must not be held responsible for the accuracy of the output when using these tools or for future changes that would have an impact on their operation. The European Research Council or the European Commission must not be held responsible for their further use.

Files

TibSchol HTR tools - v2_1.pdf

Files (441.9 kB)

Name Size Download all
md5:485d796ce60736808dc361d36a8d6932
441.9 kB Preview Download

Additional details

Related works

Describes
Software: 10.5281/zenodo.10450672 (DOI)
Software: 10.5281/zenodo.10450684 (DOI)
Software: 10.5281/zenodo.10450698 (DOI)