Published May 12, 2018 | Version v1
Conference paper Open

Preservation of Original Orthography in the Construction of an Old Irish Corpus

  • 1. National University of Ireland Galway

Description

This paper will examine the process of creating a digital corpus based on the Würzburg glosses, the earliest large collection of glosses written in the Irish language. Modern editorial standards applied in publications of these glosses can alter spelling, punctuation, and even the semantic meaning of a sentence where one word is used in place of another. Therefore, an understanding of the original orthography utilised by Old Irish scribes is important in determining the orthography which should be utilised in a modern digital corpus. This paper will outline why the text of the Würzburg glosses as it appears in Thesaurus Palaeohibernicus is the best candidate for digitisation. The automated digitisation and proofing process of the corpus will be outlined, and details will be given of a tag-set utilised within the digital corpus in order to preserve information present in Thesaurus Palaeohibernicus as metadata.

Files

20_W26.pdf

Files (144.7 kB)

Name Size Download all
md5:127e3f35c0166c763c2a76820d6a5c1f
144.7 kB Preview Download