Published May 8, 2023 | Version 1.0
Dataset Open

A Grammatically Annotated Corpus of the Old Latvian Postil of Georg Mancelius

  • 1. Humboldt-Universität zu Berlin

Contributors

Data manager:

Project member:

  • 1. Humboldt-Universität zu Berlin

Description

This grammatically annoted corpus aims at facilitating linguistic research on Old Latvian based on the Postil of Georg Mancelius from the year 1654. The corpus is divided into two subcorpora, "pericopes" and "homilies" to make register related research easier.

The pericopes were annotated using SIL Toolbox and converted to be used in the search-tool ANNIS using the conversion tool PEPPER.

Three formats are provided in this release: 1. the Toolbox files, 2. the transitional Excel files and 3. a zipped folder to be imported into ANNIS.

Created in the project B02, Emergence and change of registers: The case of Lithuanian and Latvian of the CRC 1412 "Register" (funded by the Deutsche Forschungsgemeinschaft: DFG, German Research Foundation: 416591334).

Files

McP_corpus.zip

Files (2.4 MB)

Name Size Download all
md5:c7a065ca2579d9c3f16661a72d41ef2d
2.4 MB Preview Download

Additional details

References

  • Krause, Thomas (2019). ANNIS: A graph-based query system for deeply annotated text corpora. PhD thesis. Humboldt-Universität zu Berlin, Mathematisch-Naturwissenschaftliche Fakultät. doi: 10.18452/19659.
  • Andronova, Everita (2007). The Corpus of Early Written Latvian: current state and future tasks. Proceedings of the Corpus Linguistics Conference. CL2007. University of Birmingham, UK. 27-30 July 2007. Edited by Matthew Davies, Paul Rayson, Susan Hunston, Pernilla Danielsson. ISSN 1747-9398.