There is a newer version of the record available.

Published March 27, 2024 | Version 1.0
Publication Open

Old Permic Universal Dependencies Treebank

  • 1. University of Helsinki
  • 2. Uppsala University

Description

Old Permic, also known as Old Komi, is an extinct variety of Komi that was spoken in the late Middle Ages in the lower Vychegda river basin in northeastern European Russia, in an area that is not Komi-speaking today. The variety is attested in fragmentary records from the 14th to the 17th century, written in both the Old Permic alphabet and the Cyrillic alphabet. These records are of significant importance for research on the history of the Komi language. Here we introduce our work towards a new Universal Dependencies treebank that will contain the existing corpus of Old Permic in a structured, CoNLL-U annotated format. This is the first time this material has been made available in digital form, and our contribution describes the current state of the art and the remaining challenges.
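For readers unfamiliar with CoNLL-U, the following is a minimal sketch of how a file in that format could be read. It relies only on the general CoNLL-U layout (comment lines starting with `#`, ten tab-separated fields per token line, a blank line between sentences). The function name `parse_conllu` and the sample sentence are hypothetical placeholders for illustration; the sample is English filler text, not data from the Old Permic treebank, and the sketch does not give multiword-token or empty-node lines any special treatment.

```python
# Column names defined by the CoNLL-U format (ten tab-separated fields per token).
FIELDS = ["id", "form", "lemma", "upos", "xpos",
          "feats", "head", "deprel", "deps", "misc"]


def parse_conllu(text):
    """Return a list of sentences, each a dict with 'comments' and 'tokens'."""
    sentences = []
    comments, tokens = [], []
    for line in text.splitlines():
        if not line.strip():
            # A blank line closes the current sentence.
            if tokens:
                sentences.append({"comments": comments, "tokens": tokens})
            comments, tokens = [], []
        elif line.startswith("#"):
            # Sentence-level metadata such as sent_id or text.
            comments.append(line[1:].strip())
        else:
            values = line.split("\t")
            if len(values) != len(FIELDS):
                raise ValueError(f"expected 10 fields, got {len(values)}: {line!r}")
            tokens.append(dict(zip(FIELDS, values)))
    # Handle a final sentence that is not followed by a blank line.
    if tokens:
        sentences.append({"comments": comments, "tokens": tokens})
    return sentences


# Placeholder sentence (NOT from the Old Permic corpus), used only to show the layout.
SAMPLE = (
    "# sent_id = placeholder-1\n"
    "# text = Dogs bark\n"
    "1\tDogs\tdog\tNOUN\t_\tNumber=Plur\t2\tnsubj\t_\t_\n"
    "2\tbark\tbark\tVERB\t_\t_\t0\troot\t_\t_\n"
    "\n"
)

if __name__ == "__main__":
    for sentence in parse_conllu(SAMPLE):
        for token in sentence["tokens"]:
            print(token["form"], token["upos"], token["head"], token["deprel"])
```

For real work, an established CoNLL-U reader that also validates the annotation is likely a better choice than a hand-written parser like this one.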

Files

Old-Permic-Universal-Dependencies-treebank.pdf

Files (493.1 kB)

Name: Old-Permic-Universal-Dependencies-treebank.pdf
Size: 493.1 kB
md5: e9c8c8111f8a20cdbed41afd00a819dd
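
The MD5 checksum listed above can be used to confirm that a downloaded copy of the file is intact. The following is a minimal sketch using Python's standard `hashlib` module. The helper names `md5_of_file` and `matches_checksum` are hypothetical, and the script assumes the PDF has been saved in the current directory under the file name shown on this record.

```python
import hashlib
from pathlib import Path

# Checksum as listed on this record.
EXPECTED_MD5 = "e9c8c8111f8a20cdbed41afd00a819dd"


def md5_of_file(path, chunk_size=65536):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest equals the expected value."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed local file name; adjust if the download was saved elsewhere.
    pdf = Path("Old-Permic-Universal-Dependencies-treebank.pdf")
    if pdf.exists():
        print("checksum OK" if matches_checksum(pdf) else "checksum mismatch")
    else:
        print(f"{pdf} not found; download it first")
```

MD5 is suitable here only as a check against accidental corruption during transfer, not as a security guarantee.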

Additional details

Dates

Submitted
2024-03-27
Journal of Data Mining & Digital Humanities