Published December 31, 2018 | Version v1
Journal article Open

Hamburg corpora for indigenous Northern Eurasian languages

  • 1. Universität Hamburg
  • 2. Universität Hamburg, Lomonosov Moscow State University

Description

The long-term INEL project (2016–2033), carried out at the University of Hamburg, aims to develop
digital linguistic corpora and supporting infrastructure for a number of selected languages of Northern
Eurasia. At present, corpora of Selkup, Kamas and Dolgan are being created. The project builds upon
existing materials from various archive sources, including the Selkup archive of Angelina I. Kuzmina preserved
at the University of Hamburg, Kamas audio recordings from the archives in Tartu and Helsinki,
and Dolgan recordings provided by the House of the Cultures of Taimyr Peninsula. All the texts in the
corpora are provided with phonological transcription, morphological interlinear glossing, and free translations;
selected subsets also bear additional annotations for semantic and syntactic features, information
status of referents, borrowings and code-switching. The corpora are intended for typologically aware grammatical
research but may also be of interest to a wider audience. A number of satellite information resources
are also being developed, contributing towards a more efficient research infrastructure.

Files

Arkhipov-Däbritz_2018_hamburg-corpora.pdf

Size: 873.4 kB
md5:9656398de152305e2ec1a50d21fe464f