Published August 12, 2016 | Version 1.0
Dataset Open

ArchiMob corpus Release 1

  • 1. University of Helsinki
  • 2. University of Zurich

Description

The ArchiMob corpus represents the German varieties spoken in Switzerland. It is the first electronic resource containing long samples of transcribed text in Swiss German, and it is intended for studying the spatial distribution of morphosyntactic features and for natural language processing. The current version of the corpus contains 528 381 tokens.

See the project webpage at http://www.spur.uzh.ch/en/departments/korpuslab/ArchiMob.html

Files

ArchiMob_Release1_160812.zip

Files (5.5 MB)

md5:533e40e4a5b3998dc4cddfbaa6beffe2
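The MD5 checksum listed above can be used to confirm that the archive downloaded intact. The following is a minimal sketch, assuming the file was saved as ArchiMob_Release1_160812.zip in the current working directory; the helper names `md5_of_file` and `matches_checksum` are illustrative and not part of the corpus distribution.

```python
import hashlib
from pathlib import Path

# Checksum as published in the file listing for this record.
EXPECTED_MD5 = "533e40e4a5b3998dc4cddfbaa6beffe2"

# Assumption: the archive sits in the current working directory.
ARCHIVE = Path("ArchiMob_Release1_160812.zip")


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected=EXPECTED_MD5):
    """Compare a file's MD5 digest with the expected value, ignoring case."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    if ARCHIVE.exists():
        print("checksum OK" if matches_checksum(ARCHIVE) else "checksum MISMATCH")
    else:
        print(f"{ARCHIVE} not found; download it first")
```

A mismatch most likely indicates an incomplete or corrupted download, in which case fetching the archive again is the simplest remedy.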

Additional details