There is a newer version of the record available.

Published July 31, 2018 | Version v1
Dataset Open

Middle Dutch syllabified words

  • 1. University of Antwerp

Description

Specifics of the data:

  • Text file containing 43,703 syllabified Middle Dutch words, taken from the Corpus Van Reenen-Mulder. This database, created by Pieter van Reenen and Maaike Mulder at the Free University Amsterdam, contains about 2,500 Middle Dutch charters totalling about 750,000 tokens. The charters were written in the Netherlands and Flanders between 1300 and 1400.
  • The 43,703 syllabified words in this list are the unique words of the Corpus Van Reenen-Mulder. The count is approximate, however, because words containing diacritic symbols (which mark abbreviations, clitics, or unclear parts of the original charter) were disregarded when assembling the data.
  • A hyphen (-) is used as the syllable separator.
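
The separator convention described above can be sketched in Python. This is a minimal sketch, not part of the dataset: it assumes one syllabified word per line, and the helper names `split_syllables` and `syllable_counts` are hypothetical. The sample words are made up to show the format and are not taken from the data.

```python
from collections import Counter


def split_syllables(word: str) -> list[str]:
    """Split a hyphen-separated word into its syllables."""
    return [s for s in word.strip().split("-") if s]


def syllable_counts(lines) -> Counter:
    """Count how many words have 1, 2, 3, ... syllables."""
    counts = Counter()
    for line in lines:
        syllables = split_syllables(line)
        if syllables:  # skip blank lines
            counts[len(syllables)] += 1
    return counts


# Illustrative entries in the assumed format (one word per line);
# these are NOT copied from the dataset.
sample = ["sce-pe-nen", "voor-seid", "dat", ""]

print(split_syllables(sample[0]))
print(dict(syllable_counts(sample)))
```

To run this on the real file, pass an open file handle to `syllable_counts`, since iterating over a text file yields its lines. The file name and encoding are not given in this record and must be checked against the download.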

Files

corpus_viz.pdf

Files (668.4 kB)

Name Size
md5:42bbfcf8169b988826f39cbffa577dd4 157.4 kB
md5:9ce20d8307351cac8f3554a973cbc7c0 511.0 kB

Additional details

References

  • Gosse Bouma & Ben Hermans. Syllabification of Middle Dutch. In F. Mambrini, M. Passarotti, and C. Sporleder, editors, Proceedings of the Second Workshop on Annotation of Corpora for Research in the Humanities, pp. 27-39, 2012.
  • Pieter van Reenen & Maaike Mulder. Een gegevensbank van 14de-eeuwse Middelnederlandse dialecten op computer. Lexikos 3, pp. 259-281, 1993.