There is a newer version of the record available.

Published March 31, 2020 | Version v1.0.5
Dataset Open

GermaParl Corpus of Plenary Protocols (v1.0.5)

  • 1. University of Duisburg-Essen

Description

The GermaParl Corpus has been prepared in the PolMine Project (http://polmine.github.io) and comprises all protocols of plenary sessions in the German Bundestag (1996 - 2013). This version of the corpus is based on plain text documents issued by the German Bundestag. For a period between 2008 and 2010, txt files are not available. To fill the gap, pdf documents were processed. As part of the corpus preparation pipeline, the data has been linguistically annotated (using the TreeTagger) and imported into the Corpus Workbench (CWB). See the GermaParl documentation website (http://polmine.github.io/GermaParl) for further information.

Files

Files (963.0 MB)

Name Size Download all
md5:f7aa0907b30283bddb7c4d8919fb83a8
963.0 MB Download