There is a newer version of this record available.

Dataset Open Access

GermaParl Corpus of Plenary Protocols (v1.0.5)

Blaette, Andreas

The GermaParl Corpus has been prepared in the PolMine Project (http://polmine.github.io) and comprises all protocols of plenary sessions in the German Bundestag (1996 - 2013). This version of the corpus is based on plain text documents issued by the German Bundestag. For a period between 2008 and 2010, txt files are not available. To fill the gap, pdf documents were processed. As part of the corpus preparation pipeline, the data has been linguistically annotated (using the TreeTagger) and imported into the Corpus Workbench (CWB). See the GermaParl documentation website (http://polmine.github.io/GermaParl) for further information.

Files (963.0 MB)
Name Size
germaparl_v1.0.5.tar.gz
md5:f7aa0907b30283bddb7c4d8919fb83a8
963.0 MB Download
298
1,120
views
downloads
All versions This version
Views 298163
Downloads 1,120113
Data volume 996.1 GB108.8 GB
Unique views 184113
Unique downloads 50678

Share

Cite as