Published June 17, 2019
| Version v4.2019
Dataset
Open
German Political Speeches Corpus
Description
This text archive focuses on German political speeches held by top officials mostly from 1990 onwards, selected according to their political relevance. The currently included speeches come from the following sources:
- Official pages of the German Presidency, Chancellery, Bundestag, Ministry of Foreign Affairs
- Personal pages of the Helmut Kohl archive, Wolfgang Thierse and Norbert Lammert
This resource is available online:
- Online queries on the DWDS website and usage instructions (the text base may be newer than the downloadable archives)
- http://purl.org/corpus/german-speeches
The files below consist of texts with metadata encoded in XML format. For appropriate tooling see:
- Python tutorial using the speeches: Natural Language Processing — Einsteigen und Loslegen!
- CorpusExplorer, corpus linguistics and text mining software featuring the speeches
- List of off-the-shelf NLP tools for German
This is work in progress, updated and extended versions will follow.
Files
German-political-speeches-2019-release.zip
Files
(28.1 MB)
Name | Size | Download all |
---|---|---|
md5:668203366d7b9b81daa51935a89c70a6
|
28.1 MB | Preview Download |
Additional details
Related works
- Is documented by
- Conference paper: https://hal.archives-ouvertes.fr/hal-01798703/document (URL)