Dataset Open Access


Al-Khatib, Khalid; Wachsmuth, Henning; Kiesel, Johannes; Hagen, Matthias; Stein, Benno; Göring, Steve

Newer Version of this corpus in the 2018 version can be found here:

The Webis-Editorials-16 corpus is a novel corpus with 300 news editorials evenly selected from three diverse online news portals: Al Jazeera, Fox News, and The Guardian. The aim of the corpus is to study (1) the mining and classification of fine-grained types of argumentative discourse units and (2) the analysis of argumentation strategies pursued in editorials to achieve persuasion. To this end, each editorial contains manual type annotations of all units that capture the role that a unit plays in the argumentative discourse, such as assumption or statistics. The corpus consists of 14,313 units of six different types, each annotated by three professional annotators from the crowdsourcing platform

Files (5.3 MB)
Name Size
5.3 MB Download
  • Khalid Al-Khatib, Henning Wachsmuth, Johannes Kiesel, Matthias Hagen, and Benno Stein. A News Editorial Corpus for Mining Argumentation Strategies. In 26th International Conference on Computational Linguistics (COLING 2016), pages 3433-3443, December 2016. Association for Computational Linguistics

All versions This version
Views 497496
Downloads 156156
Data volume 827.7 MB827.7 MB
Unique views 465464
Unique downloads 148148


Cite as