Dataset Open Access

Webis-Editorials-16

Al-Khatib, Khalid; Wachsmuth, Henning; Kiesel, Johannes; Hagen, Matthias; Stein, Benno; Göring, Steve

Newer Version of this corpus in the 2018 version can be found here: https://doi.org/10.5281/zenodo.1340629

The Webis-Editorials-16 corpus is a novel corpus with 300 news editorials evenly selected from three diverse online news portals: Al Jazeera, Fox News, and The Guardian. The aim of the corpus is to study (1) the mining and classification of fine-grained types of argumentative discourse units and (2) the analysis of argumentation strategies pursued in editorials to achieve persuasion. To this end, each editorial contains manual type annotations of all units that capture the role that a unit plays in the argumentative discourse, such as assumption or statistics. The corpus consists of 14,313 units of six different types, each annotated by three professional annotators from the crowdsourcing platform upwork.com.

Files (5.3 MB)
Name Size
corpus-webis-editorials-16.zip
md5:d3166ef735f292705511cd81a4d04072
5.3 MB Download
  • Khalid Al-Khatib, Henning Wachsmuth, Johannes Kiesel, Matthias Hagen, and Benno Stein. A News Editorial Corpus for Mining Argumentation Strategies. In 26th International Conference on Computational Linguistics (COLING 2016), pages 3433-3443, December 2016. Association for Computational Linguistics

165
31
views
downloads
All versions This version
Views 165164
Downloads 3131
Data volume 164.5 MB164.5 MB
Unique views 161160
Unique downloads 3131

Share

Cite as