There is a newer version of this record available.

Dataset Open Access

args.me corpus

Yamen Ajjour; Henning Wachsmuth; Johannes Kiesel; Martin Potthast; Matthias Hagen; Benno Stein

The args.me corpus comprises 387 740 arguments. They are crawled from the debate portals Debatewise (14 353 arguments), IDebate.org (13 522 arguments), Debatepedia (21 197 arguments), and Debate.org (338 620 arguments). Moreover, the corpus contains 48 arguments from Canadian Parliament discussions. The arguments are extracted using heuristics that are designed for each debate portal.

These arguments are the ones currently provided through the args.me search engine. Note that the args API does not return the sourceText (which is indexed by args.me an included in this dataset) due to its size.

Cite args.me as Henning Wachsmuth, Martin Potthast, Khalid Al-Khatib, Yamen Ajjour, Jana Puschmann, Jiani Qu, Jonas Dorsch, Viorel Morari, Janek Bevendorff, and Benno Stein. Building an Argument Search Engine for the Web. In 4th Workshop on Argument Mining (ArgMining 2017) at EMNLP, pages 49-59, September 2017. Association for Computational Linguistics.

Cite this dataset as Yamen Ajjour, Henning Wachsmuth, Johannes Kiesel, Martin Potthast, Matthias Hagen, and Benno Stein. Data Acquisition for Argument Search: The args.me corpus. In 42nd German Conference on Artificial Intelligence (KI 2019), September 2019. Springer. and with the DOI of Zenodo.

The development for args.me is hosted in our Gitlab.

This collection is licensed with the Creative Commons Attribution 4.0 International. Individual rights to the content still apply.

Files (1.4 GB)
Name Size
debateorg.zip
md5:0368ee47ce0ec8bed837c7e22c024493
1.2 GB Download
debatepedia.zip
md5:bde8e3ed832c19ca5ed8ed1506a862e8
184.3 MB Download
debatewise.zip
md5:5e5c498a5f657ed7d02e06016e9ce3b1
77.4 MB Download
idebate.zip
md5:5b888c94cce740f1216c063e5e47c74c
20.2 MB Download
parliamentary.zip
md5:c80d932c953b64fb300f13d0d93096bb
27.3 kB Download
  • Yamen Ajjour, Henning Wachsmuth, Johannes Kiesel, Martin Potthast, Matthias Hagen, and Benno Stein. Data Acquisition for Argument Search: The args.me corpus. In 42nd German Conference on Artificial Intelligence (KI 2019), September 2019. Springer.

  • Henning Wachsmuth, Martin Potthast, Khalid Al-Khatib, Yamen Ajjour, Jana Puschmann, Jiani Qu, Jonas Dorsch, Viorel Morari, Janek Bevendorff, and Benno Stein. Building an Argument Search Engine for the Web. In 4th Workshop on Argument Mining (ArgMining 2017) at EMNLP, pages 49-59, September 2017. Association for Computational Linguistics.

4,664
6,138
views
downloads
All versions This version
Views 4,6642,911
Downloads 6,1384,288
Data volume 2.0 TB1.5 TB
Unique views 3,4382,290
Unique downloads 1,9861,260

Share

Cite as