There is a newer version of the record available.

Published April 1, 2020 | Version 2020-04-01
Dataset Open

args.me corpus

  • 1. Bauhaus Universität Weimar
  • 2. Paderborn University
  • 3. Leipzig University
  • 4. Martin Luther University of Halle-Wittenberg

Description

The args.me corpus comprises 387 740 arguments. They are crawled from the debate portals Debatewise (14 353 arguments), IDebate.org (13 522 arguments), Debatepedia (21 197 arguments), and Debate.org (338 620 arguments). Moreover, the corpus contains 48 arguments from Canadian Parliament discussions. The arguments are extracted using heuristics that are designed for each debate portal.

These arguments are the ones currently provided through the args.me search engine. Note that the args API does not return the sourceText (which is indexed by args.me an included in this dataset) due to its size.

Cite args.me as Henning Wachsmuth, Martin Potthast, Khalid Al-Khatib, Yamen Ajjour, Jana Puschmann, Jiani Qu, Jonas Dorsch, Viorel Morari, Janek Bevendorff, and Benno Stein. Building an Argument Search Engine for the Web. In 4th Workshop on Argument Mining (ArgMining 2017) at EMNLP, pages 49-59, September 2017. Association for Computational Linguistics.

Cite this dataset as Yamen Ajjour, Henning Wachsmuth, Johannes Kiesel, Martin Potthast, Matthias Hagen, and Benno Stein. Data Acquisition for Argument Search: The args.me corpus. In 42nd German Conference on Artificial Intelligence (KI 2019), September 2019. Springer. and with the DOI of Zenodo.

The development for args.me is hosted in our Gitlab.

This collection is licensed with the Creative Commons Attribution 4.0 International. Individual rights to the content still apply.

Files

debateorg.zip

Files (1.4 GB)

Name Size Download all
md5:0368ee47ce0ec8bed837c7e22c024493
1.2 GB Preview Download
md5:bde8e3ed832c19ca5ed8ed1506a862e8
184.3 MB Preview Download
md5:5e5c498a5f657ed7d02e06016e9ce3b1
77.4 MB Preview Download
md5:5b888c94cce740f1216c063e5e47c74c
20.2 MB Preview Download
md5:c80d932c953b64fb300f13d0d93096bb
27.3 kB Preview Download

Additional details

References

  • Yamen Ajjour, Henning Wachsmuth, Johannes Kiesel, Martin Potthast, Matthias Hagen, and Benno Stein. Data Acquisition for Argument Search: The args.me corpus. In 42nd German Conference on Artificial Intelligence (KI 2019), September 2019. Springer.
  • Henning Wachsmuth, Martin Potthast, Khalid Al-Khatib, Yamen Ajjour, Jana Puschmann, Jiani Qu, Jonas Dorsch, Viorel Morari, Janek Bevendorff, and Benno Stein. Building an Argument Search Engine for the Web. In 4th Workshop on Argument Mining (ArgMining 2017) at EMNLP, pages 49-59, September 2017. Association for Computational Linguistics.