Dataset Open Access
Yamen Ajjour;
Henning Wachsmuth;
Johannes Kiesel;
Martin Potthast;
Matthias Hagen;
Benno Stein
The args.me corpus (version 1.0, cleaned) comprises 382 545 arguments crawled from four debate portals in the middle of 2019. The debate portals are Debatewise, IDebate.org, Debatepedia, and Debate.org. The arguments are extracted using heuristics that are designed for each debate portal.
This version contains the same arguments as Version 1.0, but cleaned as described in the corresponding publication.
Cite this dataset as Yamen Ajjour, Henning Wachsmuth, Johannes Kiesel, Martin Potthast, Matthias Hagen, and Benno Stein. Data Acquisition for Argument Search: The args.me corpus. In 42nd German Conference on Artificial Intelligence (KI 2019), September 2019. Springer. and with the DOI of Zenodo.
Cite the cleaning procedure as TODO.
The development for args.me is hosted in our Gitlab.
This collection is licensed with the Creative Commons Attribution 4.0 International. Individual rights to the content still apply.
Name | Size | |
---|---|---|
argsme-1.0-cleaned.zip
md5:fb0837103a4860e1d4536174f55b12c3 |
236.8 MB | Download |
Yamen Ajjour, Henning Wachsmuth, Johannes Kiesel, Martin Potthast, Matthias Hagen, and Benno Stein. Data Acquisition for Argument Search: The args.me corpus. In 42nd German Conference on Artificial Intelligence (KI 2019), September 2019. Springer.
Henning Wachsmuth, Martin Potthast, Khalid Al-Khatib, Yamen Ajjour, Jana Puschmann, Jiani Qu, Jonas Dorsch, Viorel Morari, Janek Bevendorff, and Benno Stein. Building an Argument Search Engine for the Web. In 4th Workshop on Argument Mining (ArgMining 2017) at EMNLP, pages 49-59, September 2017. Association for Computational Linguistics.
All versions | This version | |
---|---|---|
Views | 4,259 | 761 |
Downloads | 4,435 | 416 |
Data volume | 1.5 TB | 98.5 GB |
Unique views | 3,142 | 658 |
Unique downloads | 1,630 | 395 |