Dataset Open Access

Webis-ConcluGen-2021

Syed, Shahbaz; Al Khatib, Khalid; Alshomary, Milad; Wachsmuth, Henning; Potthast, Martin

Update: Patch 1.0.1 removes duplicates (up to 5% of the examples in the base version) and re-indexes the examples with new "id" values. The argument and conclusion are now merged into a single csv file for each split to be easily loaded as dataframes. We also include the test set used for manual evaluation (manual_evaluation_arguments.csv) to support qualitative comparisons. 

The corpus contains 130.519 (argumentative text, conclusion) pairs. There are four variations of this corpus:

  1. base:  (argumentative text, conclusion) pairs
  2. topic: (topic encoded argumentative text, conclusion) pairs
  3. targets: (topic and target encoded argumentative text, conclusion) pairs
  4. aspects: (topic and aspect encoded argumentative text, conclusion) pairs

Each variation is split into train, val, and test files. Additionally, the test set used for automatic evaluation is provided (automatic_evaluation_test_set.csv) which must be used for any quantitative evaluation. 

Files (463.1 MB)
Name Size
webis-conclugen-2021-v1.0.1.zip
md5:951013f68e09d1c5b81d0d933d66e7ce
239.2 MB Download
webis-conclugen-2021.zip
md5:a9370e32793a76b8ee1a792fdaa0b055
223.9 MB Download
145
66
views
downloads
All versions This version
Views 14527
Downloads 663
Data volume 14.8 GB702.2 MB
Unique views 13324
Unique downloads 462

Share

Cite as