{
  "DOI": "10.5281/zenodo.4105765",
  "abstract": "The Webis EditorialSum Corpus consists of 1330 manually curated extractive summaries for 266 news editorials spanning three diverse portals: Al-Jazeera, Guardian and Fox News. Each editorial has 5 summaries, each labeled for overall quality and fine grained properties such as thesis-relevance, persuasiveness, reasonableness, self-containedness.\n\n\nThe files are organized as follows:\n\n\n\ncorpus.csv - Contains all the editorials and their acquired summaries\n\n\n\nNote: (X = [1,5] for five summaries)\n\n\n- article_id : Article ID in the corpus\n- title : Title of the editorial\n- article_text : Plain text of the editorial\n- summary_{X}_text : Plain text of the corresponding summary\n- thesis_{X}_text : Plain text of the thesis from the corresponding summary\n- lead : top 15% of the editorial's segments\n- body : segments between lead and conclusion sections\n- conclusion : bottom 15% of the editorial's segments\n- article_segments: Collection of paragraphs, each further divided into collection of segments containing:\n\u00a0{ \"number\": segment order in the editorial,\n\u00a0\u00a0 \"text\" : segment text,\n\u00a0\u00a0 \"label\": ADU type\n\u00a0}\n- summary_{X}_segments: Collection of summary segments containing:\n{ \"number\": segment order in the editorial,\n\u00a0 \"text\" : segment text,\n\u00a0 \"adu_label\": ADU type from the editorial,\n\u00a0 \"summary_label\": can be 'thesis' or 'justification'\n}\n\n\n\nquality-groups.csv - Contains the IDs for high(and low)-quality summaries for each quality dimension per editorial\n\nFor example: article_id 2 has four high_quality summaries (summary_1, summary_2, summary_3, summary_4) and one low_quality summary (summary_5) in terms of overall quality.\nThe summary texts can be obtained from corpus.csv respectively.\n\n\n\u00a0\n\n\n\u00a0\n\n\n\u00a0",
  "author": [
    {
      "family": "Syed",
      "given": "Shahbaz"
    },
    {
      "family": "El Baff",
      "given": "Roxanne"
    },
    {
      "family": "Al-Khatib",
      "given": "Khalid"
    },
    {
      "family": "Kiesel",
      "given": "Johannes"
    },
    {
      "family": "Stein",
      "given": "Benno"
    },
    {
      "family": "Potthast",
      "given": "Martin"
    }
  ],
  "id": "4105765",
  "issued": {
    "date-parts": [
      [
        "2020",
        "10",
        "19"
      ]
    ]
  },
  "language": "eng",
  "publisher": "Zenodo",
  "title": "Webis EditorialSum Corpus 2020",
  "type": "dataset"
}