<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
  <dc:creator>Syed, Shahbaz</dc:creator>
  <dc:creator>El Baff, Roxanne</dc:creator>
  <dc:creator>Al-Khatib, Khalid</dc:creator>
  <dc:creator>Kiesel, Johannes</dc:creator>
  <dc:creator>Stein, Benno</dc:creator>
  <dc:creator>Potthast, Martin</dc:creator>
  <dc:date>2020-10-19</dc:date>
  <dc:description>&amp;lt;p&amp;gt;The Webis EditorialSum Corpus consists of 1330 manually curated extractive summaries for 266 news editorials spanning three diverse portals: Al-Jazeera, Guardian and Fox News. Each editorial has 5 summaries, each labeled for overall quality and fine grained properties such as thesis-relevance, persuasiveness, reasonableness, self-containedness.&amp;lt;/p&amp;gt;

&amp;lt;p&amp;gt;The files are organized as follows:&amp;lt;/p&amp;gt;

&amp;lt;p&amp;gt;&amp;lt;br&amp;gt;
&amp;lt;em&amp;gt;corpus.csv&amp;lt;/em&amp;gt; - &amp;lt;strong&amp;gt;Contains all the editorials and their acquired summaries&amp;lt;/strong&amp;gt;&amp;lt;/p&amp;gt;

&amp;lt;p&amp;gt;&amp;lt;br&amp;gt;
Note: (X = [1,5] for five summaries)&amp;lt;/p&amp;gt;

&amp;lt;p&amp;gt;- article_id : Article ID in the corpus&amp;lt;br&amp;gt;
- title : Title of the editorial&amp;lt;br&amp;gt;
- article_text : Plain text of the editorial&amp;lt;br&amp;gt;
- summary_{X}_text : Plain text of the corresponding summary&amp;lt;br&amp;gt;
- thesis_{X}_text : Plain text of the thesis from the corresponding summary&amp;lt;br&amp;gt;
- lead : top 15% of the editorial&amp;#39;s segments&amp;lt;br&amp;gt;
- body : segments between lead and conclusion sections&amp;lt;br&amp;gt;
- conclusion : bottom 15% of the editorial&amp;#39;s segments&amp;lt;br&amp;gt;
- article_segments: Collection of paragraphs, each further divided into collection of segments containing:&amp;lt;br&amp;gt;
&amp;nbsp;{ &amp;quot;number&amp;quot;: segment order in the editorial,&amp;lt;br&amp;gt;
&amp;nbsp;&amp;nbsp; &amp;quot;text&amp;quot; : segment text,&amp;lt;br&amp;gt;
&amp;nbsp;&amp;nbsp; &amp;quot;label&amp;quot;: ADU type&amp;lt;br&amp;gt;
&amp;nbsp;}&amp;lt;br&amp;gt;
- summary_{X}_segments: Collection of summary segments containing:&amp;lt;br&amp;gt;
{ &amp;quot;number&amp;quot;: segment order in the editorial,&amp;lt;br&amp;gt;
&amp;nbsp; &amp;quot;text&amp;quot; : segment text,&amp;lt;br&amp;gt;
&amp;nbsp; &amp;quot;adu_label&amp;quot;: ADU type from the editorial,&amp;lt;br&amp;gt;
&amp;nbsp; &amp;quot;summary_label&amp;quot;: can be &amp;#39;thesis&amp;#39; or &amp;#39;justification&amp;#39;&amp;lt;br&amp;gt;
}&amp;lt;/p&amp;gt;

&amp;lt;p&amp;gt;&amp;lt;br&amp;gt;
&amp;lt;em&amp;gt;quality-groups.csv&amp;lt;/em&amp;gt; - &amp;lt;strong&amp;gt;Contains the IDs for high(and low)-quality summaries for each quality dimension per editorial&amp;lt;/strong&amp;gt;&amp;lt;br&amp;gt;
&amp;lt;br&amp;gt;
For example: article_id 2 has four high_quality summaries (summary_1, summary_2, summary_3, summary_4) and one low_quality summary (summary_5) in terms of overall quality.&amp;lt;br&amp;gt;
The summary texts can be obtained from corpus.csv respectively.&amp;lt;/p&amp;gt;

&amp;lt;p&amp;gt;&amp;nbsp;&amp;lt;/p&amp;gt;

&amp;lt;p&amp;gt;&amp;nbsp;&amp;lt;/p&amp;gt;

&amp;lt;p&amp;gt;&amp;nbsp;&amp;lt;/p&amp;gt;</dc:description>
  <dc:identifier>https://doi.org/10.5281/zenodo.4105765</dc:identifier>
  <dc:identifier>oai:zenodo.org:4105765</dc:identifier>
  <dc:language>eng</dc:language>
  <dc:publisher>Zenodo</dc:publisher>
  <dc:relation>https://zenodo.org/communities/webis</dc:relation>
  <dc:relation>https://doi.org/10.5281/zenodo.4105764</dc:relation>
  <dc:rights>info:eu-repo/semantics/openAccess</dc:rights>
  <dc:rights>Creative Commons Attribution 4.0 International</dc:rights>
  <dc:rights>https://creativecommons.org/licenses/by/4.0/legalcode</dc:rights>
  <dc:subject>editorial summarization</dc:subject>
  <dc:subject>argumentation  summarization</dc:subject>
  <dc:subject>extractive summarization</dc:subject>
  <dc:title>Webis EditorialSum Corpus 2020</dc:title>
  <dc:type>info:eu-repo/semantics/other</dc:type>
</oai_dc:dc>