Dataset Open Access

War of Words: The Competitive Dynamics of Legislative Processes

Kristof, Victor; Grossglauser, Matthias; Thiran, Patrick


Citation Style Language JSON Export

{
  "DOI": "10.1145/3366423.3380041", 
  "title": "War of Words: The Competitive Dynamics of Legislative Processes", 
  "issued": {
    "date-parts": [
      [
        2020, 
        4, 
        20
      ]
    ]
  }, 
  "abstract": "<p>This upload contains the dataset presented and used in the paper:</p>\n\n<blockquote>\n<p>Kristof, V., Grossglauser, M., Thiran, P., <a href=\"https://infoscience.epfl.ch/record/275473/\"><em>War of Words: The Competitive Dynamics of Legislative Processes</em></a>, The Web Conference, April 20-24, 2020, Taipei, Taiwan</p>\n</blockquote>\n\n<p><strong>Read&nbsp;Section 2.2 of the paper to learn more about the European legislative process.&nbsp;</strong>The code to process and use the dataset can be found on <a href=\"https://github.com/indy-lab/war-of-words\">GitHub</a>.</p>\n\n<p>The dataset is split into two legislature periods of the European Parliament, the 7th (<strong>war-of-words-ep7.txt</strong>) and the 8th (<strong>war-of-words-ep8.txt</strong>) legislature.&nbsp;Here is a snippet to load the dataset (for EP7 in this example) in Python:</p>\n\n<pre><code class=\"language-python\">import json\n\nwith open('path/to/war-of-words-ep7.txt') as f:\n    dataset = [json.loads(l) for l in f.readlines()]\n</code></pre>\n\n<p>In the two text files, each line is a data point representing a <em>conflict between edits</em>. It is encoded as a JSON list of dictionaries, where each dictionary is an edit.&nbsp;Each edit has the following structure:</p>\n\n<pre><code class=\"language-json\">{\n  'edit_id': 163187,                     // Unique edit identifier.\n  'accepted': True,                      // Label.\n  'dossier_ref': 'ENVI-AD(2012)487738',  // Reference to dossier (see below).\n  'authors': [                           // List of authors.\n    {\n      'id': 4550,                        // Unique MEP identifier (see below).\n      'name': 'Jill EVANS',              // MEP name.\n      'rapporteur': False                // Whether the MEP is rapporteur.\n    },\n  ],\n}</code></pre>\n\n<p>You can assume that:</p>\n\n<ul>\n\t<li>Each data point has at least one edit.</li>\n\t<li>If there is only one edit, then it is <em>in conflict with the status quo&nbsp;</em>(see Section 4 of the paper).</li>\n\t<li>If there are two or more edits in conflict, then they are all in conflict against each other&nbsp;<em>and</em>&nbsp;they are in conflict with the status quo (see Section 4 of the paper).</li>\n\t<li>At most one edit is accepted in&nbsp;each data point.</li>\n\t<li>In each legislature, each edit has a unique identifier.</li>\n\t<li>There are no timestamps associated with edits (see Section 3 of the paper).&nbsp;</li>\n</ul>\n\n<p>The <strong>dossier_ref</strong>&nbsp;can be used to get more information on the dossier. It is formatted as <strong>COMM-TYPE(YEAR)PENUMBER </strong>(this follows the notation of file names used by the Parliament Secretariat), where</p>\n\n<ul>\n\t<li><strong>COMM</strong>&nbsp;is the <a href=\"https://www.europarl.europa.eu/committees/en/about/introduction\">committee</a> identifier (4 capital letters)</li>\n\t<li><strong>TYPE</strong>&nbsp;is either <strong>AD</strong>&nbsp;(opinion) or <strong>A{7,8}</strong>&nbsp;(report for EP7 or EP8, see Section 2.2 of the paper)</li>\n\t<li><strong>YEAR</strong>&nbsp;is the year the dossier has been voted</li>\n\t<li><strong>PENUMBER</strong>&nbsp;is the &quot;PE number&quot;, a&nbsp;document identifier used by the European Parliament</li>\n</ul>\n\n<p>You can browse the Parliament documents&nbsp;to find details about the dossier for <a href=\"https://www.europarl.europa.eu/committees/en/archives/7/document-search\">EP7</a> and <a href=\"https://www.europarl.europa.eu/committees/en/archives/8/document-search\">EP8</a>&nbsp;(the PE number field should be enough).</p>\n\n<p>The&nbsp;parliamentarians (MEPs, for Member of the European Parliament) have a unique identifier that you can use to get more details about them on the Parliament website: Go to&nbsp;<strong>https://www.europarl.europa.eu/meps/en/MEP_ID</strong>, where&nbsp;<strong>MEP_ID&nbsp;</strong>is the id of the MEP of interest.</p>\n\n<p>This dataset is vowed to become richer:&nbsp;I will add more features, as I am able to extract them.</p>\n\n<p>&nbsp;</p>\n\n<p><strong>Don&#39;t hesitate to <a href=\"mailto:victor.kristof@epfl.ch?subject=Question%20about%20the%20War%20of%20Words%20dataset\">reach out to me</a> if you have any questions!</strong></p>\n\n<p>&nbsp;</p>\n\n<p>To cite this work:</p>\n\n<pre><code>@inproceedings{kristof2020war,\n  author = {Kristof, Victor and Grossglauser, Matthias and Thiran, Patrick},\n  title = {War of Words: The Competitive Dynamics of Legislative Processes},\n  year = {2020},\n  booktitle = {Proceedings of The Web Conference 2020},\n  pages = {2803\u20132809},\n  numpages = {7},\n  location = {Taipei, Taiwan},\n  series = {WWW '20}\n}</code></pre>\n\n<p>&nbsp;</p>", 
  "author": [
    {
      "family": "Kristof, Victor"
    }, 
    {
      "family": "Grossglauser, Matthias"
    }, 
    {
      "family": "Thiran, Patrick"
    }
  ], 
  "version": "1.0", 
  "type": "dataset", 
  "id": "3757714"
}
384
38
views
downloads
Views 384
Downloads 38
Data volume 1.8 GB
Unique views 356
Unique downloads 29

Share

Cite as