Dataset Open Access

War of Words: The Competitive Dynamics of Legislative Processes

Kristof, Victor; Grossglauser, Matthias; Thiran, Patrick


JSON Export

{
  "files": [
    {
      "links": {
        "self": "https://zenodo.org/api/files/1f3bfe5a-9cce-4a60-853c-8d95b63b1f4e/war-of-words-ep7.txt"
      }, 
      "checksum": "md5:6ed13142dbceab6795718501003fa4ff", 
      "bucket": "1f3bfe5a-9cce-4a60-853c-8d95b63b1f4e", 
      "key": "war-of-words-ep7.txt", 
      "type": "txt", 
      "size": 38045458
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/1f3bfe5a-9cce-4a60-853c-8d95b63b1f4e/war-of-words-ep8.txt"
      }, 
      "checksum": "md5:840a5992d258cc586db517f97cb3445c", 
      "bucket": "1f3bfe5a-9cce-4a60-853c-8d95b63b1f4e", 
      "key": "war-of-words-ep8.txt", 
      "type": "txt", 
      "size": 59891222
    }
  ], 
  "owners": [
    89424
  ], 
  "doi": "10.1145/3366423.3380041", 
  "stats": {
    "version_unique_downloads": 29.0, 
    "unique_views": 355.0, 
    "views": 382.0, 
    "version_views": 382.0, 
    "unique_downloads": 29.0, 
    "version_unique_views": 355.0, 
    "volume": 1751568100.0, 
    "version_downloads": 38.0, 
    "downloads": 38.0, 
    "version_volume": 1751568100.0
  }, 
  "links": {
    "doi": "https://doi.org/10.1145/3366423.3380041", 
    "latest_html": "https://zenodo.org/record/3757714", 
    "bucket": "https://zenodo.org/api/files/1f3bfe5a-9cce-4a60-853c-8d95b63b1f4e", 
    "badge": "https://zenodo.org/badge/doi/10.1145/3366423.3380041.svg", 
    "html": "https://zenodo.org/record/3757714", 
    "latest": "https://zenodo.org/api/records/3757714"
  }, 
  "created": "2020-04-20T16:36:30.494646+00:00", 
  "updated": "2020-04-24T16:11:39.922597+00:00", 
  "conceptrecid": "3757713", 
  "revision": 10, 
  "id": 3757714, 
  "metadata": {
    "access_right_category": "success", 
    "doi": "10.1145/3366423.3380041", 
    "description": "<p>This upload contains the dataset presented and used in the paper:</p>\n\n<blockquote>\n<p>Kristof, V., Grossglauser, M., Thiran, P., <a href=\"https://infoscience.epfl.ch/record/275473/\"><em>War of Words: The Competitive Dynamics of Legislative Processes</em></a>, The Web Conference, April 20-24, 2020, Taipei, Taiwan</p>\n</blockquote>\n\n<p><strong>Read&nbsp;Section 2.2 of the paper to learn more about the European legislative process.&nbsp;</strong>The code to process and use the dataset can be found on <a href=\"https://github.com/indy-lab/war-of-words\">GitHub</a>.</p>\n\n<p>The dataset is split into two legislature periods of the European Parliament, the 7th (<strong>war-of-words-ep7.txt</strong>) and the 8th (<strong>war-of-words-ep8.txt</strong>) legislature.&nbsp;Here is a snippet to load the dataset (for EP7 in this example) in Python:</p>\n\n<pre><code class=\"language-python\">import json\n\nwith open('path/to/war-of-words-ep7.txt') as f:\n    dataset = [json.loads(l) for l in f.readlines()]\n</code></pre>\n\n<p>In the two text files, each line is a data point representing a <em>conflict between edits</em>. 
It is encoded as a JSON list of dictionaries, where each dictionary is an edit.&nbsp;Each edit has the following structure:</p>\n\n<pre><code class=\"language-json\">{\n  'edit_id': 163187,                     // Unique edit identifier.\n  'accepted': True,                      // Label: whether the edit was accepted.\n  'dossier_ref': 'ENVI-AD(2012)487738',  // Reference to dossier (see below).\n  'authors': [                           // List of authors.\n    {\n      'id': 4550,                        // Unique MEP identifier (see below).\n      'name': 'Jill EVANS',              // MEP name.\n      'rapporteur': False                // Whether the MEP is rapporteur.\n    },\n  ],\n}</code></pre>\n\n<p>You can assume that:</p>\n\n<ul>\n\t<li>Each data point has at least one edit.</li>\n\t<li>If there is only one edit, then it is <em>in conflict with the status quo&nbsp;</em>(see Section 4 of the paper).</li>\n\t<li>If there are two or more edits in conflict, then they are all in conflict with each other&nbsp;<em>and</em>&nbsp;they are in conflict with the status quo (see Section 4 of the paper).</li>\n\t<li>At most one edit is accepted in&nbsp;each data point.</li>\n\t<li>In each legislature, each edit has a unique identifier.</li>\n\t<li>There are no timestamps associated with edits (see Section 3 of the paper).&nbsp;</li>\n</ul>\n\n<p>The <strong>dossier_ref</strong>&nbsp;can be used to get more information on the dossier. 
It is formatted as <strong>COMM-TYPE(YEAR)PENUMBER </strong>(this follows the notation of file names used by the Parliament Secretariat), where</p>\n\n<ul>\n\t<li><strong>COMM</strong>&nbsp;is the <a href=\"https://www.europarl.europa.eu/committees/en/about/introduction\">committee</a> identifier (4 capital letters)</li>\n\t<li><strong>TYPE</strong>&nbsp;is either <strong>AD</strong>&nbsp;(opinion) or <strong>A{7,8}</strong>&nbsp;(report for EP7 or EP8, see Section 2.2 of the paper)</li>\n\t<li><strong>YEAR</strong>&nbsp;is the year the dossier was voted on</li>\n\t<li><strong>PENUMBER</strong>&nbsp;is the &quot;PE number&quot;, a&nbsp;document identifier used by the European Parliament</li>\n</ul>\n\n<p>You can browse the Parliament documents&nbsp;to find details about the dossier for <a href=\"https://www.europarl.europa.eu/committees/en/archives/7/document-search\">EP7</a> and <a href=\"https://www.europarl.europa.eu/committees/en/archives/8/document-search\">EP8</a>&nbsp;(the PE number field should be enough).</p>\n\n<p>The&nbsp;parliamentarians (MEPs, for Members of the European Parliament) have a unique identifier that you can use to get more details about them on the Parliament website: Go to&nbsp;<strong>https://www.europarl.europa.eu/meps/en/MEP_ID</strong>, where&nbsp;<strong>MEP_ID&nbsp;</strong>is the id of the MEP of interest.</p>\n\n<p>This dataset is intended to grow richer over time:&nbsp;I will add more features as I am able to extract them.</p>\n\n<p>&nbsp;</p>\n\n<p><strong>Don&#39;t hesitate to <a href=\"mailto:victor.kristof@epfl.ch?subject=Question%20about%20the%20War%20of%20Words%20dataset\">reach out to me</a> if you have any questions!</strong></p>\n\n<p>&nbsp;</p>\n\n<p>To cite this work:</p>\n\n<pre><code>@inproceedings{kristof2020war,\n  author = {Kristof, Victor and Grossglauser, Matthias and Thiran, Patrick},\n  title = {War of Words: The Competitive Dynamics of Legislative Processes},\n  year = {2020},\n  booktitle = {Proceedings of The Web 
Conference 2020},\n  pages = {2803\u20132809},\n  numpages = {7},\n  location = {Taipei, Taiwan},\n  series = {WWW '20}\n}</code></pre>\n\n<p>&nbsp;</p>", 
    "contributors": [
      {
        "affiliation": "EPFL", 
        "type": "ContactPerson", 
        "name": "Kristof, Victor"
      }
    ], 
    "title": "War of Words: The Competitive Dynamics of Legislative Processes", 
    "license": {
      "id": "CC-BY-4.0"
    }, 
    "relations": {
      "version": [
        {
          "count": 1, 
          "index": 0, 
          "parent": {
            "pid_type": "recid", 
            "pid_value": "3757713"
          }, 
          "is_last": true, 
          "last_child": {
            "pid_type": "recid", 
            "pid_value": "3757714"
          }
        }
      ]
    }, 
    "communities": [
      {
        "id": "epfl"
      }
    ], 
    "version": "1.0", 
    "keywords": [
      "legislative amendments, european union, european parliament"
    ], 
    "publication_date": "2020-04-20", 
    "creators": [
      {
        "affiliation": "EPFL", 
        "name": "Kristof, Victor"
      }, 
      {
        "affiliation": "EPFL", 
        "name": "Grossglauser, Matthias"
      }, 
      {
        "affiliation": "EPFL", 
        "name": "Thiran, Patrick"
      }
    ], 
    "meeting": {
      "acronym": "WWW '20", 
      "dates": "20-24 April 2020", 
      "place": "Taipei, Taiwan", 
      "title": "The Web Conference"
    }, 
    "access_right": "open", 
    "resource_type": {
      "type": "dataset", 
      "title": "Dataset"
    }, 
    "related_identifiers": [
      {
        "scheme": "doi", 
        "identifier": "10.1145/3366423.3380041", 
        "relation": "isDocumentedBy", 
        "resource_type": "publication-conferencepaper"
      }
    ]
  }
}
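
The description above states the dossier_ref format and a few guarantees about each data point. The following is a minimal sketch of how one might check them in Python; it is not part of the authors' code (which is on GitHub). The names DOSSIER_REF, parse_dossier_ref and load_conflict are hypothetical, and the regular expression assumes that YEAR has four digits and that the PE number consists of digits only, which the description does not state explicitly.

```python
import json
import re

# Assumed pattern for COMM-TYPE(YEAR)PENUMBER, e.g. 'ENVI-AD(2012)487738'.
# Assumptions (not confirmed by the record): YEAR is four digits and
# PENUMBER is all digits.
DOSSIER_REF = re.compile(
    r"^(?P<comm>[A-Z]{4})-(?P<type>AD|A[78])\((?P<year>\d{4})\)(?P<pe_number>\d+)$"
)


def parse_dossier_ref(ref):
    """Split a dossier reference into its parts, or return None if the
    reference does not fit the assumed pattern."""
    match = DOSSIER_REF.match(ref)
    if match is None:
        return None
    parts = match.groupdict()
    parts["year"] = int(parts["year"])
    return parts


def load_conflict(line):
    """Decode one line of the dataset (a JSON list of edits) and check the
    guarantees listed in the description."""
    edits = json.loads(line)
    if len(edits) < 1:
        raise ValueError("a data point should contain at least one edit")
    if sum(bool(edit["accepted"]) for edit in edits) > 1:
        raise ValueError("at most one edit should be accepted per data point")
    return edits
```

A reference that does not match yields None rather than an exception, so that unexpected formats in the real files can be counted and inspected instead of stopping the load.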
Views 382
Downloads 38
Data volume 1.8 GB
Unique views 355
Unique downloads 29
