Dataset Open Access

Security Bug Conversations

Benjamin S. Meyers; Nuthan Munaiah; Andrew Meneely; Emily Prud'hommeaux


JSON Export

{
  "files": [
    {
      "links": {
        "self": "https://zenodo.org/api/files/d1f4507e-a73c-4050-83e5-753e04f70c64/security_bug_coversations.csv"
      }, 
      "checksum": "md5:bd04e9a1c4eeede6d75a44cba283f0c4", 
      "bucket": "d1f4507e-a73c-4050-83e5-753e04f70c64", 
      "key": "security_bug_coversations.csv", 
      "type": "csv", 
      "size": 1199065342
    }
  ], 
  "owners": [
    62595
  ], 
  "doi": "10.5281/zenodo.2595071", 
  "stats": {
    "version_unique_downloads": 20.0, 
    "unique_views": 26.0, 
    "views": 33.0, 
    "downloads": 28.0, 
    "unique_downloads": 20.0, 
    "version_unique_views": 26.0, 
    "volume": 33573829576.0, 
    "version_downloads": 28.0, 
    "version_views": 33.0, 
    "version_volume": 33573829576.0
  }, 
  "links": {
    "doi": "https://doi.org/10.5281/zenodo.2595071", 
    "conceptdoi": "https://doi.org/10.5281/zenodo.2595070", 
    "bucket": "https://zenodo.org/api/files/d1f4507e-a73c-4050-83e5-753e04f70c64", 
    "conceptbadge": "https://zenodo.org/badge/doi/10.5281/zenodo.2595070.svg", 
    "html": "https://zenodo.org/record/2595071", 
    "latest_html": "https://zenodo.org/record/2595071", 
    "badge": "https://zenodo.org/badge/doi/10.5281/zenodo.2595071.svg", 
    "latest": "https://zenodo.org/api/records/2595071"
  }, 
  "conceptdoi": "10.5281/zenodo.2595070", 
  "created": "2019-03-15T15:04:42.802160+00:00", 
  "updated": "2019-04-10T03:27:22.533239+00:00", 
  "conceptrecid": "2595070", 
  "revision": 4, 
  "id": 2595071, 
  "metadata": {
    "access_right_category": "success", 
    "doi": "10.5281/zenodo.2595071", 
    "description": "<p>This dataset will be released as part of the following publication.</p>\n\n<ul>\n\t<li>Benjamin S. Meyers, Nuthan Munaiah, Andrew&nbsp;Meneely, and Emily Prud&#39;hommeaux.&nbsp;<strong>Pragmatic Characteristics of Security Conversation: An Exploratory Linguistic Analysis. </strong><em>Forthcoming.</em><strong>&nbsp;</strong>Proceedings of the 12th International Workshop on Cooperative and Human Aspects of Software Engineering (CHASE 2019).&nbsp;Montr&eacute;al, QC, Canada.</li>\n</ul>\n\n<p><strong>Files:</strong></p>\n\n<pre><code>security_bug_conversations.csv</code></pre>\n\n<p>The full dataset containing&nbsp;over 2.1 million comments posted by developers discussing bugs in the Chromium project. The dataset also includes the values we calculated for the five pragmatic features (described in Section 3 of the paper cited above).</p>\n\n<p><strong>CSV Fields:</strong></p>\n\n<ul>\n\t<li><strong>Organizational:</strong>\n\n\t<ul>\n\t\t<li><em>Bug ID:</em> Unique identifier of a bug discussion in the Chromium project. The URL&nbsp;https://bugs.chromium.org/p/chromium/issues/detail?id=&lt;Bug ID&gt; may be used to access the bug online</li>\n\t\t<li><em>Comment ID:</em> Unique identifier of a comment in a bug discussion</li>\n\t</ul>\n\t</li>\n\t<li><strong>Classification:</strong>\n\t<ul>\n\t\t<li><em>Is Security:</em> Binary indicator of whether or not a comment is part of a bug that is about security</li>\n\t</ul>\n\t</li>\n\t<li><strong>Natural Language:</strong>\n\t<ul>\n\t\t<li><em>Comment Text:</em>&nbsp;The raw natural language text of the bug comment</li>\n\t</ul>\n\t</li>\n\t<li><strong>Linguistic Metrics:</strong>\n\t<ul>\n\t\t<li><em>Min. Formality:</em>&nbsp;Minimum of the formality of sentences in the bug comment</li>\n\t\t<li><em>Max. Formality:</em>&nbsp;Maximum of the formality of sentences in the bug comment</li>\n\t\t<li><em>Max. Informativeness:</em>&nbsp;Maximum of the informativeness&nbsp;of sentences in the bug comment</li>\n\t\t<li><em>Max. Implicature:</em>&nbsp;Maximum of the implicature of sentences in the bug comment</li>\n\t\t<li><em>Min. Politeness:</em>&nbsp;Minimum of the politeness of sentences in the bug comment</li>\n\t\t<li><em>Max. Politeness:</em>&nbsp;Maximum of the politeness of sentences in the bug comment</li>\n\t\t<li>Number of Tokens</li>\n\t\t<li>Number of Sentences</li>\n\t\t<li><em>Has Doxastic Uncertainty:</em>&nbsp;Binary indicator of presence of a sentence with doxastic uncertainty in the bug comment</li>\n\t\t<li><em>Has Epistemic Uncertainty:</em>&nbsp;Binary indicator of presence of a sentence with epistemic&nbsp;uncertainty in the bug comment</li>\n\t\t<li><em>Has Conditional Uncertainty:</em>&nbsp;Binary indicator of presence of a sentence with conditional uncertainty in the bug comment</li>\n\t\t<li><em>Has Investigational Uncertainty:</em>&nbsp;Binary indicator of presence of a sentence with investigational uncertainty in the bug comment</li>\n\t\t<li><em>Has Uncertainty:</em>&nbsp;Binary indicator of presence of a sentence with any uncertainty in the bug comment</li>\n\t</ul>\n\t</li>\n</ul>", 
    "license": {
      "id": "CC-BY-4.0"
    }, 
    "title": "Security Bug Conversations", 
    "notes": "Whenever possible, we would appreciate it if you cite both the paper that released this work and the DOI for this dataset. Thank you!", 
    "relations": {
      "version": [
        {
          "count": 1, 
          "index": 0, 
          "parent": {
            "pid_type": "recid", 
            "pid_value": "2595070"
          }, 
          "is_last": true, 
          "last_child": {
            "pid_type": "recid", 
            "pid_value": "2595071"
          }
        }
      ]
    }, 
    "language": "eng", 
    "version": "v1.0.0", 
    "keywords": [
      "Security", 
      "Software Engineering", 
      "Natural Language", 
      "Natural Language Processing", 
      "NLP", 
      "Chromium"
    ], 
    "publication_date": "2019-03-15", 
    "creators": [
      {
        "orcid": "0000-0001-7053-6722", 
        "affiliation": "Rochester Institute of Technology", 
        "name": "Benjamin S. Meyers"
      }, 
      {
        "orcid": "0000-0003-2071-664X", 
        "affiliation": "Rochester Institute of Technology", 
        "name": "Nuthan Munaiah"
      }, 
      {
        "orcid": "0000-0002-4850-1408", 
        "affiliation": "Rochester Institute of Technology", 
        "name": "Andrew Meneely"
      }, 
      {
        "affiliation": "Boston College", 
        "name": "Emily Prud'hommeaux"
      }
    ], 
    "meeting": {
      "acronym": "CHASE", 
      "url": "http://www.chaseresearch.org/workshops/chase2019", 
      "dates": "27 May, 2019", 
      "place": "Montr\u00e9al, QC, Canada", 
      "title": "International Workshop on Cooperative and Human Aspects of Software Engineering"
    }, 
    "access_right": "open", 
    "resource_type": {
      "type": "dataset", 
      "title": "Dataset"
    }, 
    "related_identifiers": [
      {
        "scheme": "doi", 
        "relation": "isVersionOf", 
        "identifier": "10.5281/zenodo.2595070"
      }
    ]
  }
}
33
28
views
downloads
All versions This version
Views 3333
Downloads 2828
Data volume 33.6 GB33.6 GB
Unique views 2626
Unique downloads 2020

Share

Cite as