Dataset Open Access

Security Bug Conversations

Benjamin S. Meyers; Nuthan Munaiah; Andrew Meneely; Emily Prud'hommeaux


JSON-LD (schema.org) Export

{
  "inLanguage": {
    "alternateName": "eng", 
    "@type": "Language", 
    "name": "English"
  }, 
  "description": "<p>This dataset will be released as part of the following publication.</p>\n\n<ul>\n\t<li>Benjamin S. Meyers, Nuthan Munaiah, Andrew&nbsp;Meneely, and Emily Prud&#39;hommeaux.&nbsp;<strong>Pragmatic Characteristics of Security Conversation: An Exploratory Linguistic Analysis. </strong><em>Forthcoming.</em><strong>&nbsp;</strong>Proceedings of the 12th International Workshop on Cooperative and Human Aspects of Software Engineering (CHASE 2019).&nbsp;Montr&eacute;al, QC, Canada.</li>\n</ul>\n\n<p><strong>Files:</strong></p>\n\n<pre><code>security_bug_conversations.csv</code></pre>\n\n<p>The full dataset containing&nbsp;over 2.1 million comments posted by developers discussing bugs in the Chromium project. The dataset also includes the values we calculated for the five pragmatic features (described in Section 3 of the paper cited above).</p>\n\n<p><strong>CSV Fields:</strong></p>\n\n<ul>\n\t<li><strong>Organizational:</strong>\n\n\t<ul>\n\t\t<li><em>Bug ID:</em> Unique identifier of a bug discussion in the Chromium project. The URL&nbsp;https://bugs.chromium.org/p/chromium/issues/detail?id=&lt;Bug ID&gt; may be used to access the bug online</li>\n\t\t<li><em>Comment ID:</em> Unique identifier of a comment in a bug discussion</li>\n\t</ul>\n\t</li>\n\t<li><strong>Classification:</strong>\n\t<ul>\n\t\t<li><em>Is Security:</em> Binary indicator of whether or not a comment is part of a bug that is about security</li>\n\t</ul>\n\t</li>\n\t<li><strong>Natural Language:</strong>\n\t<ul>\n\t\t<li><em>Comment Text:</em>&nbsp;The raw natural language text of the bug comment</li>\n\t</ul>\n\t</li>\n\t<li><strong>Linguistic Metrics:</strong>\n\t<ul>\n\t\t<li><em>Min. Formality:</em>&nbsp;Minimum of the formality of sentences in the bug comment</li>\n\t\t<li><em>Max. Formality:</em>&nbsp;Maximum of the formality of sentences in the bug comment</li>\n\t\t<li><em>Max. Informativeness:</em>&nbsp;Maximum of the informativeness&nbsp;of sentences in the bug comment</li>\n\t\t<li><em>Max. Implicature:</em>&nbsp;Maximum of the implicature of sentences in the bug comment</li>\n\t\t<li><em>Min. Politeness:</em>&nbsp;Minimum of the politeness of sentences in the bug comment</li>\n\t\t<li><em>Max. Politeness:</em>&nbsp;Maximum of the politeness of sentences in the bug comment</li>\n\t\t<li>Number of Tokens</li>\n\t\t<li>Number of Sentences</li>\n\t\t<li><em>Has Doxastic Uncertainty:</em>&nbsp;Binary indicator of presence of a sentence with doxastic uncertainty in the bug comment</li>\n\t\t<li><em>Has Epistemic Uncertainty:</em>&nbsp;Binary indicator of presence of a sentence with epistemic&nbsp;uncertainty in the bug comment</li>\n\t\t<li><em>Has Conditional Uncertainty:</em>&nbsp;Binary indicator of presence of a sentence with conditional uncertainty in the bug comment</li>\n\t\t<li><em>Has Investigational Uncertainty:</em>&nbsp;Binary indicator of presence of a sentence with investigational uncertainty in the bug comment</li>\n\t\t<li><em>Has Uncertainty:</em>&nbsp;Binary indicator of presence of a sentence with any uncertainty in the bug comment</li>\n\t</ul>\n\t</li>\n</ul>", 
  "license": "http://creativecommons.org/licenses/by/4.0/legalcode", 
  "creator": [
    {
      "affiliation": "Rochester Institute of Technology", 
      "@id": "https://orcid.org/0000-0001-7053-6722", 
      "@type": "Person", 
      "name": "Benjamin S. Meyers"
    }, 
    {
      "affiliation": "Rochester Institute of Technology", 
      "@id": "https://orcid.org/0000-0003-2071-664X", 
      "@type": "Person", 
      "name": "Nuthan Munaiah"
    }, 
    {
      "affiliation": "Rochester Institute of Technology", 
      "@id": "https://orcid.org/0000-0002-4850-1408", 
      "@type": "Person", 
      "name": "Andrew Meneely"
    }, 
    {
      "affiliation": "Boston College", 
      "@type": "Person", 
      "name": "Emily Prud'hommeaux"
    }
  ], 
  "url": "https://zenodo.org/record/2595071", 
  "datePublished": "2019-03-15", 
  "version": "v1.0.0", 
  "keywords": [
    "Security", 
    "Software Engineering", 
    "Natural Language", 
    "Natural Language Processing", 
    "NLP", 
    "Chromium"
  ], 
  "@context": "https://schema.org/", 
  "distribution": [
    {
      "contentUrl": "https://zenodo.org/api/files/d1f4507e-a73c-4050-83e5-753e04f70c64/security_bug_coversations.csv", 
      "@type": "DataDownload", 
      "fileFormat": "csv"
    }
  ], 
  "identifier": "https://doi.org/10.5281/zenodo.2595071", 
  "@id": "https://doi.org/10.5281/zenodo.2595071", 
  "@type": "Dataset", 
  "name": "Security Bug Conversations"
}
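
The CSV fields listed in the description can be read with a standard CSV parser. The following is a minimal sketch, not the authors' tooling. It assumes that the header names in the file match the field names in the description exactly ("Bug ID", "Is Security", "Has Uncertainty") and that binary indicators are stored as "1" and "0"; neither is confirmed by this record. The sample rows are invented placeholders used only to make the sketch runnable, and `bug_url` and `uncertainty_rate` are hypothetical helper names.

```python
import csv
import io

# URL pattern quoted in the dataset description.
BUG_URL = "https://bugs.chromium.org/p/chromium/issues/detail?id={bug_id}"

# Invented placeholder rows, NOT taken from the dataset. Header names and the
# "1"/"0" encoding of binary indicators are assumptions.
SAMPLE_CSV = """Bug ID,Comment ID,Is Security,Comment Text,Has Uncertainty
1001,1,1,"This might be exploitable.",1
1001,2,1,"Fixed in the latest patch.",0
2002,1,0,"Button is misaligned.",0
"""


def bug_url(bug_id):
    """Build the issue-tracker URL for a bug from its Bug ID."""
    return BUG_URL.format(bug_id=bug_id)


def uncertainty_rate(rows):
    """Return the fraction of comments flagged 'Has Uncertainty',
    keyed by whether the comment belongs to a security bug."""
    totals = {}
    flagged = {}
    for row in rows:
        is_security = row["Is Security"] == "1"
        totals[is_security] = totals.get(is_security, 0) + 1
        flagged[is_security] = flagged.get(is_security, 0) + (
            row["Has Uncertainty"] == "1"
        )
    return {key: flagged[key] / totals[key] for key in totals}


# Parse the in-memory sample; for the real file, pass an open file handle
# to csv.DictReader instead of io.StringIO.
rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
rates = uncertainty_rate(rows)
print(bug_url(rows[0]["Bug ID"]))
print(rates)
```

Because the full file holds over 2.1 million comments, iterating over `csv.DictReader` row by row, rather than building a list first, keeps memory use low.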