Dataset Open Access

Security Bug Conversations

Benjamin S. Meyers; Nuthan Munaiah; Andrew Meneely; Emily Prud'hommeaux


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nmm##2200000uu#4500</leader>
  <datafield tag="041" ind1=" " ind2=" ">
    <subfield code="a">eng</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Security</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Software Engineering</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Natural Language</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Natural Language Processing</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">NLP</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Chromium</subfield>
  </datafield>
  <controlfield tag="005">20190410032722.0</controlfield>
  <datafield tag="500" ind1=" " ind2=" ">
    <subfield code="a">Whenever possible, we would appreciate it if you cite both the paper that released this work and the DOI for this dataset. Thank you!</subfield>
  </datafield>
  <controlfield tag="001">2595071</controlfield>
  <datafield tag="711" ind1=" " ind2=" ">
    <subfield code="d">27 May, 2019</subfield>
    <subfield code="g">CHASE</subfield>
    <subfield code="a">International Workshop on Cooperative and Human Aspects of Software Engineering</subfield>
    <subfield code="c">Montréal, QC, Canada</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Rochester Institute of Technology</subfield>
    <subfield code="0">(orcid)0000-0003-2071-664X</subfield>
    <subfield code="a">Nuthan Munaiah</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Rochester Institute of Technology</subfield>
    <subfield code="0">(orcid)0000-0002-4850-1408</subfield>
    <subfield code="a">Andrew Meneely</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Boston College</subfield>
    <subfield code="a">Emily Prud'hommeaux</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">1199065342</subfield>
    <subfield code="z">md5:bd04e9a1c4eeede6d75a44cba283f0c4</subfield>
    <subfield code="u">https://zenodo.org/record/2595071/files/security_bug_coversations.csv</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="y">Conference website</subfield>
    <subfield code="u">http://www.chaseresearch.org/workshops/chase2019</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2019-03-15</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire_data</subfield>
    <subfield code="o">oai:zenodo.org:2595071</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">Rochester Institute of Technology</subfield>
    <subfield code="0">(orcid)0000-0001-7053-6722</subfield>
    <subfield code="a">Benjamin S. Meyers</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Security Bug Conversations</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">http://creativecommons.org/licenses/by/4.0/legalcode</subfield>
    <subfield code="a">Creative Commons Attribution 4.0 International</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;This dataset will be released as part of the following publication.&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;Benjamin S. Meyers, Nuthan Munaiah, Andrew&amp;nbsp;Meneely, and Emily Prud&amp;#39;hommeaux.&amp;nbsp;&lt;strong&gt;Pragmatic Characteristics of Security Conversation: An Exploratory Linguistic Analysis.&lt;/strong&gt; &lt;em&gt;Forthcoming.&lt;/em&gt; Proceedings of the 12th International Workshop on Cooperative and Human Aspects of Software Engineering (CHASE 2019).&amp;nbsp;Montr&amp;eacute;al, QC, Canada.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Files:&lt;/strong&gt;&lt;/p&gt;

&lt;pre&gt;&lt;code&gt;security_bug_conversations.csv&lt;/code&gt;&lt;/pre&gt;

&lt;p&gt;The full dataset containing&amp;nbsp;over 2.1 million comments posted by developers discussing bugs in the Chromium project. The dataset also includes the values we calculated for the five pragmatic features (described in Section 3 of the paper cited above).&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;CSV Fields:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;&lt;strong&gt;Organizational:&lt;/strong&gt;

	&lt;ul&gt;
		&lt;li&gt;&lt;em&gt;Bug ID:&lt;/em&gt; Unique identifier of a bug discussion in the Chromium project. The URL&amp;nbsp;https://bugs.chromium.org/p/chromium/issues/detail?id=&amp;lt;Bug ID&amp;gt; may be used to access the bug online&lt;/li&gt;
		&lt;li&gt;&lt;em&gt;Comment ID:&lt;/em&gt; Unique identifier of a comment in a bug discussion&lt;/li&gt;
	&lt;/ul&gt;
	&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;Classification:&lt;/strong&gt;
	&lt;ul&gt;
		&lt;li&gt;&lt;em&gt;Is Security:&lt;/em&gt; Binary indicator of whether or not a comment is part of a bug that is about security&lt;/li&gt;
	&lt;/ul&gt;
	&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;Natural Language:&lt;/strong&gt;
	&lt;ul&gt;
		&lt;li&gt;&lt;em&gt;Comment Text:&lt;/em&gt;&amp;nbsp;The raw natural language text of the bug comment&lt;/li&gt;
	&lt;/ul&gt;
	&lt;/li&gt;
	&lt;li&gt;&lt;strong&gt;Linguistic Metrics:&lt;/strong&gt;
	&lt;ul&gt;
		&lt;li&gt;&lt;em&gt;Min. Formality:&lt;/em&gt;&amp;nbsp;Minimum of the formality of sentences in the bug comment&lt;/li&gt;
		&lt;li&gt;&lt;em&gt;Max. Formality:&lt;/em&gt;&amp;nbsp;Maximum of the formality of sentences in the bug comment&lt;/li&gt;
		&lt;li&gt;&lt;em&gt;Max. Informativeness:&lt;/em&gt;&amp;nbsp;Maximum of the informativeness&amp;nbsp;of sentences in the bug comment&lt;/li&gt;
		&lt;li&gt;&lt;em&gt;Max. Implicature:&lt;/em&gt;&amp;nbsp;Maximum of the implicature of sentences in the bug comment&lt;/li&gt;
		&lt;li&gt;&lt;em&gt;Min. Politeness:&lt;/em&gt;&amp;nbsp;Minimum of the politeness of sentences in the bug comment&lt;/li&gt;
		&lt;li&gt;&lt;em&gt;Max. Politeness:&lt;/em&gt;&amp;nbsp;Maximum of the politeness of sentences in the bug comment&lt;/li&gt;
		&lt;li&gt;Number of Tokens&lt;/li&gt;
		&lt;li&gt;Number of Sentences&lt;/li&gt;
		&lt;li&gt;&lt;em&gt;Has Doxastic Uncertainty:&lt;/em&gt;&amp;nbsp;Binary indicator of presence of a sentence with doxastic uncertainty in the bug comment&lt;/li&gt;
		&lt;li&gt;&lt;em&gt;Has Epistemic Uncertainty:&lt;/em&gt;&amp;nbsp;Binary indicator of presence of a sentence with epistemic&amp;nbsp;uncertainty in the bug comment&lt;/li&gt;
		&lt;li&gt;&lt;em&gt;Has Conditional Uncertainty:&lt;/em&gt;&amp;nbsp;Binary indicator of presence of a sentence with conditional uncertainty in the bug comment&lt;/li&gt;
		&lt;li&gt;&lt;em&gt;Has Investigational Uncertainty:&lt;/em&gt;&amp;nbsp;Binary indicator of presence of a sentence with investigational uncertainty in the bug comment&lt;/li&gt;
		&lt;li&gt;&lt;em&gt;Has Uncertainty:&lt;/em&gt;&amp;nbsp;Binary indicator of presence of a sentence with any uncertainty in the bug comment&lt;/li&gt;
	&lt;/ul&gt;
	&lt;/li&gt;
&lt;/ul&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.2595070</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.2595071</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">dataset</subfield>
  </datafield>
</record>