Dataset Open Access

Security Bug Conversations

Benjamin S. Meyers; Nuthan Munaiah; Andrew Meneely; Emily Prud'hommeaux


Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
  <dc:creator>Benjamin S. Meyers</dc:creator>
  <dc:creator>Nuthan Munaiah</dc:creator>
  <dc:creator>Andrew Meneely</dc:creator>
  <dc:creator>Emily Prud'hommeaux</dc:creator>
  <dc:date>2019-03-15</dc:date>
  <dc:description>This dataset will be released as part of the following publication.


	Benjamin S. Meyers, Nuthan Munaiah, Andrew Meneely, and Emily Prud'hommeaux. Pragmatic Characteristics of Security Conversation: An Exploratory Linguistic Analysis. Forthcoming. Proceedings of the 12th International Workshop on Cooperative and Human Aspects of Software Engineering (CHASE 2019). Montréal, QC, Canada.


Files:

security_bug_conversations.csv

The full dataset containing over 2.1 million comments posted by developers discussing bugs in the Chromium project. The dataset also includes the values we calculated for the five pragmatic features (described in Section 3 of the paper cited above).

CSV Fields:


	Organizational:

	
		Bug ID: Unique identifier of a bug discussion in the Chromium project. The URL https://bugs.chromium.org/p/chromium/issues/detail?id=&lt;Bug ID&gt; may be used to access the bug online
		Comment ID: Unique identifier of a comment in a bug discussion
	
	
	Classification:
	
		Is Security: Binary indicator of whether the comment belongs to a security-related bug
	
	
	Natural Language:
	
		Comment Text: The raw natural language text of the bug comment
	
	
	Linguistic Metrics:
	
		Min. Formality: Minimum of the formality of sentences in the bug comment
		Max. Formality: Maximum of the formality of sentences in the bug comment
		Max. Informativeness: Maximum of the informativeness of sentences in the bug comment
		Max. Implicature: Maximum of the implicature of sentences in the bug comment
		Min. Politeness: Minimum of the politeness of sentences in the bug comment
		Max. Politeness: Maximum of the politeness of sentences in the bug comment
		Number of Tokens: Number of tokens in the bug comment
		Number of Sentences: Number of sentences in the bug comment
		Has Doxastic Uncertainty: Binary indicator of presence of a sentence with doxastic uncertainty in the bug comment
		Has Epistemic Uncertainty: Binary indicator of presence of a sentence with epistemic uncertainty in the bug comment
		Has Conditional Uncertainty: Binary indicator of presence of a sentence with conditional uncertainty in the bug comment
		Has Investigational Uncertainty: Binary indicator of presence of a sentence with investigational uncertainty in the bug comment
		Has Uncertainty: Binary indicator of presence of a sentence with any uncertainty in the bug comment
	
	
</dc:description>
  <dc:description>Whenever possible, please cite both the paper that introduced this dataset and the DOI for this dataset. Thank you!</dc:description>
  <dc:identifier>https://zenodo.org/record/2595071</dc:identifier>
  <dc:identifier>10.5281/zenodo.2595071</dc:identifier>
  <dc:identifier>oai:zenodo.org:2595071</dc:identifier>
  <dc:language>eng</dc:language>
  <dc:relation>doi:10.5281/zenodo.2595070</dc:relation>
  <dc:rights>info:eu-repo/semantics/openAccess</dc:rights>
  <dc:rights>http://creativecommons.org/licenses/by/4.0/legalcode</dc:rights>
  <dc:subject>Security</dc:subject>
  <dc:subject>Software Engineering</dc:subject>
  <dc:subject>Natural Language</dc:subject>
  <dc:subject>Natural Language Processing</dc:subject>
  <dc:subject>NLP</dc:subject>
  <dc:subject>Chromium</dc:subject>
  <dc:title>Security Bug Conversations</dc:title>
  <dc:type>info:eu-repo/semantics/other</dc:type>
  <dc:type>dataset</dc:type>
</oai_dc:dc>
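
The CSV fields described in the record above suggest a simple way to pull out the security-related comments and link each one back to its bug. The following is a minimal sketch, not the authors' tooling. It assumes the CSV header row uses the field names exactly as listed ("Bug ID", "Comment ID", "Is Security", "Comment Text") and that the binary indicator is stored as 1/0 or true/false; neither detail is confirmed by the record. The helper names bug_url and security_comments are hypothetical, and the two sample rows are invented purely for illustration.

```python
import csv
import io

# URL template quoted in the dataset description.
BUG_URL = "https://bugs.chromium.org/p/chromium/issues/detail?id={bug_id}"


def bug_url(bug_id):
    """Build the online URL for a Chromium bug from its Bug ID."""
    return BUG_URL.format(bug_id=bug_id)


def security_comments(rows):
    """Yield rows whose 'Is Security' flag is set.

    Assumption: the flag is encoded as 1/0 or true/false.
    """
    for row in rows:
        if row["Is Security"].strip().lower() in {"1", "true"}:
            yield row


# Invented sample standing in for security_bug_conversations.csv.
# Assumption: the real header row matches these field names.
SAMPLE = """Bug ID,Comment ID,Is Security,Comment Text
101,1,1,Possible use-after-free in the renderer.
102,1,0,Button is misaligned on the settings page.
"""

reader = csv.DictReader(io.StringIO(SAMPLE))
flagged = list(security_comments(reader))

for row in flagged:
    print(row["Bug ID"], bug_url(row["Bug ID"]))
```

To run this against the real file, the in-memory sample could be replaced with open("security_bug_conversations.csv", newline="", encoding="utf-8"), assuming the file is UTF-8 encoded. Because security_comments is a generator, rows can be streamed one at a time rather than loading all 2.1 million comments into memory.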