Supplementary Material for "Challenges of Mining Solution Snippets from Stack Overflow: A Taxonomy to Guide Reusability"
Description
Stack Overflow is the de-facto standard resource for seeking and finding answers for specific programming problems. Many of these answers provide code snippets. Unfortunately, it is not always easy for developers to identify a reusable code snippet that solves a specific programming problem because the provided answers on Stack Overflow often contain the solution source code along with other content such as examples of how to use the code, solutions that only apply to a very specific context, and even incorrect ways to solve the problem. In our research, we explore the nature of reusability of what we call Solution Snippets, to help developers find solutions to their problems. To that end, we conduct an empirical study to firstly understand the characteristics of questions that would typically be answered with a Solution Snippet, and secondly to understand the characteristics of Solution Snippets themselves and how they may be reused by developers. Through an exploratory study of 598 Stack Overflow answers, we: (1) identify the characteristics of questions that require a Solution Snippet to address (how-to questions); (2) identify how Solution Snippets are presented in the answers to those questions; (3) classify Solution Snippets according to their reusability into five categories; and (4) provide suggestions for the designers of Stack Overflow to improve the presentation of answers on Stack Overflow, facilitating reusability and automatic extraction.
Files
Annotated Posts.zip
Files
(61.3 kB)
Name | Size | Download all |
---|---|---|
md5:42730155520e6d17ee221edbbdab18f7
|
18.9 kB | Preview Download |
md5:dc510a0738727b2b6055847d34572d2a
|
42.5 kB | Preview Download |