Dataset Open Access

How Do I Refactor This? An Empirical Study on Refactoring Trends and Topics in Stack Overflow

Peruma, Anthony; Simmons, Steven; Alomar, Eman Abdullah; Newman, Christian D.; Mkaouer, Mohamed Wiem; Ouni, Ali

This is the dataset that accompanies the study: "How Do I Refactor This? An Empirical Study on Refactoring Trends and Topics in Stack Overflow." This study has been accepted for publication in the software engineering journal Empirical Software Engineering.

 

Following is the abstract of the study:

An essential part of software maintenance and evolution, refactoring is performed by developers, regardless of technology or domain, to improve the internal quality of the system, and reduce its technical debt. However, choosing the appropriate refactoring strategy is not always straightforward, resulting in developers seeking assistance. Although research in refactoring is well-established, with several studies altering between the detection of refactoring opportunities and the recommendation of appropriate code changes, little is known about their adoption in practice. Analyzing the perception of developers is critical to understand better what developers consider to be problematic in their code and how they handle it. Additionally, there is a need for bridging the gap between refactoring, as research, and its adoption in practice, by extracting common refactoring intents that are more suitable for what developers face in reality. In this study, we analyze refactoring discussions on Stack Overflow through a series of quantitative and qualitative experiments. Our results show that Stack Overflow is utilized by a diverse set of developers for refactoring assistance for a variety of technologies. Our observations show five areas that developers typically require help with refactoring-- Code Optimization, Tools and IDEs, Architecture and Design Patterns, Unit Testing, and Database. We envision our findings better bridge the support between traditional (or academic) aspects of refactoring and their real-world applicability, including better tool support.

Manuscript DOI: https://doi.org/10.1007/s10664-021-10045-x 

Files (106.7 MB)
Name Size
analysis.xlsx
md5:2693ecf5e1ccfa7f20fdcf839a3d756d
339.0 kB Download
LDA_Model.zip
md5:d12ba6bfcef8ace1aed108291c054ca8
1.1 MB Download
README.txt
md5:37b44ad648cc8f4e4d10586442fbcf65
2.8 kB Download
Stackoverflow-Refactor.sqlite
md5:442117e7eac33cd8a1a55e44f7bf874d
105.3 MB Download
45
13
views
downloads
All versions This version
Views 4545
Downloads 1313
Data volume 320.0 MB320.0 MB
Unique views 4040
Unique downloads 77

Share

Cite as