An Empirical Study of Obsolete Knowledge on Stack Overflow
- 1. Queen's University
- 2. Concordia University
Description
Stack Overflow accumulates an enormous amount of software engineering knowledge. However, as time passes, certain knowledge in answers may become obsolete. Such obsolete answers, if not identified or documented clearly, may mislead answer seekers and cause unexpected problems (e.g., using an out-dated security protocol). In this paper, we investigate how the knowledge in answers becomes obsolete and identify the characteristics of such obsolete answers. We find that: 1) More than half of the obsolete answers (58.4%) were probably already obsolete when they were first posted. 2) When an obsolete answer is observed, only a small proportion (23.5%) of such answers are ever updated. 3) Answers to questions in certain tags (e.g., tags related to web and mobile development) are more likely to become obsolete. Our findings suggest that Stack Overflow should develop mechanisms to encourage the whole community to maintain answers (to avoid obsolete answers) and answer seekers are encouraged to carefully go through all information (e.g., comments) in answer threads.
Files
so.pdf
Files
(1.3 MB)
Name | Size | Download all |
---|---|---|
md5:6d890c77c39170331a1b5285a9314c28
|
1.3 MB | Preview Download |
Additional details
Subjects
- software engineering
- 1
- knowledge management
- 2
- mining software repositories
- 3