Published July 24, 2018 | Version v1
Journal article Open

An Empirical Study of Obsolete Knowledge on Stack Overflow

  • 1. Queen's University
  • 2. Concordia University

Description

Stack Overflow accumulates an enormous amount of software engineering knowledge. However, as time passes, certain knowledge in answers may become obsolete. Such obsolete answers, if not identified or documented clearly, may mislead answer seekers and cause unexpected problems (e.g., using an out-dated security protocol). In this paper, we investigate how the knowledge in answers becomes obsolete and identify the characteristics of such obsolete answers. We find that: 1) More than half of the obsolete answers (58.4%) were probably already obsolete when they were first posted. 2) When an obsolete answer is observed, only a small proportion (23.5%) of such answers are ever updated. 3) Answers to questions in certain tags (e.g., tags related to web and mobile development) are more likely to become obsolete. Our findings suggest that Stack Overflow should develop mechanisms to encourage the whole community to maintain answers (to avoid obsolete answers) and answer seekers are encouraged to carefully go through all information (e.g., comments) in answer threads.

Files

so.pdf

Files (1.3 MB)

Name Size Download all
md5:6d890c77c39170331a1b5285a9314c28
1.3 MB Preview Download

Additional details

Subjects

software engineering
1
knowledge management
2
mining software repositories
3