Published January 31, 2020 | Version v1
Journal article Open

Automating orthogonal defect classification using machine learning algorithms

  • 1. CISUC, Department of Informatics Engineering, University of Coimbra, Portugal
  • 2. CISUC, Department of Informatics Engineering, University of Coimbra, Portugal; Polytechnic of Coimbra - ISEC, Portugal

Description

Software systems are increasingly being used in business or mission critical scenarios, where the presence of certain types of software defects, i.e., bugs, may result in catastrophic consequences (e.g., financial losses or even the loss of human lives). To deploy systems in which we can rely on, it is vital to understand the types of defects that tend to affect such systems. This allows developers to take proper action, such as adapting the development process or redirecting testing efforts (e.g., using a certain set of testing techniques, or focusing on certain parts of the system). Orthogonal Defect Classification (ODC) has emerged as a popular method for classifying software defects, but it requires one or more experts to categorize each defect in a quite complex and time-consuming process. In this paper, we evaluate the use of machine learning algorithms (k-Nearest Neighbors, Support Vector Machines, Naïve Bayes, Nearest Centroid, Random Forest and Recurrent Neural Networks) for automatic classification of software defects using ODC, based on unstructured textual bug reports. Experimental results reveal the difficulties in automatically classifying certain ODC attributes solely using reports, but also suggest that the overall classification accuracy may be improved in most of the cases, if larger datasets are used.

Files

2019-fgcs-submitted.pdf

Files (749.3 kB)

Name Size Download all
md5:7ae82668e94fc41deeeec5a101d66f72
749.3 kB Preview Download