Published August 21, 2020 | Version v1
Dataset Open

Replication Kit: On the Feasibility of Automated Classification of Bug and Non-Bug Issues

  • 1. Karlsruhe Institute of Technology
  • 2. University of Goettingen

Description

This is the replication kit for the manuscript "On the Feasibility of Automated Classification of Bug and Non-Bug Issues".

The replication kit contains the following data.
 - results_all_issues contains the CSV files with the results of the classifiers we trained on all issues for phases 1 and 2 of the experiment.
 - results_only_bugs contains the CSV files with the results of the classifiers we trained only on bugs for phases 1 and 2 of the experiment.
 - results_unvalidated contains the CSV files with the results for phases 3 and 4 of the experiment.
 - The .p files contain pickled Python objects with the issue data. The code for creating these pickles is in evaluation.py.
 - evaluation.py creates all results that are missing, i.e., it does nothing if the folders already contain CSV files.
 - evaluation.py also generates the .p files if they are not available. However, this requires the raw data from Herzig et al. (2013), Herbold et al. (2020), and Ortu et al. (2015), which is not included in this replication kit because it is several gigabytes in size.
 - The EvaluationNotebook contains the code for the statistical analysis.
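
Before running evaluation.py, it can help to check which result folders already contain CSV files and to inspect the pickled issue data. The following is a minimal sketch, not code from the replication kit: the folder names come from the list above, while the helper functions `folders_missing_results` and `load_pickles`, and the assumption that the .p files sit directly in the kit's root directory, are hypothetical. Unpickling may also require the libraries that were used to create the objects to be installed.

```python
import pickle
from pathlib import Path

# Folder names are taken from the description above; everything else
# in this sketch is an assumption about the unpacked kit's layout.
RESULT_DIRS = ["results_all_issues", "results_only_bugs", "results_unvalidated"]


def folders_missing_results(root):
    """Return the result folders under root that contain no CSV files."""
    root = Path(root)
    return [name for name in RESULT_DIRS if not any((root / name).glob("*.csv"))]


def load_pickles(root):
    """Load every .p file directly under root into a dict keyed by file name."""
    data = {}
    for path in sorted(Path(root).glob("*.p")):
        with path.open("rb") as handle:
            # pickle can execute arbitrary code; only load files you trust.
            data[path.name] = pickle.load(handle)
    return data
```

For example, `folders_missing_results("replication-kit")` (with a hypothetical extraction path) lists the folders for which no CSV results were found, which is a rough way to anticipate whether evaluation.py still has work to do.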
 

Files

replication-kit-issue-type-prediction.zip

Files (425.7 MB)

md5:489c90bdaa4e2d0dac4f51da6185b04f
425.7 MB