Published October 21, 2020 | Version v3
Other Open

The implementation of characterization of duplicate bugs and datasets.

Authors/Creators

  • 1. Anonymous

Description

This zipped file contains the implementation of data preprocessing and analysis of the research questions (characterize_duplicates.py) and Lamkanfi's Defect Tracking Dataset (https://github.com/ansymo/msr2013-bug_dataset) and our dataset which keeps the relationship of duplicates and their masters as ids in JSON format. The relationships between masters and duplicates are stored in the file named 'ids.json' in each Eclipse and Mozilla files. 

The reason for providing Lamkanfi's dataset is to make our code ready to run.

Files

characterizeDuplicateBugs.zip

Files (265.5 MB)

Name Size Download all
md5:fc613dcce5074c5acc61facf8db09c22
265.5 MB Preview Download

Additional details

References

  • Ahmed Lamkanfi, Javier Perez, and Serge Demeyer. The eclipse and mozilla defect tracking dataset: a genuine dataset for mining bug information. InMSR '13: Proceedings of the 10th Working Conference onMining Software Repositories, May 18-–19, 2013. San Francisco, California, USA, 2013