The implementation of characterization of duplicate bugs and datasets.
Description
This zip archive contains three things: the implementation of the data preprocessing and research-question analysis (characterize_duplicates.py); Lamkanfi's Defect Tracking Dataset (https://github.com/ansymo/msr2013-bug_dataset); and our dataset, which records the relationships between duplicates and their masters as ids in JSON format. These relationships are stored in a file named 'ids.json' for each of Eclipse and Mozilla.
Lamkanfi's dataset is included so that our code is ready to run.
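As an illustration of how the duplicate/master relationships might be consumed, here is a minimal sketch in Python. The layout of 'ids.json' is not documented here, so the sketch assumes (hypothetically) a JSON object that maps each master id to a list of its duplicate ids; the function name `load_duplicate_map` is likewise made up for this example. Adjust the parsing if the actual file is structured differently.

```python
import json


def load_duplicate_map(path):
    """Load an ids.json file and return {duplicate_id: master_id}.

    ASSUMPTION (not confirmed by the dataset description): the file is a
    JSON object mapping each master id to a list of its duplicate ids,
    e.g. {"100": [101, 102]}.
    """
    with open(path, encoding="utf-8") as handle:
        raw = json.load(handle)

    duplicate_to_master = {}
    for master_id, duplicate_ids in raw.items():
        for duplicate_id in duplicate_ids:
            # Normalise ids to strings so lookups behave consistently.
            duplicate_to_master[str(duplicate_id)] = str(master_id)
    return duplicate_to_master
```

Under that assumed layout, `load_duplicate_map("ids.json")` would let you look up the master of any duplicate report by its id.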
Files
| Name | Size |
|---|---|
| characterizeDuplicateBugs.zip (md5:fc613dcce5074c5acc61facf8db09c22) | 265.5 MB |
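Because the archive is large, it can be worth checking the download against the published MD5 checksum before unpacking it. The following is a minimal sketch using Python's standard `hashlib`; the helper names and the assumption that the archive sits in the current directory are illustrative only.

```python
import hashlib
from pathlib import Path

# Checksum listed for characterizeDuplicateBugs.zip on this page.
EXPECTED_MD5 = "fc613dcce5074c5acc61facf8db09c22"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected checksum."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed download location; change to wherever the archive was saved.
    archive = Path("characterizeDuplicateBugs.zip")
    if archive.exists():
        print("checksum OK" if verify_archive(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found")
```

Reading in chunks keeps memory use flat regardless of the archive size.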
Additional details
References
- Ahmed Lamkanfi, Javier Perez, and Serge Demeyer. The Eclipse and Mozilla defect tracking dataset: a genuine dataset for mining bug information. In MSR '13: Proceedings of the 10th Working Conference on Mining Software Repositories, May 18–19, 2013, San Francisco, California, USA, 2013.