Dataset for Mining Off-by-One Errors
Description
The dataset was built by downloading the 500 most-starred Java projects from GitHub and then removing every project that also appears in the java-large-training or java-large-testing raw datasets*. The resulting dataset consists of 155 GitHub projects.
The repositories were downloaded and analyzed using the code in the following repository: https://github.com/serg-ml4se-2019/group5-deep-bugs/tree/master (more specifically, the code in the bug_mining folder).
* The java-large raw dataset can be found at https://github.com/tech-srl/code2seq/blob/master/README.md#datasets
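The filtering step described above amounts to a set difference over project names. The following is a minimal sketch of that idea, not the code from the bug_mining folder: the function name `filter_overlapping`, the `owner/repo` name format, the case-insensitive comparison, and the sample project names are all assumptions made for illustration.

```python
def filter_overlapping(candidates, excluded):
    """Return the candidates that do not appear in `excluded`, preserving order.

    Names are compared case-insensitively (an assumption for this sketch;
    the original mining scripts may match projects differently).
    """
    excluded_keys = {name.lower() for name in excluded}
    return [name for name in candidates if name.lower() not in excluded_keys]


if __name__ == "__main__":
    # Hypothetical project names, used only to illustrate the filtering.
    top_starred = ["org-a/project-one", "Org-B/Project-Two", "org-c/project-three"]
    java_large = ["org-b/project-two"]
    print(filter_overlapping(top_starred, java_large))
```

Applied to the real inputs, `candidates` would hold the 500 most-starred Java projects and `excluded` the projects in the java-large training and testing sets.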
Files
Name | Size |
---|---|
projects_for_bug_mining.zip (md5:8364fef3b2d60465d4637e2a842a6960) | 17.0 GB |
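Because the archive is 17.0 GB, it may be worth checking the download against the MD5 checksum listed for projects_for_bug_mining.zip before unpacking it. The sketch below uses Python's standard `hashlib` module and reads the file in chunks so that it does not have to fit in memory. The helper names `md5_of_file` and `verify_archive`, the 1 MiB chunk size, and the local file path are assumptions, not part of the dataset tooling.

```python
import hashlib

# Checksum as listed for projects_for_bug_mining.zip.
EXPECTED_MD5 = "8364fef3b2d60465d4637e2a842a6960"


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path, expected=EXPECTED_MD5):
    """Return True if the file at `path` matches the expected MD5 checksum."""
    return md5_of_file(path) == expected.lower()
```

Assuming the archive was saved to the current directory, `verify_archive("projects_for_bug_mining.zip")` returns `True` when the download is intact.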