Info: Zenodo’s user support line is staffed on regular business days between Dec 23 and Jan 5. Response times may be slightly longer than normal.

There is a newer version of the record available.

Published December 8, 2019 | Version 1.0
Dataset Open

Dataset for Mininig Off-by-One Errors

  • 1. TU Delft

Description

The dataset that was made by downloading top 500 starred Java projects from GitHub and then eliminating common projects found with java-large-training and java-large-testing raw datasets*. The resulting dataset consists of 155 GitHub projects.

The repositories were downloaded and the code analyzed using code found in the following repository: https://github.com/serg-ml4se-2019/group5-deep-bugs/tree/master (more specifically, the code found in bug_mining folder)

* java-large raw dataset can be found at https://github.com/tech-srl/code2seq/blob/master/README.md#datasets

Files

projects_for_bug_mining.zip

Files (17.0 GB)

Name Size Download all
md5:8364fef3b2d60465d4637e2a842a6960
17.0 GB Preview Download