Published June 11, 2019 | Version v1
Dataset Open

Data to Replicate paper Improving Bug Detection via Context-based Code Representation Learning and Attention-based Neural Networks part 1

Creators

  • 1. New Jersey Institute of Technology, USA

Description

Data to Replicate paper "Improving Bug Detection via Context-based Code Representation Learning and Attention-based Neural Networks" part 1. Part 2 accessible here: https://doi.org/10.5281/zenodo.3719225

The author of the paper uploaded dataset to Google Drive. These are the same files, uploaded to Zenodo. Since detection_data.tar.gz exceeded zenodo limits, I split the data into 2 parts detection_data.tar.gz and detection_data.tar.gz. This is the first part. Splitting was achieved on OS X with:

split -b 31000m "detection_data.tar.gz" "detection_data.tar.gz."

To get original file back, run

cat detection_data.tar.gz.* > detection_data.tar.gz

GitHub link to the project: https://github.com/OOPSLA-2019-BugDetection/OOPSLA-2019-BugDetection

Files

Files (35.2 GB)

Name Size Download all
md5:5ccd2520b0a278c05e55e8cd065ef9d7
32.5 GB Download
md5:21e223893dd1fed0d45a3312cd83b445
2.7 GB Download