Published January 27, 2022 | Version 1.0.0
Software Open

Source code for "TSSB-3M: Mining single statement bugs at massive scale"

  • 1. Carl von Ossietzky University Oldenburg

Description

This artifact contains the source code for "TSSB-3M: Mining single statement bugs at massive scale".

We include packages for running the commit miner and two additional packages that were developed in the process:

  • TSSBMiner.zip: A collection of Python script used to mine single statement bug fixes in open source repositories
  • code_diff.zip: A library for fast AST-based code differencing in Python. This package includes version 0.1.0. 
  • code_tokenize.py: A general purpose tokenizer and AST parser in Python. Mainly used as the parsing backend for code_diff. This package includes version 0.1.0

To replicate the mining process or part of it, please follow the instruction in the TSSBMiner README.

Code and descriptions also available on Github.

Files

code_diff.zip

Files (78.1 kB)

Name Size Download all
md5:9a8a017b465f3ebda096752902985c52
27.4 kB Preview Download
md5:6bf7c6cacfbb0c64ddf6cc13e6f575d0
28.8 kB Preview Download
md5:3f757aa73f7dde17f6bcf446f3514313
21.9 kB Preview Download