Published May 1, 2015 | Version 1.0
Dataset Open

Landfill: An open dataset of code smells with public evaluation

  • 1. University of Salerno, Italy
  • 2. Microsoft, USA
  • 3. Università della Svizzera italiana, Switzerland
  • 4. University of Molise, Italy
  • 5. The College of William and Mary, USA

Description

Code smells are symptoms of poor design and implementation choices that may hinder code comprehension and possibly increase the change- and fault-proneness of source code. Several techniques have been proposed in the literature for detecting code smells. These techniques are generally evaluated by comparing the candidate code smells they detect against a manually produced oracle. Unfortunately, with only a few exceptions, such comprehensive sets of annotated code smells are not available in the literature. This dataset provides 243 instances of five types of code smells identified in 20 open-source software projects. In particular, it contains a SQL file with the information about these instances and a zip file with their source code.
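As an illustration of how a detector might be compared against a manually produced oracle, the sketch below computes precision, recall, and F1 over sets of smell instances. It is a minimal example under assumptions: the description does not specify which metrics are used, the function name `evaluate_detector` is hypothetical, and the `(project, class, smell type)` tuples are invented placeholders rather than records from this dataset.

```python
def evaluate_detector(detected, oracle):
    """Compare detected smell instances with an oracle.

    Both arguments are iterables of hashable instance keys, for example
    (project, class_name, smell_type) tuples. This key layout is an
    assumption made for illustration, not the dataset's schema.
    Returns (precision, recall, f1).
    """
    detected, oracle = set(detected), set(oracle)
    true_positives = len(detected & oracle)

    # Guard against empty sets so the ratios are always defined.
    precision = true_positives / len(detected) if detected else 0.0
    recall = true_positives / len(oracle) if oracle else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical instances, not taken from the dataset.
oracle = {
    ("proj-a", "OrderManager", "Blob"),
    ("proj-a", "Invoice", "Feature Envy"),
    ("proj-b", "Parser", "Blob"),
    ("proj-b", "Cache", "Feature Envy"),
}
detected = {
    ("proj-a", "OrderManager", "Blob"),
    ("proj-b", "Parser", "Blob"),
    ("proj-b", "Logger", "Blob"),  # not in the oracle: a false positive
}

precision, recall, f1 = evaluate_detector(detected, oracle)
print(f"precision={precision:.2f} recall={recall:.2f} f1={f1:.2f}")
```

Using sets keeps the comparison independent of ordering and ignores duplicate reports of the same instance, which is usually what an oracle-based evaluation wants.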

Files

src.zip

Files (2.1 MB)

Name Size
md5:d5ef62ae5d85b655e521995af9d4a74c 171.4 kB
md5:c500e3961fb59a6ad667ea547aa26019 1.9 MB
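The md5 checksums listed for the files can be used to confirm that a download is intact. The following is a minimal sketch using Python's standard `hashlib` module. The helper names `md5_of_file` and `matches_checksum` are hypothetical, and the local file path in the usage comment is a placeholder, since this listing does not show which file name belongs to which checksum.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large archives need not fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Check a file against a checksum such as 'md5:d5ef...'.

    An optional 'md5:' prefix is accepted and the comparison
    ignores letter case.
    """
    expected_hex = expected.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected_hex


# Usage (the path is a placeholder for wherever the file was saved):
# ok = matches_checksum("path/to/downloaded_file",
#                       "md5:d5ef62ae5d85b655e521995af9d4a74c")
```

MD5 is suitable here only as an integrity check against accidental corruption; it is not a defense against deliberate tampering.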