A corpus of Java projects representing the 2012 Ohloh universe
Description
A corpus of Java projects built using the Software Projects Sampling (SPS) tool by Nagappan et al. [1]. SPS measures how representative a smaller corpus is of a universe of projects (here, Ohloh 2012) along a set of diversity dimensions, and constructs a maximally representative corpus by iteratively adding the project that increases representativeness the most.
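The iterative selection described above can be illustrated with a greedy coverage loop. The following is a minimal sketch and not the SPS implementation: the names `covered_by`, `greedy_sample`, and `similar`, the toy `universe`, and the similarity rule (same language, size within a factor of two) are all assumptions made for illustration, and representativeness is approximated here as the fraction of universe projects similar to at least one selected project.

```python
def covered_by(project, universe, similar):
    """Indices of universe projects that `project` is similar to."""
    return {i for i, other in enumerate(universe) if similar(project, other)}


def greedy_sample(universe, similar, max_size):
    """Pick up to `max_size` projects, each time taking the one that
    covers the most universe projects not covered so far.

    Returns (indices of chosen projects, fraction of universe covered).
    """
    if not universe:
        return [], 0.0
    cover_sets = [covered_by(p, universe, similar) for p in universe]
    covered = set()
    chosen = []
    while len(chosen) < max_size and len(covered) < len(universe):
        candidates = [i for i in range(len(universe)) if i not in chosen]
        if not candidates:
            break
        # Ties are broken by position: max() keeps the first best candidate.
        best = max(candidates, key=lambda i: len(cover_sets[i] - covered))
        gain = cover_sets[best] - covered
        if not gain:
            break  # no remaining project improves coverage
        chosen.append(best)
        covered |= gain
    return chosen, len(covered) / len(universe)


# Hypothetical similarity rule (an assumption, not taken from the source):
# same language and sizes within a factor of two of each other.
def similar(a, b):
    if a["lang"] != b["lang"]:
        return False
    lo, hi = sorted((a["kloc"], b["kloc"]))
    return hi <= 2 * lo


# Toy universe with made-up values, for illustration only.
universe = [
    {"lang": "Java", "kloc": 10},
    {"lang": "Java", "kloc": 12},
    {"lang": "Java", "kloc": 500},
    {"lang": "Java", "kloc": 480},
    {"lang": "Java", "kloc": 11},
]

chosen, coverage = greedy_sample(universe, similar, max_size=2)
```

In this toy setting two projects are enough to cover the whole universe, one from the small-size cluster and one from the large-size cluster. A greedy loop of this kind only guarantees the best next step, so the result depends on how ties are broken.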
The corpus contains the source code of the selected projects as of 1 June 2012, because the Ohloh data used to compute diversity dates from that period.
[1] M. Nagappan, T. Zimmermann, and C. Bird, “Diversity in software engineering research,” in ESEC/FSE. ACM, 2013, pp. 466–476.
Files
(783.5 MB)
Name | Size |
---|---|
md5:f449a0c58995384bccf998323c7e7123 | 783.5 MB |
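A downloaded copy of the archive can be checked against the MD5 checksum listed above. The following is a minimal sketch using Python's standard `hashlib`; the helper names `md5_of_file` and `matches_expected` are illustrative, and the path passed in is whatever local file name the download was saved under (the record does not state one here).

```python
import hashlib

# Checksum as listed in the Files table of this record.
EXPECTED_MD5 = "f449a0c58995384bccf998323c7e7123"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks
    so that a large archive is never loaded into memory at once."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_MD5):
    """True if the file at `path` has the expected MD5 checksum."""
    return md5_of_file(path) == expected.lower()
```

Calling `matches_expected` with the path of the downloaded file returns `True` only if the file is byte-for-byte the one described by the checksum; a mismatch usually points to an incomplete or corrupted download.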
Additional details
Related works
- Is identical to: urn:NBN:nl:ui:18-24399 (URN)