10.5281/zenodo.2878368
https://zenodo.org/records/2878368
oai:zenodo.org:2878368
CorĂ², Federico
Federico
CorĂ²
0000-0002-7321-3467
Gran Sasso Science Institute
Verdecchia, Roberto
Roberto
Verdecchia
0000-0001-9206-6637
Gran Sasso Science Institute & Vrije Universiteit Amsterdam
Cruciani, Emilio
Emilio
Cruciani
0000-0002-4744-5635
Gran Sasso Science Institute
Miranda, Breno
Breno
Miranda
0000-0001-9608-9393
Federal University of Pernambuco
Bertolino, Antonia
Antonia
Bertolino
0000-0001-8749-1356
Consiglio Nazionale delle Ricerche
JTeC: A Large Collection of Java Test Classes forTest Code Analysis and Processing
Zenodo
2019
Software Testing, GitHub, Test Suite, Large Scale
2019-05-19
eng
10.5281/zenodo.2558713
https://zenodo.org/communities/msr
2.0
Creative Commons Attribution 4.0 International
The recent push towards test automation and test-driven development continues to scale up the dimensions of test code that needs to be maintained, analysed, and processed side-by-side with production code. As a consequence, on the one side regression testing techniques, e.g., for test suite prioritization or test case selection, capable to handle such large-scale test suites become indispensable; on the other side, as test code exposes own characteristics, specific techniques for its analysis and refactoring are actively sought. We present JTeC, a large-scale dataset of test cases that researchers can use for benchmarking the above techniques or any other type of tool expressly targeting test code. JTeC collects more than 2.5M+ test classes belonging to 31K+ GitHub projects and summing up to more than 430 Million LOCs of ready-to-use real-world test code.
Companion page for the JTeC dataset at https://github.com/JTeCDataset/JTeC