Corò, Federico
Verdecchia, Roberto
Cruciani, Emilio
Miranda, Breno
Bertolino, Antonia
2019-05-19
<p>The recent push towards test automation and test-driven development continues to increase the volume of test code that needs to be maintained, analysed, and processed side-by-side with production code. As a consequence, on the one hand, regression testing techniques, e.g., for test suite prioritization or test case selection, that are capable of handling such large-scale test suites become indispensable; on the other hand, as test code exhibits its own characteristics, specific techniques for its analysis and refactoring are actively sought. We present JTeC, a large-scale dataset of test cases that researchers can use for benchmarking the above techniques or any other type of tool expressly targeting test code. JTeC collects more than 2.5M test classes belonging to more than 31K GitHub projects, totalling more than 430 million LOC of ready-to-use, real-world test code.</p>
Companion page for the JTeC dataset at https://github.com/JTeCDataset/JTeC
https://doi.org/10.5281/zenodo.3711509
oai:zenodo.org:3711509
eng
Zenodo
https://zenodo.org/communities/msr
https://doi.org/10.5281/zenodo.2558713
info:eu-repo/semantics/openAccess
Creative Commons Attribution 4.0 International
https://creativecommons.org/licenses/by/4.0/legalcode
Software Testing, GitHub, Test Suite, Large Scale
JTeC: A Large Collection of Java Test Classes for Test Code Analysis and Processing
info:eu-repo/semantics/other