Scalable and Accurate Test Case Prioritization in Continuous Integration Contexts
- 1. University of Ottawa
- 2. Carleton University
Description
This dataset is a benchmark of 25 open-source subjects, with 21.5k builds and 2.5k failed builds, that enables fair comparison and evaluation of Test Case Prioritization (TCP) techniques. Our data collection tools are available at github.com/Ahmadreza-SY/TCP-CI and can be used to extend and update the subjects. The structure and files of the dataset are also described in the documentation of the data collection tool.
For details on definitions, experiments, and results, please refer to our academic paper at arxiv.org/abs/2109.13168. Please cite the paper in any published work that uses resources provided in this dataset.
We provide two compressed files:
- TCP-CI-dataset.tar.gz: This file contains the dataset, the source code of the subjects, the build logs, and the results of the experiments conducted in our research. In other words, it includes all the resources required to replicate the study, which is why it is large.
- TCP-CI-main-dataset.tar.gz: This file contains only the dataset, which is described in our GitHub repository (github.com/Ahmadreza-SY/TCP-CI).
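Since the archive layout is documented in the tool's repository rather than on this page, it can help to inspect an archive before extracting it. The following is a minimal sketch using Python's standard `tarfile` module; the function name `top_level_entries` and the local path in the usage section are illustrative assumptions, not part of the dataset or its tooling.

```python
import sys
import tarfile
from pathlib import PurePosixPath


def top_level_entries(archive_path):
    """Return the sorted top-level names inside a gzip-compressed tar archive."""
    names = set()
    with tarfile.open(archive_path, "r:gz") as tar:
        # Iterating streams through the archive without extracting anything.
        for member in tar:
            parts = [p for p in PurePosixPath(member.name).parts if p != "/"]
            if parts:
                names.add(parts[0])
    return sorted(names)


if __name__ == "__main__":
    # Hypothetical usage: python inspect_archive.py TCP-CI-main-dataset.tar.gz
    for name in top_level_entries(sys.argv[1]):
        print(name)
```

Listing a gzip-compressed archive requires reading the whole file, so this may take a while on the larger archive.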
Files
(16.4 GB)
MD5 checksum | Size |
---|---|
md5:bcc24a1d6aecc9a3731511e4d8721d0e | 16.2 GB |
md5:728804085c757ff5357aa165b4b6384f | 237.1 MB |
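The MD5 checksums listed above can be used to confirm that a download completed without corruption. The sketch below uses Python's standard `hashlib` and reads the file in chunks so that a multi-gigabyte archive does not need to fit in memory. The helper names `md5_of` and `matches_checksum` are illustrative assumptions, and the checksum should be compared against the value listed for the file that was actually downloaded.

```python
import hashlib
import sys


def md5_of(path, chunk_size=1 << 20):
    """Compute the hexadecimal MD5 digest of a file, reading 1 MiB at a time."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file's MD5 with an expected value such as 'md5:<hex digest>'."""
    expected = expected.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of(path) == expected


if __name__ == "__main__":
    # Hypothetical usage: python verify.py <downloaded file> md5:<hex digest>
    path, expected = sys.argv[1], sys.argv[2]
    print("OK" if matches_checksum(path, expected) else "MISMATCH")
```

A mismatch usually indicates an incomplete or corrupted download, in which case downloading the file again is the simplest remedy.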
Additional details
Related works
- Is supplement to
- Preprint: https://arxiv.org/abs/2109.13168 (URL)
- Software: https://github.com/Ahmadreza-SY/TCP-CI (URL)