Published March 12, 2023 | Version msr23-v1.0
Dataset Open

GIRT-Data: Sampling GitHub Issue Report Templates

Description

GIRT-Data is the first and largest dataset of issue report templates (IRTs) in both YAML and Markdown format. This dataset and its corresponding open-source crawler tool are intended to support research in this area and to encourage more developers to use IRTs in their repositories. The stable version of the dataset, containing 1_084_300 repositories, that 50_032 of them support IRTs.

For more details see the GitHub page of the dataset: https://github.com/kargaranamir/girt-data


The dataset is accepted for MSR 2023 conference, under the title of "GIRT-Data: Sampling GitHub Issue Report Templates" Search in Google Scholar.

Files

characteristics_irts_markdown.csv

Files (503.1 MB)

Name Size Download all
md5:71d991a2a9ba2a65fcc8036f32a08a6d
209.8 MB Preview Download
md5:b0eb4905cdfc18ee493e4dbb9e73cbd3
29.1 MB Preview Download
md5:6e1ff42c923617ddf653a46e28014f5f
264.2 MB Preview Download
md5:5aa643faf1cc971068aecb81ded9c7d0
7.5 kB Download
md5:86435ec0c743b2b0936d0ddbc364dbca
105 Bytes Preview Download
md5:919cc9b712b54c2dc33c98035ec281ec
1.2 kB Download