Published January 25, 2022 | Version v5
Dataset Open

The Public Jira Dataset

  • 1. University of Hamburg

Description

Jira is an issue tracking system that supports software companies (among other types of companies) with managing their projects, community, and processes. This dataset is a collection of public Jira repositories downloaded from the internet using the Jira API V2. We collected data from 16 pubic Jira repositories containing 1822 projects and 2.7 million issues. Included in this data are historical records of 32 million changes, 9 million comments, and 1 million issue links that connect the issues in complex ways. This artefact repository contains the data as a MongoDB dump, the scripts used to download the data, the scripts used to interpret the data, and qualitative work conducted to make the data more approachable.

 

Please cite this work as:

Montgomery L, Lüders C, Maalej W. An Alternative Issue Tracking Dataset of Public Jira Repositories. In 2022 IEEE/ACM 19th International Conference on Mining Software Repositories (MSR) 2022 May 23 (pp. 73-77). IEEE.

Files

ThePublicJiraDataset.zip

Files (6.4 GB)

Name Size Download all
md5:6a9105606863f24c0a4339d13ea30b65
6.4 GB Preview Download