The Public Jira Dataset
Description
Jira is an issue tracking system that supports software companies (among other types of companies) with managing their projects, community, and processes. This dataset is a collection of public Jira repositories downloaded from the internet using the Jira API V2. We collected data from 16 public Jira repositories containing 1822 projects and 2.7 million issues. Included in this data are historical records of 32 million changes, 9 million comments, and 1 million issue links that connect the issues in complex ways. This artefact repository contains the data as a MongoDB dump, the scripts used to download the data, the scripts used to interpret the data, and qualitative work conducted to make the data more approachable.
Files
Name | Size | Checksum |
---|---|---|
2022.01.25 - ThePublicJiraDataset.zip | 6.4 GB | md5:9a65536540641fcde482988eb378426e |
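
Because the archive is large (6.4 GB), it may be worth checking the download against the MD5 checksum listed for the file before unpacking it. The following is a minimal sketch, not part of the dataset's own scripts; the function names `md5_of_file` and `verify_archive` are hypothetical, and the local path assumes the archive was saved under its original name in the current directory.

```python
import hashlib
from pathlib import Path

# MD5 checksum published in the file listing for the dataset archive.
EXPECTED_MD5 = "9a65536540641fcde482988eb378426e"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks to bound memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected checksum."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed local download location; adjust to wherever the archive was saved.
    archive = Path("2022.01.25 - ThePublicJiraDataset.zip")
    if archive.exists():
        print("checksum OK" if verify_archive(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

Reading in fixed-size chunks keeps memory use constant regardless of archive size. A mismatch most likely indicates an incomplete or corrupted download.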