There is a newer version of the record available.

Published January 25, 2022 | Version v3
Dataset Restricted

The Public Jira Dataset

  • 1. University of Hamburg

Description

Jira is an issue tracking system that helps software companies (among other types of organisations) manage their projects, communities, and processes. This dataset is a collection of public Jira repositories downloaded using the Jira API V2. We collected data from 16 public Jira repositories containing 1822 projects and 2.7 million issues. The data includes historical records of 32 million changes, 9 million comments, and 1 million issue links that connect the issues in complex ways. This artefact repository contains the data as a MongoDB dump, the scripts used to download the data, the scripts used to interpret the data, and the qualitative work conducted to make the data more approachable.
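To illustrate what downloading issues through the Jira API V2 can look like, the following is a minimal sketch of paginated retrieval from the `/rest/api/2/search` endpoint. It is not the download script shipped with this record. The helper names `search_url` and `iter_issues`, the injected `fetch_json` callable, and the example base URL are assumptions made for illustration only.

```python
from typing import Callable, Dict, Iterator
from urllib.parse import urlencode


def search_url(base_url: str, jql: str, start_at: int, max_results: int = 100) -> str:
    """Build a Jira REST API v2 search URL for one page of issues."""
    query = urlencode({"jql": jql, "startAt": start_at, "maxResults": max_results})
    return f"{base_url.rstrip('/')}/rest/api/2/search?{query}"


def iter_issues(
    fetch_json: Callable[[str], Dict],
    base_url: str,
    jql: str,
    page_size: int = 100,
) -> Iterator[Dict]:
    """Yield every issue matching `jql`, following startAt/total pagination.

    `fetch_json` is a hypothetical callable (an assumption of this sketch)
    that takes a URL and returns the decoded JSON response body, for example
    a thin wrapper around an HTTP client.
    """
    start_at = 0
    while True:
        page = fetch_json(search_url(base_url, jql, start_at, page_size))
        issues = page.get("issues", [])
        yield from issues
        start_at += len(issues)
        # Stop on an empty page or once the reported total has been reached.
        if not issues or start_at >= page.get("total", 0):
            break
```

Passing the HTTP call in as `fetch_json` keeps the pagination logic independent of any particular client library, so it can be exercised offline with canned responses. A real download would also need to handle rate limits and errors, which are omitted here.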

Notes

In this version of the dataset, the database dump contains personal information of the users of these Jira repositories. Accordingly, we have restricted (disabled) this version in favour of the later versions where the data has been anonymised.

Files

Restricted

The record is publicly accessible, but files are restricted to users with access.