There is a newer version of the record available.

Published August 5, 2022 | Version v1
Dataset Open

Dataset of Open-Source Software Developers Labeled by their Experience Level in the Project and their Associated Software Metrics

  • 1. EuroMov Digital Health in Motion, Univ. Montpellier & IMT Mines Ales, Ales, France

Description

Developers were extracted from 17 open-source projects from GitHub. Projects were chosen that use the java
programming language, the Spring framework and Maven/Gradle build tools. Along with these developers, 23
software engineering metrics were extracted for each of them. These metrics are either calculated by analyzing
the source code or relative to project management metadata. Each of these developers then have been
manually searched for in professional social media such as LinkedIn or Twitter to be labeled with their
experience level in their project. Outliers have been statistically detected and manually re-assigned when
needed. The resulting dataset contains 703 anonymized developers qualified by their 23 project-related
software engineering metrics and labeled for their experience. It is suitable for studies that need to connect
developers’ level of experience to tangible software engineering metrics.

Files

dataset_developers_metrics.csv

Files (195.9 kB)

Name Size Download all
md5:3e089dbc24727b0b3319c77ba943b04a
195.9 kB Preview Download