Published September 21, 2020 | Version v2
Dataset Open

Bot-NonBot-Commit-Msg

  • 1. The University of Tennessee

Description

This dataset contains the commit messages of the 13,150 bots and 13,150 human developers, as part of the data used in https://dl.acm.org/doi/10.1145/3379597.3387478. For privacy concerns, the developer identities have been replaced with their corresponding SHA1 values. The format of the data is:

<commit SHA>; <SHA1 value for developer identity>;commit message; <whether the developer is a bot or non-bot>; all git repositories the commit is a part of (separated by ;)

git repo format: userName_repoName

Files

Files (2.1 GB)

Name Size Download all
md5:bcab54cac884859c7845f192e7706e0a
2.1 GB Download