Published March 6, 2020 | Version v1
Dataset Open

A mapping between Bot Commit, Projects, Files, and Blobs

  • 1. University of Tennessee
  • 2. Carnegie Mellon University

Description

This dataset contains information about 150,633,947 file updates by 12,326,137 bot commits, in terms of  blob updates.

Each line in the file is of the following form:

commit_sha; projects (projectOwner_projectName) associated with the commit separated by comma; filename; new blob associated with the file; old blob associated with the file (before the commit)

Example:
00000a0cee231959811e6c5d808730240b325417;guyellis_plant;package.json;fadda016e338f970adc62395e77aab80b09fa11b;2e7b221f2bfdeda4ef78d3a56e039e456b6901bb


Contents of each blob can be accessed using the GitHub API:

https://api.github.com/repos/<projectOwner>/<projectName>/git/blobs/<blob id>

Example:

https://api.github.com/repos/guyellis/plant/git/blobs/fadda016e338f970adc62395e77aab80b09fa11b

See https://developer.github.com/v3/git/blobs/ for more deatils.

If you're using our data, please do not forget it cite it !!!

 

Files

Files (39.6 GB)

Name Size Download all
md5:e051452462da772e86f989b1c0e7b7a5
2.1 GB Download
md5:7e777b9abf148a71002f463c882f0eaa
2.1 GB Download
md5:d034c926ab86c23be7525a84e5c5ffac
2.1 GB Download
md5:1b106b361bd315fcc008955950ae3acb
2.1 GB Download
md5:a02bd92f4c4ade93bfa9215e53adf2c7
2.1 GB Download
md5:54e0e6b7ac9acdaea9d506a96139c2be
2.1 GB Download
md5:151354ee6fa88c28f5eeff1ef8b20e36
2.1 GB Download
md5:f6a2a123dd300dba34e9a8558b919cb9
2.1 GB Download
md5:92f009fc8956588def4ec49bf53b3320
2.1 GB Download
md5:82a19090a0862be1e28ba68e06344e03
2.1 GB Download
md5:2096264fe63fb6e72fe287551731d328
2.1 GB Download
md5:0da51283729320a615ffe11f59fc65ef
2.1 GB Download
md5:7ea81374b91c35db5f3cb8bfa6b5241a
2.1 GB Download
md5:b6a223a0d57ca8a7acd9ecffa0897161
2.1 GB Download
md5:4037448307f6d54dbd90221f3d53ae80
2.1 GB Download
md5:bcff018848f8ae5af50560d5529f2049
2.1 GB Download
md5:a22424e9cdb00b50446a03a0f3400456
2.1 GB Download
md5:d537217b2e47645fb65c6f33dbf06927
2.1 GB Download
md5:1a0bac409eafe94cf17f251bca6c2a19
930.0 MB Download