A mapping between Bot Commit, Projects, Files, and Blobs
Authors/Creators
- 1. University of Tennessee
- 2. Carnegie Mellon University
Description
This dataset contains information about 150,633,947 file updates by 12,326,137 bot commits, in terms of blob updates.
Each line in the file is of the following form:
commit_sha; projects (projectOwner_projectName) associated with the commit separated by comma; filename; new blob associated with the file; old blob associated with the file (before the commit)
Example:
00000a0cee231959811e6c5d808730240b325417;guyellis_plant;package.json;fadda016e338f970adc62395e77aab80b09fa11b;2e7b221f2bfdeda4ef78d3a56e039e456b6901bb
Contents of each blob can be accessed using the GitHub API:
https://api.github.com/repos/<projectOwner>/<projectName>/git/blobs/<blob id>
Example:
https://api.github.com/repos/guyellis/plant/git/blobs/fadda016e338f970adc62395e77aab80b09fa11b
See https://developer.github.com/v3/git/blobs/ for more deatils.
If you're using our data, please do not forget it cite it !!!
Files
Files
(39.6 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:e051452462da772e86f989b1c0e7b7a5
|
2.1 GB | Download |
|
md5:7e777b9abf148a71002f463c882f0eaa
|
2.1 GB | Download |
|
md5:d034c926ab86c23be7525a84e5c5ffac
|
2.1 GB | Download |
|
md5:1b106b361bd315fcc008955950ae3acb
|
2.1 GB | Download |
|
md5:a02bd92f4c4ade93bfa9215e53adf2c7
|
2.1 GB | Download |
|
md5:54e0e6b7ac9acdaea9d506a96139c2be
|
2.1 GB | Download |
|
md5:151354ee6fa88c28f5eeff1ef8b20e36
|
2.1 GB | Download |
|
md5:f6a2a123dd300dba34e9a8558b919cb9
|
2.1 GB | Download |
|
md5:92f009fc8956588def4ec49bf53b3320
|
2.1 GB | Download |
|
md5:82a19090a0862be1e28ba68e06344e03
|
2.1 GB | Download |
|
md5:2096264fe63fb6e72fe287551731d328
|
2.1 GB | Download |
|
md5:0da51283729320a615ffe11f59fc65ef
|
2.1 GB | Download |
|
md5:7ea81374b91c35db5f3cb8bfa6b5241a
|
2.1 GB | Download |
|
md5:b6a223a0d57ca8a7acd9ecffa0897161
|
2.1 GB | Download |
|
md5:4037448307f6d54dbd90221f3d53ae80
|
2.1 GB | Download |
|
md5:bcff018848f8ae5af50560d5529f2049
|
2.1 GB | Download |
|
md5:a22424e9cdb00b50446a03a0f3400456
|
2.1 GB | Download |
|
md5:d537217b2e47645fb65c6f33dbf06927
|
2.1 GB | Download |
|
md5:1a0bac409eafe94cf17f251bca6c2a19
|
930.0 MB | Download |