Published June 29, 2024 | Version v1
Dataset Open

A Dataset of Bot and Human Contributors' names in GitHub

Description

This repository provides a dataset of 2,150 contributors (1,035 bots and 1,115 humans) who were sufficiently active (at least 5 events on GitHub) as of 3 May 2024. The dataset accompanies the paper "A Bot Identification Model and Tool Based on GitHub Activity Sequences", published in the Journal of Systems and Software (JSS); see https://doi.org/10.1016/j.jss.2024.112287. The paper is co-authored by Natarajan Chidambaram, Alexandre Decan and Tom Mens (Software Engineering Lab, University of Mons, Belgium). This work is supported by Service Public de Wallonie Recherche under grant number 2010235 - ARIAC by DigitalWallonia4.AI, and by the Fonds de la Recherche Scientifique – FNRS under grant numbers J.0147.24, T.0149.22, and F.4515.23.

Files description

bots.txt - contains the login names of bots, one per line.

humans.txt - contains the login names of humans, one per line.
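Since both files use the same one-name-per-line layout, they can be combined into a single labelled mapping. The following is a minimal sketch, not part of the dataset itself: the function names `read_logins` and `load_labels`, the default paths, the UTF-8 encoding, and the choice to skip blank lines are all assumptions made for illustration.

```python
from pathlib import Path


def read_logins(path):
    """Return the non-empty, whitespace-stripped lines of a login file."""
    # Assumption: the files are UTF-8 text with one login name per line.
    text = Path(path).read_text(encoding="utf-8")
    return [line.strip() for line in text.splitlines() if line.strip()]


def load_labels(bots_path="bots.txt", humans_path="humans.txt"):
    """Map each login name to the label "bot" or "human"."""
    labels = {login: "human" for login in read_logins(humans_path)}
    # Assumption: if a name appeared in both files, the "bot" label would win.
    labels.update({login: "bot" for login in read_logins(bots_path)})
    return labels
```

Calling `load_labels()` from the directory that holds both downloaded files should yield one entry per contributor, which can then be compared against the counts given in the description.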

Files

Files (26.1 kB)

Checksum, Size
md5:ab51192d1265e61fe99594b354fb3d84, 15.0 kB
md5:c6f95acc7fbb319ac8441a6a194f1452, 11.1 kB
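A downloaded file can be checked against the MD5 digests listed above. The sketch below is illustrative only: the helper names `md5_of` and `matches_listing` are hypothetical, and because the listing does not state which digest belongs to which file, the check only tests membership in the set of listed digests.

```python
import hashlib

# MD5 digests copied from the file listing; the listing does not say
# which digest belongs to which file, so they are kept as an unordered set.
LISTED_MD5 = {
    "ab51192d1265e61fe99594b354fb3d84",
    "c6f95acc7fbb319ac8441a6a194f1452",
}


def md5_of(path, chunk_size=65536):
    """Compute the hexadecimal MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path):
    """Return True if the file's MD5 digest is one of the listed digests."""
    return md5_of(path) in LISTED_MD5
```

For example, `matches_listing("bots.txt")` would be expected to return `True` for an intact download and `False` for a truncated or modified copy.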