10.5281/zenodo.3986172
https://zenodo.org/records/3986172
oai:zenodo.org:3986172
João Eduardo Montandon
Luciana L. Silva
Marco Tulio Valente
Mining the Technical Roles of GitHub Users
Zenodo
2019
github
developer profile
technical role
technical recruiters
expertise
machine learning
2019-10-11
eng
10.5281/zenodo.2559483
2.0
Creative Commons Attribution 4.0 International
This record contains the scripts and data used in the study reported in the paper "Mining the Technical Roles of GitHub Users". The files are described in more detail below:
processed_ground_truth.csv: A CSV file with information on the developers considered in the study. For privacy reasons, the dataset has been preprocessed to remove identifying information. Please contact the authors if you need the original data.
processed_ground_truth_fullstack.csv: The same CSV file, but with fullstack developers included.
script.ipynb, utils.py: Source code of the script used in our study.
Dockerfile, docker-compose.yml, requirements.txt: Files to replicate the code environment used in this study.
BoW-tuning.csv: List of classification results for different bag-of-words parameters.
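
Because the column layout of the CSV files is not documented in this description, a quick inspection step can help before running the notebook. The following is a minimal sketch, not part of the original scripts: the helper name summarize_csv is hypothetical, and it assumes that each file is UTF-8 encoded and starts with a header row. It uses only the Python standard library and reports the column names and the number of data rows.

```python
import csv
from pathlib import Path


def summarize_csv(path):
    """Return (column_names, row_count) for a CSV file.

    Assumption: the first row of the file is a header row.
    """
    with Path(path).open(newline="", encoding="utf-8") as handle:
        reader = csv.reader(handle)
        header = next(reader, [])           # empty list if the file is empty
        row_count = sum(1 for _ in reader)  # remaining rows are data rows
    return header, row_count


if __name__ == "__main__":
    # File names taken from the list above; they are looked up in the
    # current working directory.
    for name in (
        "processed_ground_truth.csv",
        "processed_ground_truth_fullstack.csv",
        "BoW-tuning.csv",
    ):
        if Path(name).exists():
            columns, rows = summarize_csv(name)
            print(f"{name}: {rows} rows, columns = {columns}")
        else:
            print(f"{name}: not found in the current directory")
```

Running the script from the directory that holds the extracted files prints one summary line per file, which makes it easy to check the data against what script.ipynb expects.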