Published March 4, 2020 | Version 1.0
Dataset
Open
Software Developer Expertise GitHub and Stack Overflow data sets
Description
Cross-Platform Software Developer Expertise Learning by Norbert Eke
This data set is part of my Master's thesis project on developer expertise learning by mining Stack Overflow (SOTorrent) and GitHub (GHTorrent) data. Check out my portfolio website at norberte.github.io.
Files (6.8 GB)
Previewed file: GH_annotations_processed.csv
MD5 checksum | Size |
---|---|
md5:5684d71aecde6b47903e1012cc0d3574 | 413.4 kB |
md5:443aa1952181c38a0facb957c3677732 | 413.5 kB |
md5:5250d80cd0a788bb47b7bd23908f5546 | 2.0 MB |
md5:a6ee8128488a9e217d5fd2929b836061 | 2.0 MB |
md5:a55ab897a02d0f6b067e2d1846f64d25 | 72.3 kB |
md5:15b95df1b46a2f19c97c435ed4f8fb29 | 40.7 kB |
md5:6ec5f55a520a8b2079fe6926245689ec | 1.2 GB |
md5:27796b11c3068baec55d436a81cb228e | 1.2 GB |
md5:efecbed0ee91de52bb45cded93cabaf9 | 591.1 MB |
md5:54ef672adead3f1b7fc01ee4946efeaf | 790 Bytes |
md5:4fd864c2c7404139feb298ead44d1250 | 661.9 MB |
md5:e74ed9467cec86e02c8f3dcf020eb13c | 891.3 kB |
md5:536f58545c5b546354f1b16c73969070 | 779 Bytes |
md5:7ba3bd09eaa4a6c9cbf6d60f1d8c877e | 5.9 kB |
md5:dcfe611317c8bafc4f2f3129080e7dbe | 13.1 MB |
md5:0272e7edefdaa8e1015930d960b63d6d | 11.2 MB |
md5:daab386f4687f2d9aa858f964812c953 | 10.1 MB |
md5:3e9f9efe9a492ca7e65d9a390588edfe | 8.8 MB |
md5:b39914118aaaf183ce23a63d1f0a86a5 | 73.0 kB |
md5:67ed398a0bf8efc6739ad47a51e38382 | 41.8 kB |
md5:c3b746d170471cee172e719f1f95f77d | 112.3 MB |
md5:d62ab51a59ddf1e8ec72aba93ccdec4b | 112.4 MB |
md5:a4e01df0b69b0c1df324087be8d97e55 | 91.9 MB |
md5:d9bda57d2ad46151526de3790470eff9 | 106.1 MB |
md5:40fae295205ba250cc61d556c0de4083 | 1.4 GB |
md5:20431d032594999e9d673c9e8f289d50 | 18.9 MB |
md5:8406f1d62d367988f9b5e739c46bdef7 | 37.6 kB |
md5:4205d4c44975ec60a1f75fde20c595ba | 8.1 kB |
md5:25a099c0b3ada5cc7ea7c411cdc24ba4 | 579.6 MB |
md5:c0b143ba038a37489b26249409f49c14 | 237.0 MB |
md5:ab424e95da226bf18fcc3ab079fa9fbb | 45.1 MB |
md5:43ddcfb43fcb60deebc28956b7739735 | 56.1 MB |
md5:0a66983612a3cac3068924bc53eb3782 | 11.1 MB |
md5:d96cc8ccfdca82d8c098d89657b16dd1 | 258.2 MB |
md5:80584662697fc7779b1884d36c63a9a0 | 12.4 kB |
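Because the listing publishes an MD5 checksum for each file, a download can be checked against it before use. The following is a minimal sketch, not part of the data set itself: the helper names `md5_of_file` and `matches_listing` are hypothetical, and it assumes you already know which local file corresponds to which checksum, since that pairing is not stated here.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in 1 MiB chunks.

    Chunked reading keeps memory use flat even for multi-gigabyte files.
    """
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed_checksum):
    """Compare a file's MD5 with a listed value such as 'md5:<hex>' or '<hex>'.

    The 'md5:' prefix handling is an assumption based on how checksums
    are written in the file listing.
    """
    expected = listed_checksum.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

A call such as `matches_listing("some_download.csv", "md5:<value copied from the listing>")` returns `True` only when the local file is byte-for-byte identical to the published one; the file name in that call is a placeholder.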
Additional details
References
- Gousios, Georgios. "The GHTorrent dataset and tool suite." 2013 10th Working Conference on Mining Software Repositories (MSR). IEEE, 2013.
- Baltes, Sebastian, et al. "SOTorrent: Reconstructing and analyzing the evolution of Stack Overflow posts." Proceedings of the 15th International Conference on Mining Software Repositories. 2018.
- Vasilescu, Bogdan, Vladimir Filkov, and Alexander Serebrenik. "StackOverflow and GitHub: Associations between software development and crowdsourced knowledge." 2013 International Conference on Social Computing. IEEE, 2013.