Dataset Open Access
Benjamin S. Meyers;
Nuthan Munaiah;
Emily Prud'hommeaux;
Andrew Meneely;
Cecilia O. Alm;
Josephine Wolff;
Pradeep K. Murukannaiah
This dataset was released as part of the following publication.
Files:
chromium_conversations.csv
This is the full dataset containing over 1.5 million comments posted by developers reviewing proposed code changes. The dataset also includes the values we calculated for all nine linguistic features (described in Section 4 of the paper cited above).
chromium_conversations_annotations.csv
This dataset is a subset of the chromium_conversations.csv dataset. It contains the data used in the classification experiment outlined in Section 5 of the paper cited above (2,994 comments automatically identified as acted-upon and 800 comments manually identified as not (known-to-be) acted-upon).
CSV Fields:
Name | Size | |
---|---|---|
chromium_conversations.csv
md5:7cb1bcca65ca9609bb07580b0ff42cdd |
1.1 GB | Download |
chromium_conversations_annotations.csv
md5:bb416635063a2d569142cde1367d2ab7 |
1.5 MB | Download |
README.md
md5:efdb7cb2c70b51171af12773a89cc41e |
6.6 kB | Download |
All versions | This version | |
---|---|---|
Views | 454 | 454 |
Downloads | 443 | 444 |
Data volume | 186.2 GB | 187.3 GB |
Unique views | 414 | 414 |
Unique downloads | 333 | 334 |