Dataset Open Access

Dataset: "I Can't Keep It Up." A Dataset from the Defunct Voat.co News Aggregator

Mekacher, Amin; Papasavva, Antonis

This is the dataset released with the paper "I Can't Keep It Up." A Dataset from the Defunct Voat.co News Aggregator.
The dataset consists of 15,133 newline-delimited JSON (ndjson) files: 7,616 for submission data, 7,515 for comment data, 1 for user data, and 1 for subverse data. Each line in an ndjson file is a single JSON object. The JSON objects contain all the key/value pairs we collected through the Voat API and through a custom parser for the Internet Archive Wayback Machine Voat snapshot release.
For a detailed description of every key in the JSON structure, along with the type of its value, please read the Readme.pdf file provided with this dataset.
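As a minimal sketch of how the one-object-per-line layout described above can be read, the following Python snippet parses an ndjson file line by line and counts the records in a directory. It assumes that voat_dataset.zip has already been extracted locally and that the files carry an .ndjson extension; the function names, the directory layout, and the extension are illustrative assumptions, not something documented by the dataset. The available keys are described in Readme.pdf, so none are hard-coded here.

```python
import json
from pathlib import Path
from typing import Iterator


def iter_ndjson(path: Path) -> Iterator[dict]:
    """Yield one parsed JSON object per non-empty line of an ndjson file."""
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:  # skip blank lines defensively
                yield json.loads(line)


def count_records(directory: Path, pattern: str = "*.ndjson") -> int:
    """Count JSON objects across all files matching `pattern` under `directory`.

    The "*.ndjson" pattern is an assumption about how the extracted
    files are named; adjust it to the actual archive layout.
    """
    return sum(1 for path in directory.rglob(pattern) for _ in iter_ndjson(path))
```

Reading lazily, one line at a time, keeps memory use flat regardless of file size. A call such as `count_records(Path("voat_dataset"))` would then report the number of records under a hypothetical extraction directory named voat_dataset.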

 

If you find our dataset useful, please cite our paper:

@inproceedings{mekacher2022can,
  title={"I Can't Keep It Up." A Dataset from the Defunct Voat.co News Aggregator},
  author={Mekacher, Amin and Papasavva, Antonis},
  booktitle={16th International Conference on Web and Social Media},
  year={2022}
}
Files (2.2 GB)
Readme.pdf (214.8 kB), md5:44476ff47e8f99b8306cc0ca6cdf2be9
voat_dataset.zip (2.2 GB), md5:6437dcccc121766858bf018e6c903114
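The MD5 checksums listed for the two files can be used to check that a download completed intact. The sketch below shows one way to do this with Python's standard hashlib module; the helper names and the local file paths are assumptions for illustration, while the digests are the ones given in the file listing.

```python
import hashlib

# Digests copied from the file listing of this record.
EXPECTED_MD5 = {
    "Readme.pdf": "44476ff47e8f99b8306cc0ca6cdf2be9",
    "voat_dataset.zip": "6437dcccc121766858bf018e6c903114",
}


def md5_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path: str, expected_md5: str) -> bool:
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected_md5.lower()
```

For example, `verify("voat_dataset.zip", EXPECTED_MD5["voat_dataset.zip"])` should return True for an intact copy saved under that name in the current directory. Reading in chunks avoids loading the 2.2 GB archive into memory at once.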