Published January 10, 2024 | Version v2
Dataset Open

The Koo Dataset: An Indian Microblogging Platform With Global Ambitions

Creators

Description

This is the dataset released with the paper titled "The Koo Dataset: An Indian Microblogging Platform With Global Ambitions". 

The dataset contains 43 JSON files containing the posts made on the platform, 34 JSON files for the comments, the shares and the likes. It also contains a JSON file for the user profiles. The metadata included in each data type is described in the paper.

If you use our dataset, please cite the arXiv version:

@misc{mekacher2024koo,
      title={The Koo Dataset: An Indian Microblogging Platform With Global Ambitions}, 
      author={Amin Mekacher and Max Falkenberg and Andrea Baronchelli},
      year={2024},
      eprint={2401.07599},
      archivePrefix={arXiv},
      primaryClass={cs.SI}
}

Files

koo_comments.zip

Files (23.2 GB)

Name Size Download all
md5:b05da107aca7bb178c6b2b004820077b
8.1 GB Preview Download
md5:2fd4dd4829c8e7ba46c3827f55a6ae35
5.7 GB Preview Download
md5:ef675075b18982b28d9d218b6af2bb03
7.4 GB Preview Download
md5:1a0bf33d431743a95e6bb0d64b8464fd
1.7 GB Preview Download
md5:e42146597aa3ac1eb6f0c808367052c8
318.4 MB Preview Download