Published October 20, 2020 | Version v1
Preprint Open

PoliWAM: An Exploration of a Large Scale Corpus of Political Discussions on WhatsApp Messenger

  • 1. TCS Research and Innovation
  • 2. IIT Gandhinagar

Description

WhatsApp Messenger is one of the most popular channels for spreading information with a current reach of more than 180 countries and 2 billion people. Its widespread usage has made it one of the most popular media for information propagation among masses during any socially engaging event. In the recent past, several countries have witnessed its effectiveness and influence in political and social campaigns. We observe a high surge in information and propaganda flow during elections. To explore such activities, in this paper, we discuss challenges, methodology, and opportunities in data curation from WhatsApp for politics-based exploratory studies. As a use case, we study the period before, during, and after the Indian General Elections 2019, encompassing all major Indian political parties. We present several complementing insights into the investigative and sensational news stories from the same period. Exploratory data analysis and experiments showcase several exciting results and future research opportunities. To facilitate reproducible research, we make the anonymized datasets available in the public domain. 

If you are using this dataset as part of your research, please cite the following paper

@article{srivastava2020poliwam,
  title={PoliWAM: an exploration of a large scale corpus of political discussions on WhatsApp messenger},
  author={Srivastava, Vivek and Singh, Mayank},
  journal={arXiv preprint arXiv:2010.13263},
  year={2020}
}

Files

WAM_dataset.zip

Files (22.2 MB)

Name Size Download all
md5:b0f96f9933db9280ec1d1a4b73b9b48d
22.2 MB Preview Download