Published January 1, 2019 | Version v1
Dataset Open

FIRE 2018 IRMiDis track dataset: Fact-checkable tweets posted during disasters

  • 1. UEM Kolkata, India
  • 2. IIT Kharagpur, India
  • 3. IIT Kanpur, India

Description

This is the dataset used for the FIRE 2018 track on Information Retrieval from Microblogs during Disasters (IRMiDis). 

The dataset contains ~50K tweets (microblogs) and ~6.8K news articles posted after the 2015 Nepal earthquake. It can be used for tasks such as: (i) identifying fact-checkable tweets among the tweets posted during a disaster (the dataset includes a gold-standard set of fact-checkable tweets), and (ii) identifying news articles that support or oppose a fact-checkable tweet.

Notes

If you use this dataset, please cite the following paper: Moumita Basu, Saptarshi Ghosh, Kripabandhu Ghosh. Overview of the FIRE 2018 track: Information Retrieval from Microblogs during Disasters (IRMiDis). Proceedings of the Annual Meeting of the Forum for Information Retrieval Evaluation (FIRE), Gandhinagar, India, pp. 1-5, December 2018.

Files

IRMiDis-FIRE-2018-Dataset.zip (41.0 MB)
md5:f97ec0fa526745c1dfb117f6c9629ac6
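
The MD5 checksum listed above can be used to confirm that the archive downloaded intact. The following is a minimal sketch, assuming the file was saved under its original name in the current working directory; the helper names `md5_of_file` and `matches_expected` are illustrative and not part of the dataset.

```python
import hashlib
from pathlib import Path

# Values copied from the file listing on this page.
ARCHIVE_NAME = "IRMiDis-FIRE-2018-Dataset.zip"
EXPECTED_MD5 = "f97ec0fa526745c1dfb117f6c9629ac6"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so the whole archive is never held in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_md5=EXPECTED_MD5):
    """Return True if the file's MD5 digest equals the expected value."""
    return md5_of_file(path) == expected_md5.lower()


if __name__ == "__main__":
    archive = Path(ARCHIVE_NAME)  # assumed download location
    if not archive.exists():
        print(f"{archive} not found; download it first.")
    elif matches_expected(archive):
        print("Checksum OK.")
    else:
        print("Checksum mismatch; the download may be corrupted.")
```

A mismatch usually indicates an incomplete or corrupted download, in which case fetching the archive again is the simplest remedy.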