Published July 15, 2017 | Version v1
Dataset Open

Retrieval and summarization of microblogs posted after a disaster event: SMERP 2017 dataset

  • 1. IIT Kharagpur, India
  • 2. IIT Kanpur, India
  • 3. IBM, Ireland
  • 4. IIIT Delhi, India
  • 5. Dublin City University, Ireland
  • 6. KU Leuven, Belgium

Description

This is the dataset used in the Data Challenge track of the ECIR 2017 Workshop on Exploitation of Social Media for Emergency Relief and Preparedness (SMERP 2017).

The Data Challenge track was about extracting and summarizing information relevant to a set of practical information needs (topics) that are critical for post-disaster relief operations, such as need and availability of resources, infrastructure damage and restoration, etc. The track used a dataset of tweets / microblogs posted during the August 2016 earthquake in central Italy. Specifically, the data challenge consisted of two tasks:
(1) Retrieve the microblogs that are relevant to the given set of topics, and
(2) Summarizing the microblogs that are relevant to the given set of topics.  


This dataset can be used to develop algorithms for retrieval and summarization of microblogs that are useful for post-disaster relief operations, in the aftermath of a disaster.

For more details, refer to the workshop report.

Notes

Please cite the following report on the SMERP 2017 workshop if you use this dataset: Saptarshi Ghosh. Kripabandhu Ghosh, Debasis Ganguly, Tanmoy Chakraborty, Gareth J. F. Jones, Marie-Francine Moens. ECIR 2017 Workshop on Exploitation of Social Media for Emergency Relief and Preparedness (SMERP 2017). ACM SIGIR Forum Newsletter, Volume 51, Issue 1, pp. 36-41, June 2017.

Files

SMERP2017-dataset.zip

Files (567.4 kB)

Name Size Download all
md5:fabe8c4ecbaa24535da12e4cb5cc3092
567.4 kB Preview Download