Dataset Open Access

The impact of news exposure on collective attention in the United States during the 2016 Zika epidemic

Michele Tizzoni; André Panisson; Daniela Paolotti; Ciro Cattuto

This repository contains the data of the study "The impact of news exposure on collective attention in the United States during the 2016 Zika epidemic".

Epidemiological data

The archive zika_USA_weekly_cases_2016.zip contains weekly ZIKV incidence counts by state, as reported by the US Centers for Disease Control and Prevention (CDC) in 2016. Data were extracted from reports made publicly available by the CDC at: https://zenodo.org/record/584136#.Xk07-RNKjOQ

Web news data

The file news_GDELT_data.csv.gz contains all news items extracted from the GDELT platform (https://www.gdeltproject.org/) that match TAX_DISEASE_ZIKA as a Theme and United_States as a Location.

TV closed captions

The file zika_TV_mentions_dataframe.csv contains all the TV news items of 2016 matching the word "Zika" in the TV News Archive (https://archive.org/details/tv).

Wikipedia pageview counts

Dataset 1: wikipedia_dataset1_zika_daily_pageview_usa.csv

Content of each line of the dataset: day, pageview_count

The dataset contains the daily pageview counts of 128 different Wikipedia pages related to the Zika virus, summed into a single total, for pageviews originating in the United States from January 1st to December 31st, 2016.
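As a minimal sketch of how Dataset 1 could be read, the snippet below parses lines of the form `day, pageview_count` using only the Python standard library. The function name `read_daily_pageviews` is hypothetical, and several details are assumptions not stated in this record: that the file is comma-separated, that any header row can be recognised by a non-numeric second field, and that the day can be kept as an opaque string. The sample rows are made up for illustration.

```python
import csv
import io


def read_daily_pageviews(lines):
    """Parse `day, pageview_count` rows into a list of (day, count) pairs.

    Rows whose second field is not a non-negative integer are skipped;
    this is assumed to cover a possible header row.
    """
    rows = []
    for record in csv.reader(lines, skipinitialspace=True):
        if len(record) < 2:
            continue  # blank or malformed line
        day, count = record[0].strip(), record[1].strip()
        if not count.isdigit():
            continue  # assumed header (or non-numeric) row
        rows.append((day, int(count)))
    return rows


# Made-up sample rows for illustration only; not values from the dataset.
sample = io.StringIO("day,pageview_count\n2016-01-01,10\n2016-01-02,25\n")
daily = read_daily_pageviews(sample)
total = sum(count for _, count in daily)
print(daily, total)
```

To apply this to the real file, pass an open file handle instead of the in-memory sample, for example one obtained with open("wikipedia_dataset1_zika_daily_pageview_usa.csv", newline="").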

Dataset 2: wikipedia_dataset2_zika_daily_pageview_bystate.zip

Content of each line of the dataset: day, pageview_count, state

The dataset contains the daily pageview counts of 128 different Wikipedia pages related to the Zika virus, summed into a single total, for pageviews originating in the United States, disaggregated by state, from January 1st to December 31st, 2016.
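The per-state layout lends itself to simple aggregation. The sketch below sums pageviews per state from lines of the form `day, pageview_count, state`. The function name `total_pageviews_by_state` is hypothetical. It assumes comma-separated rows; the internal layout of the zip archive and the way states are encoded (full names or abbreviations) are not documented here, so the sample rows are invented and the archive must be extracted first.

```python
import csv
import io
from collections import defaultdict


def total_pageviews_by_state(lines):
    """Sum pageview_count per state from `day, pageview_count, state` rows."""
    totals = defaultdict(int)
    for record in csv.reader(lines, skipinitialspace=True):
        if len(record) < 3:
            continue  # blank or malformed line
        _day, count, state = (field.strip() for field in record[:3])
        if not count.isdigit():
            continue  # assumed header (or non-numeric) row
        totals[state] += int(count)
    return dict(totals)


# Made-up sample rows for illustration only; not values from the dataset.
sample = io.StringIO(
    "2016-01-01,4,Florida\n"
    "2016-01-01,1,Texas\n"
    "2016-01-02,6,Florida\n"
)
by_state = total_pageviews_by_state(sample)
print(by_state)
```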

Dataset 3: wikipedia_dataset3_zika_pagecount_by_city.csv

Content of each line of the dataset: US_city, pageview_count_Zika, pageview_count_total

The dataset contains the total pageview counts of 128 different Wikipedia pages related to the Zika virus (pageview_count_Zika) originating in 788 cities (US_city) of the United States with a population larger than 40,000 in 2016. For the same 788 cities, the dataset also contains the total pageview counts to all Wikipedia pages across all Wikipedia projects (pageview_count_total).
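Because Dataset 3 reports both Zika-related and total pageviews per city, one natural use is to compare cities by the fraction of their Wikipedia traffic that went to Zika pages. The sketch below computes that ratio. It is an illustration rather than a description of the authors' analysis; the function name `zika_share_by_city` is hypothetical. It assumes comma-separated rows and that city names containing commas are quoted. The sample rows are made up.

```python
import csv
import io


def zika_share_by_city(lines):
    """Return {US_city: pageview_count_Zika / pageview_count_total}."""
    shares = {}
    for record in csv.reader(lines, skipinitialspace=True):
        if len(record) < 3:
            continue  # blank or malformed line
        city, zika, total = (field.strip() for field in record[:3])
        if not (zika.isdigit() and total.isdigit()):
            continue  # assumed header (or non-numeric) row
        if int(total) == 0:
            continue  # avoid division by zero
        shares[city] = int(zika) / int(total)
    return shares


# Made-up sample rows for illustration only; not values from the dataset.
sample = io.StringIO(
    "US_city,pageview_count_Zika,pageview_count_total\n"
    "City A,50,100000\n"
    "City B,30,20000\n"
)
shares = zika_share_by_city(sample)
for city, share in sorted(shares.items(), key=lambda item: -item[1]):
    print(f"{city}: {share:.4%}")
```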

Files (571.1 MB)

Name    Size    MD5 checksum

news_GDELT_data.csv.gz    543.6 MB    md5:3cea5c38ceeca82f92e46cecec498b93
wikipedia_dataset1_zika_daily_pageview_usa.csv    6.0 kB    md5:9a001dbd261b3cddb2be00d3b1e2ea3f
wikipedia_dataset2_zika_daily_pageview_bystate.zip    78.2 kB    md5:7ab2e612f16d5bac8e744a1b0e55143f
wikipedia_dataset3_zika_pagecount_by_city.csv    75.9 kB    md5:29f25da4fa9e79bb0ceef15c3cc4d8ce
zika_TV_mentions_dataframe.csv    27.2 MB    md5:5175ce318eb7e8deaf3095ce5a851802
zika_USA_weekly_cases_2016.zip    94.6 kB    md5:664e2e4b71f0c7136cceb7a7be5ee96b
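The MD5 checksums listed with the files can be used to confirm that a download completed intact. The sketch below is one way to do this with Python's standard hashlib module. The helper names `md5_of_file` and `verify_download` are hypothetical, the checksums are copied from the file listing, and the local paths are assumed to be wherever the files were saved.

```python
import hashlib

# Checksums copied from the file listing of this record.
EXPECTED_MD5 = {
    "news_GDELT_data.csv.gz": "3cea5c38ceeca82f92e46cecec498b93",
    "wikipedia_dataset1_zika_daily_pageview_usa.csv": "9a001dbd261b3cddb2be00d3b1e2ea3f",
    "wikipedia_dataset2_zika_daily_pageview_bystate.zip": "7ab2e612f16d5bac8e744a1b0e55143f",
    "wikipedia_dataset3_zika_pagecount_by_city.csv": "29f25da4fa9e79bb0ceef15c3cc4d8ce",
    "zika_TV_mentions_dataframe.csv": "5175ce318eb7e8deaf3095ce5a851802",
    "zika_USA_weekly_cases_2016.zip": "664e2e4b71f0c7136cceb7a7be5ee96b",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, name):
    """Return True if the file at `path` matches the listed checksum for `name`."""
    return md5_of_file(path) == EXPECTED_MD5[name]
```

Reading in chunks keeps memory use low, which matters for the 543.6 MB GDELT file. A call such as verify_download("news_GDELT_data.csv.gz", "news_GDELT_data.csv.gz") returns False if the local copy is truncated or corrupted.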
  • Tizzoni M, Panisson A, Paolotti D, Cattuto C (2020) The impact of news exposure on collective attention in the United States during the 2016 Zika epidemic. PLoS Comput Biol 16(3): e1007633.
