Published August 14, 2020 | Version v3
Dataset Open

A dataset of media releases (Twitter, News and Comments, Youtube, Facebook) form Poland related to COVID-19 for open research

  • 1. Interdisciplinary Research Institute in Wrocław

Contributors

Researcher:

Work package leader:

  • 1. Institute of Political Studies, Polish Academy of Sciences
  • 2. Institute of Sociology, University of Wrocław

Description

Social behavior has a fundamental impact on the dynamics of infectious diseases (such as COVID-19), challenging public health mitigation strategies and possibly the political consensus. The widespread use of the traditional and social media on the Internet provides us with an invaluable source of information on societal dynamics during pandemics. With this dataset, we aim to understand mechanisms of COVID-19 epidemic-related social behavior in Poland deploying methods of computational social science and digital epidemiology. We have collected and analyzed COVID-19 perception on the Polish language Internet during 15.01-31.07(06.08) and labeled data quantitatively (Twitter, Youtube, Articles) and qualitatively (Facebook, Articles and Comments of Article) in the Internet by infomediological approach.

- manually labelled1,449 articles / Facebook posts from Lower Silesia (facebook_articles_lower_silesia.zip) and 111 texts from outside this region;

-manually labelled 1000 most popular tweets (twits_annotated.xlsx) with cathegories is_fake (categorical and numeric) topic and sentiment; 

-extracted 57,306 representative articles (articles_till_06_08.zip) in Polish using Eventregitry.org tool in language Polish and topic "Coronavirus" in article body;

- extracted 1,015,199 (tweets_till_31_07_users.zip and tweets_till_31_07_text.zip) and Tweets from #Koronawirus in language Polish using Twitter API.

- collected 1,574 videos (youtube_comments_till_31_07.zip and youtube_movie.csv) with keyword: Koronawirus on YouTube and 247,575 comments on them using Google API;

- We supplemented the media observations with an analysis of 244 social empirical studies till 25.05 on COVID-19 in Poland (empirical_social_studies.csv).

Reports and analyzes and coding books can be found in Polish at: http://www.infodemia-koronawirusa.pl

Main report (in Polish) https://depot.ceon.pl/handle/123456789/19215  

Files

articles_till_06_08.zip

Files (147.7 MB)

Name Size Download all
md5:3838a903a340fa7223e2892a833a2484
11.1 MB Preview Download
md5:5a7f7909df84ec51247b9bd53346888b
94.3 kB Preview Download
md5:9f7be5eedbbb3f4b01a859d1b6fd719c
1.2 MB Preview Download
md5:ff73803828c782e4691eda40f4152106
69.8 MB Preview Download
md5:0ddfb4b0b8c0394b650f00f0e790bb5d
25.9 MB Preview Download
md5:7712e830524eff7339007001d8976609
155.4 kB Download
md5:a6b8f92ac7203a02329dd2f5c78c950d
38.6 MB Preview Download
md5:bc4fedcad2ed636116b2f6313a73fcbb
923.8 kB Preview Download

Additional details

Funding

EOSCsecretariat.eu – EOSCsecretariat.eu 831644
European Commission

References

  • Jarynowski A, Wójta-Kempa M, Belik V. TRENDS IN PERCEPTION OF COVID-19 IN POLISH INTERNET, Polish Epidemiological Review
  • Jarynowski A, Wójta-Kempa M, Płatek D, Krzowski Ł, Belik V. Spatial Diversity of COVID-19 Cases in Poland Explained by Mobility Patterns - Preliminary Results 2020; http://dx.doi.org/10.2139/ssrn.3621152