There is a newer version of this record available.

Dataset Open Access

A dataset of media releases (Twitter, News and Comments, Youtube, Facebook) from Poland related to COVID-19 for open research

Andrzej Jarynowski

Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="" xmlns:oai_dc="" xmlns:xsi="" xsi:schemaLocation="">
  <dc:contributor>Daniel Płatek</dc:contributor>
  <dc:contributor>Małgorzata Stochmal</dc:contributor>
  <dc:creator>Andrzej Jarynowski</dc:creator>
  <dc:description>Social behavior has a fundamental impact on the dynamics of infectious diseases (such as COVID-19), challenging public health mitigation strategies and possibly the political consensus. The widespread use of the traditional and social media on the Internet provides us with an invaluable source of information on societal dynamics during pandemics. With this dataset, we aim to understand mechanisms of COVID-19 epidemic-related social behavior in Poland deploying methods of computational social science and digital epidemiology. We have collected and analyzed COVID-19 perception on the Polish language Internet during 15.01-31.07(06.08) and labeled data quantitatively (Twitter, Youtube, Articles) and qualitatively (Facebook, Articles and Comments of Article) in the Internet by infomediological approach.

- manually labeled1,449 articles / Facebook posts from Lower Silesia ( and 111 texts from outside this region;

-extracted 57,306 representative articles ( in Polish using tool in language Polish and topic "Coronavirus" in article body;

- extracted 1,015,199 ( and and Tweets from #Koronawirus in language Polish usiing Twitter API.

- collected 1,574 videos ( and youtube_movie.csv) with keyword: Koronawirus on YouTube and 247,575 comments on them using Google API;

- We supplemented the media observations with an analysis of 244 social empirical studies till 25.05 on COVID-19 in Poland (empirical_social_studies.csv).

Reports and analyzes and coding books can be found in Polish at:</dc:description>
  <dc:subject>media analysis</dc:subject>
  <dc:subject>risk perception</dc:subject>
  <dc:title>A dataset of media releases (Twitter, News and Comments, Youtube, Facebook) from Poland related to COVID-19 for open research</dc:title>
All versions This version
Views 2,385417
Downloads 829259
Data volume 11.2 GB2.6 GB
Unique views 2,005364
Unique downloads 468161


Cite as