Published December 17, 2021 | Version v1
Dataset Open

Co-occurrences of trending keywords in popular tech media (01.2016-04.2021)

  • 1. University of Warsaw

Description

Sources with weights

  • Euractiv 5%
  • The Conversation 5%
  • Politico Europe 5 %
  • IEEE Spectrum 5 %
  • Techforge 5%
  • Fastcompany 5%
  • The Guardian (Tech) 12%
  • Arstechnica 5%
  • Reuters 5%
  • Gizmodo 9%
  • ZDNet 9%
  • The Register 12%
  • The Verge 9%
  • TechCrunch 9%

Methodology

  • Exploring the relationship between topics
  • Pairs of terms which are mentioned together in media articles
  • Most trending social issues have been selected (e.g. 'metoo', 'gdpr')
  • The co-occurrence analysis is calculated for pairs consisting of emerging social issues and trending uni/bigrams
  • The number of times the terms appear in articles together with a social issue is divided by the number of times the social issue is mentioned across all articles
  • A single index is constructed for all word pairs by weighted average (taking into account the prevalence of the given source)

Files

cooccount_weighted_newall_1129.csv

Files (27.7 MB)

Name Size Download all
md5:5f63a061f8df499142ee4f24466d999b
19.7 MB Preview Download
md5:a0dbaa5cd32c0bb901ef452ea1613730
8.0 MB Preview Download

Additional details

Funding

European Commission
NGI FORWARD – NGI FORWARD 825652