Published March 18, 2020 | Version 1.0
Dataset Open

Co-occurrences of trending keywords in popular tech media (01.2016-02.2020)

  • 1. University of Warsaw

Description

Sources with weights

        Arstechnica: 1/8,
        Euractiv: 1/8,
        Fastcompany: 1/8,
        The Register: 1/8,
        Techcrunch: 1/8,
        The Guardian: 1/8,
        Venturebeat: 1/8,
        The Verge: 1/8

Methodology

  • Exploring the relationship between topics
  • Pairs of terms which are mentioned together in media articles
  • Most trending social issues and technologies have been selected (e.g. 'gdpr', '5G')
  • The co-occurrence analysis is calculated for pairs consisting of emerging social issues and trending uni/bigrams
  • The number of times the terms appear in articles together with a social issue is divided by the number of times the social issue is mentioned across all articles
  • A single index is constructed for all word pairs by weighted average (taking into account the prevalence of the given source)

Files

cooc_weighted.csv

Files (35.9 MB)

Name Size Download all
md5:829b65470750054a713f9b62a67988dd
35.9 MB Preview Download

Additional details

Funding

NGI FORWARD – NGI FORWARD 825652
European Commission