Dataset Open Access

Co-occurrences of trending keywords in popular tech media (01.2016-02.2020)

Kristóf Gyódi; Łukasz Nawaro; Michał Paliński

Sources with weights

        Ars Technica: 1/8
        Euractiv: 1/8
        Fast Company: 1/8
        The Register: 1/8
        TechCrunch: 1/8
        The Guardian: 1/8
        VentureBeat: 1/8
        The Verge: 1/8


  • The dataset explores the relationships between topics covered in popular tech media
  • It contains pairs of terms that are mentioned together in media articles
  • The most trending social issues and technologies were selected (e.g. 'gdpr', '5G')
  • Co-occurrence is calculated for pairs consisting of an emerging social issue and a trending unigram or bigram
  • For each pair, the number of times the term appears in articles together with the social issue is divided by the number of times the social issue is mentioned across all articles
  • A single index is constructed for each word pair as a weighted average across sources (the weights reflect the prevalence of the given source)
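
The calculation described above can be illustrated with a minimal Python sketch. It is not the authors' code: the function names, the representation of each article as a set of terms, and the choice to count each article at most once are assumptions made for illustration. The weights are the 1/8 values listed under "Sources with weights".

```python
# Illustrative sketch only; names and data layout are hypothetical.
SOURCES = [
    "Ars Technica", "Euractiv", "Fast Company", "The Register",
    "TechCrunch", "The Guardian", "VentureBeat", "The Verge",
]
# Equal weights of 1/8 per source, as listed in the record.
SOURCE_WEIGHTS = {name: 1 / 8 for name in SOURCES}


def cooccurrence_ratio(articles, issue, term):
    """Share of articles mentioning `issue` that also mention `term`.

    `articles` is a list of sets, each holding the unigrams/bigrams
    found in one article (assumption: one count per article).
    """
    issue_mentions = sum(1 for terms in articles if issue in terms)
    if issue_mentions == 0:
        return 0.0  # the issue never appears in this source
    together = sum(
        1 for terms in articles if issue in terms and term in terms
    )
    return together / issue_mentions


def weighted_index(articles_by_source, issue, term, weights=SOURCE_WEIGHTS):
    """Weighted average of the per-source ratios for one word pair.

    Sources absent from `articles_by_source` contribute nothing.
    """
    return sum(
        weights[source] * cooccurrence_ratio(articles, issue, term)
        for source, articles in articles_by_source.items()
    )
```

For example, if 'gdpr' is mentioned in two articles of one source and 'privacy' appears in one of them, the ratio for that source is 0.5; the index for the pair is then the weighted sum of such ratios over all sources.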

Files (35.9 MB)

