Published July 16, 2022 | Version 1.1
Dataset Open

Data and model for detecting spam activity on academic articles

  • 1. Northern Illinois University
  • 2. University of South Carolina

Description

Because social media reaches the public instantly, it has become integral to sharing scholarly articles and gauging public response to them. This paper analyzes how Twitter bots interact with scholarly articles on the platform. Spamming by bots can steer the conversation and create a false impression of public interest in a piece of research, which in turn can affect policies that shape people's lives. We determine whether bots are disseminating a given scholarly article by analyzing the relationship between Twitter bot activity and several research factors, and we develop and test several supervised machine-learning classification models for this task. Our analysis also shows that scholarly articles in health and human science are more prone to bot activity than those in other research areas.

Files

anonymized_altmetric_bot_data.csv

Files (211.3 MB)

Checksum                                Size
md5:e9d644e3a43301aced7178d74c008060    210.9 MB
md5:23e06c349c45003cb4e846492eb312b5    425.9 kB
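
The MD5 checksums listed for the two files can be used to confirm that a download completed without corruption. The following is a minimal sketch, not part of the original record: the helper names md5_of_file and matches_listed_md5 and the LISTED_MD5 dictionary are hypothetical, and the entries are keyed by the listed file size because the listing does not show which filename belongs to which checksum.

```python
import hashlib

# Checksums copied from the file listing of this record.
# Keys are the listed sizes (an assumption made for illustration),
# since the listing does not pair checksums with filenames.
LISTED_MD5 = {
    "210.9 MB": "e9d644e3a43301aced7178d74c008060",
    "425.9 kB": "23e06c349c45003cb4e846492eb312b5",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks
    so that a file of a few hundred megabytes is never held in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_md5(path, expected_md5):
    """Compare a local file's MD5 digest with an expected hex string
    (the comparison ignores letter case in the expected value)."""
    return md5_of_file(path) == expected_md5.lower()
```

For example, if the larger file was saved locally as anonymized_altmetric_bot_data.csv (assuming that name corresponds to the 210.9 MB entry), calling matches_listed_md5("anonymized_altmetric_bot_data.csv", LISTED_MD5["210.9 MB"]) would return True only when the local copy matches the listed checksum.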

Additional details

Related works

Is supplement to
Journal article: 10.1108/AJIM-01-2024-0050 (DOI)