Published April 23, 2021 | Version v1
Dataset Open

Discovery and classification of Twitter bots

Description

Online Social Networks (OSN) are used by millions of users, daily. This user-base shares and discovers different opinions on popular topics. 
Social influence of large groups may be influenced by user believes or be attracted the interest in particular news or products. A large number of users, gathered in a single group or number of followers, increases the probability to influence OSN users. 
Botnets, collections of automated accounts controlled by a single agent, are a common mechanism for exerting
maximum influence. Botnets may be used to better infiltrate the social graph over time and create an illusion of community
behaviour, amplifying their message and increasing persuasion.

This paper investigates Twitter botnets, their behavior, their interaction with user communities and their evolution over time.
We analyze a dense crawl of a subset of Twitter traffic, amounting to nearly all interactions by Greek-speaking Twitter users for a period
of 36 months. 

The collected users are labeled as botnets, based on long term and frequent content similarity events. We detect over a million events, where seemingly unrelated accounts tweeted nearly identical content, at almost the same time.  We filter these concurrent content injection events and detect a set of 1,850 accounts that repeatedly exhibit this pattern of behavior, suggesting that they are fully or in part controlled and orchestrated by the same entity.  We find botnets that appear for brief intervals and disappear, as well as botnets that evolve and grow, spanning the duration of our dataset.  We analyze statistical differences between the bot accounts and human users, as well as the botnet interactions with the user communities and the  Twitter trending topics.

Files

bot_users.txt

Files (5.7 MB)

Name Size Download all
md5:16e07cc9cffd8575f648374951e34a99
22.7 kB Preview Download
md5:91d88a37ef721a9ae61a4e632c667120
2.1 MB Download
md5:9705bb7cc717d64d8649af19980c70df
2.1 MB Download
md5:23a263edb57324366662488572f19a9a
1.5 MB Download

Additional details

Related works

Is supplement to
Other: arXiv:2010.15393 (arXiv)

Funding

CONCORDIA – Cyber security cOmpeteNCe fOr Research anD InnovAtion 830927
European Commission