Dataset Open Access
Yoshida, Mitsuo; Yamaguchi, Yuto
Abstract (our paper)
Why does Smith follow Johnson on Twitter? In most cases, the reason why users follow other users is unavailable. In this work, we answer this question by proposing TagF, which analyzes the who-follows-whom network (matrix) and the who-tags-whom network (tensor) simultaneously. Concretely, our method decomposes a coupled tensor constructed from these matrix and tensor. The experimental results on million-scale Twitter networks show that TagF uncovers different, but explainable reasons why users follow other users.
The first column is the source user id (from user id), the second column is the destination user id (to user id), and the third column is the tag id.
The first column is the user id for coupled_tensor, and the second column is the user id on Twitter.
The first column is the tag id for coupled_tensor, and the second column is the tag (i.e. slug or list name) on Twitter. On the tags, ###follow### and ###friend### are special tags expressing follower and following.
This dataset was created for our study. If you make use of this dataset, please cite:
Yuto Yamaguchi, Mitsuo Yoshida, Christos Faloutsos, Hiroyuki Kitagawa. Why Do You Follow Him? Multilinear Analysis on Twitter. The 24th International World Wide Web Conference (WWW2015). Florence, Italy, May 18-22, pp.137-138, 2015 (poster).
Our code outputting experiment results made available at:
If you would like to use larger dataset, the dataset on 1 million seed users made available at:
(The dataset on 0.1 million seed users is not subset of the dataset on 1 million seed users.)