Dataset Open Access
Yoshida, Mitsuo; Yamaguchi, Yuto
Abstract (our paper)
How do users behave if they can tag each other in social networks? In this paper, we answer this question by studying the interactive tagging network constructed from Twitter lists. Creating a Twitter list can be regarded as a tagging process: a user (i.e., tagger) creates a list with a name (i.e., tag) and adds other users (i.e., tagged users) to the list. This tagging network is by nature different from resource tagging networks (e.g., Flickr and Delicious) because users on this network can tag each other. We address the following research questions: (RQ1) What are the common patterns and the differences between the interactive tagging network and the resource tagging networks? (RQ2) Do users tag each other on the interactive tagging network? And if so, to what extent? (RQ3) What is the difference between the two types of relationships on Twitter: who-tags-whom and who-follows-whom? By quantitatively studying million-scale networks, we found pervasive patterns across the different tagging networks, and interactive patterns within the interactive tagging network. This study sheds light on the underlying characteristics of the interactive tagging network, which are relevant to social scientists and to designers of tagging systems.
twitter.seed.users.gz: The first column is the user id, and the second column is the JSON of the corresponding Twitter user object. These 1 million seed users were the starting point for collecting the data described below.
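The two-column layout described above could be read as in the following minimal sketch. The column delimiter is not stated on this page, so the tab separator is an assumption, and the function name `iter_seed_users` is hypothetical.

```python
import gzip
import json


def iter_seed_users(path, sep="\t"):
    """Yield (user_id, user_object) pairs from a gzip-compressed seed-user file.

    ASSUMPTION: columns are separated by a tab; pass another `sep` if not.
    """
    with gzip.open(path, "rt", encoding="utf-8") as f:
        for line in f:
            line = line.rstrip("\n")
            if not line:
                continue  # ignore blank lines
            # Split only once so the JSON column is kept in one piece.
            user_id, raw_json = line.split(sep, 1)
            yield user_id, json.loads(raw_json)
```

Iterating lazily avoids holding all user objects in memory at once; the user id is kept as a string so that no assumption is made about its numeric range.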
twitter.tagging.network.gz: The first column is the source user id (from user id), the second column is the destination user id (to user id), the third column is the tag (i.e., slug or list name), and the fourth column is the list id.
twitter.tagging-out-going-from-seed-users.network.gz: The first column is the source user id (from user id), the second column is the destination user id (to user id), the third column is the tag (i.e., slug or list name), and the fourth column is the list id. This file contains only the outgoing edges from the seed users, i.e., it is a subset of twitter.tagging.network.
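As an illustration of the four-column tagging format, the sketch below reads tagging edges and finds pairs of users who tagged each other, in the spirit of RQ2. It is not the authors' code: the tab delimiter is an assumption, and `TagEdge`, `iter_tag_edges`, and `mutual_tagging_pairs` are hypothetical names.

```python
import gzip
from collections import namedtuple

# One row of a tagging network file: who tagged whom, with which tag, via which list.
TagEdge = namedtuple("TagEdge", ["source", "target", "tag", "list_id"])


def iter_tag_edges(path, sep="\t"):
    """Yield TagEdge rows from a gzip-compressed tagging network file.

    ASSUMPTION: columns are tab-separated; rows without 4 fields are skipped.
    """
    with gzip.open(path, "rt", encoding="utf-8") as f:
        for line in f:
            fields = line.rstrip("\n").split(sep)
            if len(fields) != 4:
                continue
            yield TagEdge(*fields)


def mutual_tagging_pairs(edges):
    """Return the set of unordered user pairs (a, b) where a tagged b and b tagged a."""
    directed = {(e.source, e.target) for e in edges}
    return {
        tuple(sorted(pair))
        for pair in directed
        if pair[0] != pair[1] and (pair[1], pair[0]) in directed
    }
```

For example, `mutual_tagging_pairs(iter_tag_edges("twitter.tagging.network.gz"))` would collect reciprocal pairs from a local copy of the file. The set of directed pairs is held in memory, which may be heavy for the full network.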
The first column is the source user id (from user id), and the second column is the destination user id (to user id).
The first column is the source user id (from user id), and the second column is the destination user id (to user id). This file is not used in the publication cited below, but may be useful in other studies.
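The two-column following networks are distributed in both gzip and xz form, so a reader may want one helper that handles either. The sketch below picks the decompressor from the file extension and counts how many accounts each source user follows. The tab delimiter is an assumption, and `open_text` and `out_degrees` are hypothetical names.

```python
import gzip
import lzma
from collections import Counter


def open_text(path):
    """Open a .gz, .xz, or plain file as UTF-8 text, chosen by file extension."""
    if path.endswith(".xz"):
        return lzma.open(path, "rt", encoding="utf-8")
    if path.endswith(".gz"):
        return gzip.open(path, "rt", encoding="utf-8")
    return open(path, "rt", encoding="utf-8")


def out_degrees(path, sep="\t"):
    """Count outgoing edges per source user id in an edge-list file.

    ASSUMPTION: columns are tab-separated; the first column is the source user.
    """
    counts = Counter()
    with open_text(path) as f:
        for line in f:
            fields = line.rstrip("\n").split(sep)
            if len(fields) < 2:
                continue  # skip blank or malformed rows
            counts[fields[0]] += 1
    return counts
```

Because the file is streamed line by line, only the per-user counters are kept in memory, not the edges themselves.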
This data set was created for our study. If you make use of this data set, please cite:
Yuto Yamaguchi, Mitsuo Yoshida, Christos Faloutsos, Hiroyuki Kitagawa. Patterns in Interactive Tagging Networks. Proceedings of the Ninth International AAAI Conference on Web and Social Media (ICWSM-15). pp.513-522, 2015.
The code that produces our experimental results is available at:
| File | MD5 | Size |
|---|---|---|
| twitter.following-closed-seed-users.network.gz | af6d56ae60cec854e37c50042e8a7474 | 191.2 MB |
| twitter.following.network.xz | de3a222d36606bf323923cdad00b7b8e | 1.5 GB |
| twitter.seed.users.gz | 7faea7936246ae0d48bdb505967b7830 | 221.8 MB |
| twitter.tagging-out-going-from-seed-users.network.gz | 1126d83e1d469e335d79c57b06f5a605 | 75.1 MB |
| twitter.tagging.network.gz | 85ca0d48011c40c02066faf75937eef8 | 196.3 MB |
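The MD5 checksums listed with the files can be used to check that a download is intact. A minimal sketch using Python's standard `hashlib` follows; the checksum values are copied from the file list, while the helper names `md5_of_file` and `verify` are hypothetical, and the files are assumed to sit in the current directory under their original names.

```python
import hashlib

# Checksums as published in the file list.
EXPECTED_MD5 = {
    "twitter.following-closed-seed-users.network.gz": "af6d56ae60cec854e37c50042e8a7474",
    "twitter.following.network.xz": "de3a222d36606bf323923cdad00b7b8e",
    "twitter.seed.users.gz": "7faea7936246ae0d48bdb505967b7830",
    "twitter.tagging-out-going-from-seed-users.network.gz": "1126d83e1d469e335d79c57b06f5a605",
    "twitter.tagging.network.gz": "85ca0d48011c40c02066faf75937eef8",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected):
    """Return True if the file's MD5 digest matches the expected hex string."""
    return md5_of_file(path) == expected.lower()
```

A download could then be checked with `verify(name, EXPECTED_MD5[name])` for each file name in the dictionary.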