Published March 15, 2015 | Version v1
Dataset Open

Interactive Tagging Networks (Following/Followers and Tags on 1 million Twitter Users)

  • 1. Toyohashi University of Technology
  • 2. University of Tsukuba

Description

Abstract (our paper)

How do users behave if they can tag each other in social networks? In this paper, we answer this question by studying the interactive tagging network constructed by Twitter lists. Twitter lists can be regarded as the tagging process; a user (i.e., tagger) creates a list with a name (i.e., tag) and adds other users (i.e., tagged users) into the list. This tagging network is by nature different from the resource tagging networks (e.g., Flickr and Delicious) because users on this network can tag each other. We address the following research questions: (RQ1) What is the common patterns and the difference between the interactive tagging network and the resource tagging networks? (RQ2) Do users tag each other on the interactive tagging network? And if so, to what extent? (RQ3) What is the difference between the two types of relationships on Twitter: who-tags-whom and who-follows-whom? By quantitatively studying million-scale networks, we found the pervasive patterns across the different tagging networks, and the interactive patterns within the interactive tagging network. This study sheds light on the underlying characteristics of the interactive tagging network, which is relevant to the social scientists and the system designers of the tagging systems.

Data

twitter.seed.users:
The first column is the user id, and the second column is the json of the user objects on Twitter. This is the set of 1 million seed users to collect the following data.

twitter.tagging.network:
The first column is the source user id (from user id), the second column is the destination user id (to user id), the third column is the tag (i.e. slug or list name), and the fourth column is the list id.

twitter.tagging-out-going-from-seed-users.network:
The first column is the source user id (from user id), the second column is the destination user id (to user id), the third column is the tag (i.e. slug or list name), and the fourth column is the list id. This is only the out-going edges from the seed users, i.e., this is a subset of twitter.tagging.network.

twitter.following.network:
The first column is the source user id (from user id), and the second column is the destination user id (to user id).

twitter.following-closed-seed-users.network:
The first column is the source user id (from user id), and the second column is the destination user id (to user id). This is not used in the following publication paper, but will be useful in other studies.

Publication

This data set was created for our study. If you make use of this data set, please cite:
Yuto Yamaguchi, Mitsuo Yoshida, Christos Faloutsos, Hiroyuki Kitagawa. Patterns in Interactive Tagging Networks. Proceedings of the Ninth International AAAI Conference on Web and Social Media (ICWSM-15). pp.513-522, 2015.
http://www.aaai.org/ocs/index.php/ICWSM/ICWSM15/paper/view/10556

Code

Our code outputting experiment results made available at:
https://github.com/yamaguchiyuto/icwsm15

Files

Files (2.2 GB)

Name Size Download all
md5:af6d56ae60cec854e37c50042e8a7474
191.2 MB Download
md5:de3a222d36606bf323923cdad00b7b8e
1.5 GB Download
md5:7faea7936246ae0d48bdb505967b7830
221.8 MB Download
md5:1126d83e1d469e335d79c57b06f5a605
75.1 MB Download
md5:85ca0d48011c40c02066faf75937eef8
196.3 MB Download

Additional details

References

  • Yuto Yamaguchi, Mitsuo Yoshida, Christos Faloutsos, Hiroyuki Kitagawa. Patterns in Interactive Tagging Networks. Proceedings of the Ninth International AAAI Conference on Web and Social Media (ICWSM-15). pp.513-522, 2015.