Published January 17, 2020
| Version 1.0.0
Dataset
Open
Undirected Node Attributed Social Network Graph of Twitter Users interested in plastic pollution - created in the framework of the PlasticTwist project
Description
This dataset was created in the framework of the Plastic Twist project (Ptwist), specifically with the Ptwist crowdsourcing application (crowdsourcing.plastictwist.com/). We are sharing the edge list and specific node attributes (hashtags) of Twitter users posting about plastic pollution. The dataset can be used for community detection, clustering, node importance, influence maximization tasks, etc. Each user is represented by a unique integer that is unrelated to the official Twitter user ID. The dataset contains three (3) files:
- ptwist.edgelist: A list containing all the 1,362,863 edges between the users. When loaded they create an undirected graph of 800K+ users.
- node_attributes.txt: This file contains the hashtags used by each user (e.g. "652003": ["SingleUsePlastic"] means that user 652003 has used the hashtag SingleUsePlastic).
- annotated_graph: A pickle file which, when loaded, returns a NetworkX node attributed undirected graph.
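The file descriptions above suggest how the edge list and hashtag attributes might be combined. The following is a minimal sketch using only the Python standard library. It rests on assumptions not confirmed by the dataset description: that ptwist.edgelist holds one whitespace-separated pair of integer user IDs per line, and that node_attributes.txt is a JSON object mapping user IDs (as strings) to lists of hashtags. The sample data and the helper names `parse_edges` and `parse_hashtags` are hypothetical and used for illustration only.

```python
import json
from collections import defaultdict

# Tiny stand-ins for the real files. The layouts are assumptions:
# one "u v" pair per line for the edge list, and a JSON object of
# user ID -> list of hashtags for the node attributes.
SAMPLE_EDGES = ["1 2", "2 3", "3 1", "4 1"]
SAMPLE_ATTRIBUTES = '{"1": ["SingleUsePlastic"], "3": ["SingleUsePlastic", "Recycling"]}'


def parse_edges(lines):
    """Build an undirected adjacency map from 'u v' lines (hypothetical helper)."""
    adjacency = defaultdict(set)
    for line in lines:
        parts = line.split()
        if len(parts) < 2:
            continue  # skip blank or malformed lines
        u, v = int(parts[0]), int(parts[1])
        # Undirected graph: record the edge in both directions.
        adjacency[u].add(v)
        adjacency[v].add(u)
    return adjacency


def parse_hashtags(text):
    """Map integer user IDs to their hashtag lists (hypothetical helper)."""
    return {int(user): tags for user, tags in json.loads(text).items()}


adjacency = parse_edges(SAMPLE_EDGES)
hashtags = parse_hashtags(SAMPLE_ATTRIBUTES)

# Each undirected edge appears twice in the adjacency map (no self-loops here).
edge_count = sum(len(neighbors) for neighbors in adjacency.values()) // 2

# Users who have used a given hashtag, e.g. as seeds for community analysis.
users_with_tag = sorted(u for u, tags in hashtags.items() if "SingleUsePlastic" in tags)

print(len(adjacency), edge_count)
print(users_with_tag)
```

For the real files, the sample lists would be replaced by the lines of ptwist.edgelist and the contents of node_attributes.txt. If the actual layout differs from these assumptions, the parsing steps need to be adjusted accordingly. With NetworkX installed, `networkx.read_edgelist` is an alternative way to load the edge list.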
Files
Name | Size | Checksum |
---|---|---|
Ptwist_Dataset.zip | 41.2 MB | md5:28d3eb0498ae7178dc9be525d6bc98f1 |