Dataset Open Access
Star Wars social network
This dataset contains the social network of Star Wars characters extracted from movie scripts. In short, two characters are connected if they speak together within the same scene. The data contain characters and links from episodes I to VII.
How the data were created is described in my blog posts:
The associated code is available in the main Github repository evelinag/StarWars-social-network.
Contents of the files are the following:
starwars-episode-N-interactions.json
contains the social network extracted from Episode N, where the links between characters are defined by the times the characters speak within the same scene.
starwars-episode-N-mentions.json
contains the social network extracted from Episode N, where the links between characters are defined by the times the characters are mentioned within the same scene.
starwars-episode-N-interactions-allCharacters.json
is the interactions
network with R2-D2 and Chewbacca added in using data from mentions
network.
starwars-full-...
contain the corresponding social networks for the whole set of 6 episodes.
Description of networks
The json files representing the networks contain the following information:
Nodes
The nodes contain the following fields:
Links
Links represent connections between characters. The link information corresponds to:
Name | Size | |
---|---|---|
evelinag/star-wars-network-data-1.0.1.zip
md5:1af1ddbb75c4bca337e4ef0eabd51118 |
41.6 kB | Download |
All versions | This version | |
---|---|---|
Views | 3,889 | 3,889 |
Downloads | 459 | 459 |
Data volume | 19.1 MB | 19.1 MB |
Unique views | 3,512 | 3,512 |
Unique downloads | 431 | 431 |