Dataset Open Access
Star Wars social network
This dataset contains the social network of Star Wars characters extracted from movie scripts. In short, two characters are connected if they speak together within the same scene. The data contain characters and links from episodes I to VII.
How the data were created is described in my blog posts:
The associated code is available in the main Github repository evelinag/StarWars-social-network.
Contents of the files are the following:
starwars-episode-N-interactions.json contains the social network extracted from Episode N, where the links between characters are defined by the times the characters speak within the same scene.
starwars-episode-N-mentions.json contains the social network extracted from Episode N, where the links between characters are defined by the times the characters are mentioned within the same scene.
starwars-episode-N-interactions-allCharacters.json is the
interactions network with R2-D2 and Chewbacca added in using data from
starwars-full-... contain the corresponding social networks for the whole set of 6 episodes.
Description of networks
The json files representing the networks contain the following information:
The nodes contain the following fields:
Links represent connections between characters. The link information corresponds to:
|All versions||This version|
|Data volume||19.1 MB||19.1 MB|