Dataset Open Access

Star Wars social network

Evelina Gabasova

Citation Style Language JSON Export

  "publisher": "Zenodo", 
  "DOI": "10.5281/zenodo.1411479", 
  "title": "Star Wars social network", 
  "issued": {
    "date-parts": [
  "abstract": "<p><strong>Star Wars social network</strong></p>\n\n<p>This dataset contains the social network of Star Wars characters extracted from movie scripts. In short, two characters are connected if they speak together within the same scene. The data contain characters and links from episodes I to VII.</p>\n\n<p>How the data were created is described in my blog posts:</p>\n\n<ul>\n\t<li><a href=\"\">The Star Wars social network</a></li>\n\t<li><a href=\"\">Star Wars social network: Force Awakens</a></li>\n</ul>\n\n<p>The associated code is available in the main Github repository <a href=\"\">evelinag/StarWars-social-network</a>.</p>\n\n<p>Contents of the files are the following:</p>\n\n<ul>\n\t<li>\n\t<p><code>starwars-episode-N-interactions.json</code> contains the social network extracted from Episode N, where the links between characters are defined by the times the characters speak within the same scene.</p>\n\t</li>\n\t<li>\n\t<p><code>starwars-episode-N-mentions.json</code> contains the social network extracted from Episode N, where the links between characters are defined by the times the characters are mentioned within the same scene.</p>\n\t</li>\n\t<li>\n\t<p><code>starwars-episode-N-interactions-allCharacters.json</code> is the <code>interactions</code> network with R2-D2 and Chewbacca added in using data from <code>mentions</code> network.</p>\n\t</li>\n\t<li>\n\t<p><code>starwars-full-...</code> contain the corresponding social networks for the whole set of 6 episodes.</p>\n\t</li>\n</ul>\n\n<p><strong>Description of networks</strong></p>\n\n<p>The json files representing the networks contain the following information:</p>\n\n<p><strong>Nodes</strong></p>\n\n<p>The nodes contain the following fields:</p>\n\n<ul>\n\t<li>name: Name of the character</li>\n\t<li>value: Number of scenes the character appeared in</li>\n\t<li>colour: Colour in the visualization</li>\n</ul>\n\n<p><strong>Links</strong></p>\n\n<p>Links represent connections between characters. The link information corresponds to:</p>\n\n<ul>\n\t<li>source: zero-based index of the character that is one end of the link, the order of nodes is the order in which they are listed in the &ldquo;nodes&rdquo; element</li>\n\t<li>target: zero-based index of the character that is the the other end of the link.</li>\n\t<li>value: Number of scenes where the &ldquo;source character&rdquo; and &ldquo;target character&rdquo; of the link appeared together. Please not that the network is <em>undirected</em>. Which character represents the source and the target is arbitrary, they correspond only to two ends of the link.</li>\n</ul>", 
  "author": [
      "family": "Evelina Gabasova"
  "version": "1.0.1", 
  "type": "dataset", 
  "id": "1411479"
All versions This version
Views 4,0824,082
Downloads 485485
Data volume 20.2 MB20.2 MB
Unique views 3,6893,689
Unique downloads 455455


Cite as