Dataset Open Access

All Your Script Are Belong to Us: Collecting and Analyzing JavaScript Code from 10K Sites for 9 Months

Dimitris Mitropoulos; Panos Louridas; Vitalis Salis; Diomidis Spinellis


Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
  <dc:creator>Dimitris Mitropoulos</dc:creator>
  <dc:creator>Panos Louridas</dc:creator>
  <dc:creator>Vitalis Salis</dc:creator>
  <dc:creator>Diomidis Spinellis</dc:creator>
  <dc:date>2019-03-14</dc:date>
  <dc:description>We present a massive dataset (~2 TB) of client-side JavaScript code. Specifically, we have collected and stored on a daily basis JavaScript code from Alexa's Top 10000 web sites (~7.5 GB per day) for nine consecutive months. Our collection involved both inline scripts extracted from each web site's main page and external scripts linked from it. To help researchers identify similar scripts and examine their popularity and evolution, we have produced hashes that represent the scripts' logical structure. Furthermore, we have analyzed the resulting dataset with well-established static analysis tools, generating additional metadata including reports of quality bugs and vulnerable libraries.</dc:description>
  <dc:identifier>https://zenodo.org/record/2593266</dc:identifier>
  <dc:identifier>10.5281/zenodo.2593266</dc:identifier>
  <dc:identifier>oai:zenodo.org:2593266</dc:identifier>
  <dc:relation>doi:10.5281/zenodo.2593265</dc:relation>
  <dc:rights>info:eu-repo/semantics/openAccess</dc:rights>
  <dc:rights>https://creativecommons.org/licenses/by/4.0/legalcode</dc:rights>
  <dc:title>All Your Script Are Belong to Us: Collecting and Analyzing JavaScript Code from 10K Sites for 9 Months</dc:title>
  <dc:type>info:eu-repo/semantics/other</dc:type>
  <dc:type>dataset</dc:type>
</oai_dc:dc>
                  All versions   This version
Views             197            197
Downloads         185            185
Data volume       5.9 TB         5.9 TB
Unique views      174            174
Unique downloads  86             86
