Published June 3, 2020 | Version 1.0
Dataset Open

Dataset Collected by JSObserver

  • 1. Chinese University of Hong Kong

Description

This is a sampled dataset collected by JSObserver on Alexa top 100K websites. We analyze the log files to identify JavaScript global identifier conflicts, i.e., variable value conflicts, variable type conflicts and function definition conflicts.

We release the log files on websites where we detect the above conflicts, and split the whole dataset into 10 subsets, i.e., 1-50K-0.zip ~ 50K-100K-4.zip.

 

The writes to a memory location in JavaScript are saved in [rank].[main/sub].[frame_cnt].asg (e.g., 1.main.0.asg) files. 

JavaScript global function definitions are saved in [rank].[main/sub].[frame_cnt].func (e.g., 1.main.0.func) files.

The maps from script IDs to script URLs are saved in [rank].[main/sub].[frame_cnt].id2url (e.g., 1.main.0.id2url) files.

The source code of scripts are saved in [rank].[main/sub].[frame_cnt].[script_ID].script (e.g., 1.main.0.17.script) files.

 

We also sample 100 websites on which we did not detect any conflicts. The log files collected on those websites are available in sampled_no_conflict.zip

Files

1-50K-0.zip

Files (37.5 GB)

Name Size Download all
md5:99e7faf8ec80d127525992c6ac41c9cf
4.3 GB Preview Download
md5:6a2b33acbf0ed707a0cc0d64730840f6
4.2 GB Preview Download
md5:ee16a98d3daf79ba6d92054d3c8d8c88
4.1 GB Preview Download
md5:372dda67b46dffb7e62d8b23615549f6
3.9 GB Preview Download
md5:0fbe684660c1716a82d03268673590dd
3.9 GB Preview Download
md5:ecbe23da65934db0e3782a1d2fd671ab
3.4 GB Preview Download
md5:eb692bbe514923c486f6fdfc8d7ecee4
3.2 GB Preview Download
md5:24f69546b7f2d24325308990d0fdf1ae
3.4 GB Preview Download
md5:515f8e35fe4618baff3bc21257d3bc8c
3.6 GB Preview Download
md5:77850fb910c45f6f4413d93cb618e1b5
3.3 GB Preview Download
md5:4271c131737cd193ec1c3ad8b8158ab5
92.0 MB Preview Download