Dataset Collected by JSObserver
Description
This is a sampled dataset collected by JSObserver on the Alexa top 100K websites. We analyze the log files to identify JavaScript global identifier conflicts, i.e., variable value conflicts, variable type conflicts, and function definition conflicts.
We release the log files for the websites on which we detected these conflicts, and split the whole dataset into 10 subsets, 1-50K-0.zip through 50K-100K-4.zip.
Writes to memory locations in JavaScript are saved in [rank].[main/sub].[frame_cnt].asg (e.g., 1.main.0.asg) files.
JavaScript global function definitions are saved in [rank].[main/sub].[frame_cnt].func (e.g., 1.main.0.func) files.
The maps from script IDs to script URLs are saved in [rank].[main/sub].[frame_cnt].id2url (e.g., 1.main.0.id2url) files.
The source code of each script is saved in a [rank].[main/sub].[frame_cnt].[script_ID].script (e.g., 1.main.0.17.script) file.
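The naming scheme described above can be split into its fields with a small parser. The following is a minimal sketch, not part of the JSObserver tooling: the names `LogFile` and `parse_log_name` are hypothetical, and it assumes that the rank, frame count, and script ID are always decimal integers, as in the examples given.

```python
import re
from typing import NamedTuple, Optional

# Matches names such as "1.main.0.asg" or "1.main.0.17.script".
# Assumption: rank, frame_cnt and script_ID are decimal integers.
LOG_NAME = re.compile(
    r"^(?P<rank>\d+)\.(?P<frame>main|sub)\.(?P<frame_cnt>\d+)"
    r"(?:\.(?P<script_id>\d+)\.script|\.(?P<kind>asg|func|id2url))$"
)


class LogFile(NamedTuple):
    rank: int                 # website rank
    frame: str                # "main" or "sub"
    frame_cnt: int            # frame counter
    kind: str                 # "asg", "func", "id2url" or "script"
    script_id: Optional[int]  # only set for ".script" files


def parse_log_name(name: str) -> Optional[LogFile]:
    """Return the parsed fields, or None if the name does not fit the scheme."""
    m = LOG_NAME.match(name)
    if m is None:
        return None
    script_id = m.group("script_id")
    return LogFile(
        rank=int(m.group("rank")),
        frame=m.group("frame"),
        frame_cnt=int(m.group("frame_cnt")),
        kind=m.group("kind") or "script",
        script_id=int(script_id) if script_id is not None else None,
    )
```

Grouping the parsed entries by `(rank, frame, frame_cnt)` would then bring together the .asg, .func, .id2url, and .script files that belong to one frame of one website.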
We also sample 100 websites on which we did not detect any conflicts. The log files collected on those websites are available in sampled_no_conflict.zip.
Files
(37.5 GB)
MD5 | Size |
---|---|
99e7faf8ec80d127525992c6ac41c9cf | 4.3 GB |
6a2b33acbf0ed707a0cc0d64730840f6 | 4.2 GB |
ee16a98d3daf79ba6d92054d3c8d8c88 | 4.1 GB |
372dda67b46dffb7e62d8b23615549f6 | 3.9 GB |
0fbe684660c1716a82d03268673590dd | 3.9 GB |
ecbe23da65934db0e3782a1d2fd671ab | 3.4 GB |
eb692bbe514923c486f6fdfc8d7ecee4 | 3.2 GB |
24f69546b7f2d24325308990d0fdf1ae | 3.4 GB |
515f8e35fe4618baff3bc21257d3bc8c | 3.6 GB |
77850fb910c45f6f4413d93cb618e1b5 | 3.3 GB |
4271c131737cd193ec1c3ad8b8158ab5 | 92.0 MB |
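The MD5 checksums listed above can be used to check that a downloaded archive is intact. The following is a minimal sketch using Python's standard `hashlib` module. The helper names `md5_of_file` and `matches_checksum` are hypothetical, and the file is read in chunks because the archives are several gigabytes each.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        # iter() with a sentinel stops once read() returns empty bytes.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file against a checksum given as 'md5:<hex>' or plain hex."""
    expected = expected.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

For example, `matches_checksum(local_path, "md5:99e7faf8ec80d127525992c6ac41c9cf")` should return `True` when `local_path` points to an uncorrupted copy of the archive that carries that checksum. Here `local_path` is a placeholder for wherever the archive was saved.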