Dataset Open Access

Webis-WebSeg-20

Kiesel, Johannes; Kneist, Florian; Meyer, Lars; Komlossy, Kristof; Stein, Benno; Potthast, Martin

The Webis-WebSeg-20 dataset comprises 42,450 crowdsourced segmentations for 8,490 web pages from the Webis-Web-Archive-17. Segmentations were fused from the segmentations of five crowd workers each. If you use this dataset in your research, please cite it using this paper.

Files (13.5 GB)
Name Size
README.txt
md5:e1c15c08939635ef26bb9694f31d7d12
2.7 kB Download
webis-webseg-20-000000.zip
md5:16d1b7e858bddb9d6629f99745c9dd56
5.8 MB Download
webis-webseg-20-annotations.zip
md5:a06202b70dd114f0addd38a0485d6163
25.6 MB Download
webis-webseg-20-dom-and-nodes.zip
md5:123b15e7d1eb6f77ffa3893fe926d7ec
380.9 MB Download
webis-webseg-20-ground-truth.zip
md5:c2105bd4444502fced4b004d4d148673
7.8 MB Download
webis-webseg-20-screenshots-edges.zip
md5:119c60a663ee4384f5ca1c6bdafe4a1d
1.1 GB Download
webis-webseg-20-screenshots.zip
md5:868cda90bca002fad005b293b9939df9
12.0 GB Download
393
1,237
views
downloads
All versions This version
Views 393320
Downloads 1,2371,108
Data volume 1.7 TB1.3 TB
Unique views 314262
Unique downloads 246211

Share

Cite as