Dataset Open Access

Webis-WebSeg-20

Kiesel, Johannes; Kneist, Florian; Meyer, Lars; Komlossy, Kristof; Stein, Benno; Potthast, Martin

The Webis-WebSeg-20 dataset comprises 42,450 crowdsourced segmentations for 8,490 web pages from the Webis-Web-Archive-17. Each page was segmented by five crowd workers, and the five segmentations of each page were then fused into a single segmentation. If you use this dataset in your research, please cite it using this paper.

Files (13.5 GB)
Name                                    Size       Checksum
README.txt                              2.7 kB     md5:e1c15c08939635ef26bb9694f31d7d12
webis-webseg-20-000000.zip              5.8 MB     md5:16d1b7e858bddb9d6629f99745c9dd56
webis-webseg-20-annotations.zip         25.6 MB    md5:a06202b70dd114f0addd38a0485d6163
webis-webseg-20-dom-and-nodes.zip       380.9 MB   md5:123b15e7d1eb6f77ffa3893fe926d7ec
webis-webseg-20-ground-truth.zip        7.8 MB     md5:c2105bd4444502fced4b004d4d148673
webis-webseg-20-screenshots-edges.zip   1.1 GB     md5:119c60a663ee4384f5ca1c6bdafe4a1d
webis-webseg-20-screenshots.zip         12.0 GB    md5:868cda90bca002fad005b293b9939df9
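
Because the archives are large, it can be worth checking a download against the MD5 checksums in the file listing above before unpacking it. The following is a minimal sketch, not part of the dataset's own tooling: the function names `md5_of_file` and `verify_downloads` are hypothetical, and it assumes the files were saved under their original names in a single directory.

```python
import hashlib
from pathlib import Path

# Checksums copied from the file listing of this record.
EXPECTED_MD5 = {
    "README.txt": "e1c15c08939635ef26bb9694f31d7d12",
    "webis-webseg-20-000000.zip": "16d1b7e858bddb9d6629f99745c9dd56",
    "webis-webseg-20-annotations.zip": "a06202b70dd114f0addd38a0485d6163",
    "webis-webseg-20-dom-and-nodes.zip": "123b15e7d1eb6f77ffa3893fe926d7ec",
    "webis-webseg-20-ground-truth.zip": "c2105bd4444502fced4b004d4d148673",
    "webis-webseg-20-screenshots-edges.zip": "119c60a663ee4384f5ca1c6bdafe4a1d",
    "webis-webseg-20-screenshots.zip": "868cda90bca002fad005b293b9939df9",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks
    so that multi-gigabyte archives do not have to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory, expected=EXPECTED_MD5):
    """Map each expected file name to 'ok', 'mismatch', or 'missing'."""
    results = {}
    for name, checksum in expected.items():
        path = Path(directory) / name
        if not path.is_file():
            results[name] = "missing"
        elif md5_of_file(path) == checksum:
            results[name] = "ok"
        else:
            results[name] = "mismatch"
    return results
```

Calling `verify_downloads("path/to/downloads")` returns one status per file; a "mismatch" usually indicates an incomplete or corrupted download, so that file should be fetched again.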
                   All versions   This version
Views              760            664
Downloads          1,897          1,759
Data volume        4.0 TB         3.6 TB
Unique views       612            546
Unique downloads   565            523
