Published April 15, 2019
| Version 2019-01-25
Dataset
Open
Webis-Web-Errors-19
- 1. Bauhaus-Universität Weimar
- 2. Leipzig University
Description
The Webis-Web-Errors-19 comprises various annotations for the 10,000 web page archives of the Webis-Web-Archive-17. The annotations are whether the page is (1) mostly advertisement, (2) cut off, (3) still loading, (4) pornographic; and whether it shows (not/a bit/ very) (5) pop-ups, (6) CAPTCHAs, or (7) error messages. If you use this dataset in your research, please cite it using this paper.
Files
annotation-interface.png
Additional details
Related works
- Is documented by
- Conference paper: https://webis.de/publications.html#filter:A+Dataset+for+Content+Error+Detection+in+Web+Archives (URL)
- Is supplement to
- Dataset: 10.5281/zenodo.1002203 (DOI)