Webis-Web-Errors-19
Synopsis
Webis-Web-Errors-19 comprises annotations for the 10,000 web page archives of the Webis-Web-Archive-17. Four annotations state whether a page is (1) mostly advertisement, (2) cut off, (3) still loading, or (4) pornographic. Three more rate, on a three-level scale (not / a bit / very), how much the page shows (5) pop-ups, (6) CAPTCHAs, or (7) error messages.
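As a rough illustration of the annotation scheme, the seven annotations for one page could be modeled as in the following Python sketch. The class names, the field names, the `page_id` field, and the `flagged` helper are hypothetical and chosen only for this example; the dataset's actual file layout is not described here and may differ.

```python
from dataclasses import dataclass, fields
from enum import Enum
from typing import List


class Degree(Enum):
    """Three-level scale used for annotations (5) to (7)."""
    NOT = "not"
    A_BIT = "a bit"
    VERY = "very"


@dataclass(frozen=True)
class PageAnnotation:
    # Hypothetical field names; the real dataset files may use others.
    page_id: str
    mostly_advertisement: bool   # (1)
    cut_off: bool                # (2)
    still_loading: bool          # (3)
    pornographic: bool           # (4)
    pop_ups: Degree              # (5)
    captchas: Degree             # (6)
    error_messages: Degree       # (7)

    def flagged(self) -> List[str]:
        """Names of annotations that are set (True, or rated above 'not')."""
        names = []
        for f in fields(self):
            value = getattr(self, f.name)
            is_set_flag = value is True
            is_rated = isinstance(value, Degree) and value is not Degree.NOT
            if is_set_flag or is_rated:
                names.append(f.name)
        return names


# Made-up example values, not taken from the dataset.
example = PageAnnotation(
    page_id="example-page",
    mostly_advertisement=False,
    cut_off=True,
    still_loading=False,
    pornographic=False,
    pop_ups=Degree.A_BIT,
    captchas=Degree.NOT,
    error_messages=Degree.NOT,
)
print(example.flagged())
```

Keeping the binary flags and the graded ratings as separate types mirrors the distinction the synopsis draws between annotations (1) to (4) and (5) to (7).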
Access
To cite the dataset, please refer to this publication. To link to the dataset, please use the dataset permalink [doi].
People
- Johannes Kiesel
- Martin Potthast
- Matthias Hagen
- Benno Stein
- Florian Kneist
Publications