Feedback
6 results found
  1. Webis-Web-Archive-17

    • webis.de
    • temir.org
    • +1more
    Published 2017
  2. Webis-Web-Archive-17 Content Error Annotations

    • zenodo.org
    Published Apr 15, 2019
  3. Webis-Web-Archive-17 Content Error Annotations

    • zenodo.org
    Published Mar 22, 2019
  4. Webis-Web-Archive-17 Content Error Annotations

    • zenodo.org
    Published Jan 25, 2019
  5. Webis-Clickbait-17

    • webis.de
    • temir.org
    Published 2017
  6. Webis Clickbait Corpus 2017 (Webis-Clickbait-17)

    • zenodo.org
    Published Jun 11, 2018
  7. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
Facebook
Twitter
Email
Click to copy link
Link copied

Webis-Web-Archive-17

  • Dataset published 2017
Dataset provided by
Bauhaus University, Weimarhttp://www.uni-weimar.de/
The Web Technology & Information Systems Network
Authors
Stein, Benno; Hagen, Matthias; Kiesel, Johannes; Potthast, Martin; Kneist, Florian
Description

The Webis-Web-Archive-17 comprises a total of 10,000 web page archives from mid-2017. The original Webis-Web-Archive-17 dataset contains the web archive files, HTML DOM, and screenshots of each web page, as well as annotations per web page on how well the web page can be reduced from the archive. Later on, the dataset was extended with annotations of content errors.

Search
Clear search
Close search
Google apps
Main menu