Feedback
2 results found
  1. Webis-Web-Archive-17

    • zenodo.org
    • webis.de
    Published Oct 4, 2017
  2. Webis-Clickbait-17

    • webis.de
    Published 2017
  3. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
Facebook
Twitter
Google+
Email
Click to copy link
Link copied
  • Dataset published   Oct 4, 2017
Dataset provided by
Martin Luther University of Halle-Wittenberghttp://www.uni-halle.de/
Leipzig Universityhttp://www.uni-leipzig.de/
Bauhaus University, Weimarhttp://www.uni-weimar.de/
Ulm University
Authors
Kiesel, Johannes; Potthast, Martin; Hagen, Matthias; Kneist, Florian; Stein, Benno
Available download formats from providers
zip
,
png
,
txt
Description

This dataset was created mid-2017 from 10,000 web pages that were carefully sampled from the Common Crawl to involve a mixture of high-ranking and low-ranking web pages. The process is described in detail in an upcoming publication.

Search
Clear search
Close search
Google apps
Main menu