  1. Wikipedia Text Reuse Corpus

    • webis.de
    • temir.org
    Published 2018
Wikipedia Text Reuse Corpus

  • Dataset published 2018
Dataset provided by
Martin-Luther-University Halle-Wittenberg (http://www.uni-halle.de/)
Leipzig University (http://www.uni-leipzig.de/)
Bauhaus University, Weimar (http://www.uni-weimar.de/)
University of Paderborn (http://www.uni-paderborn.de/)
The Web Technology & Information Systems Network
Authors
Hagen, Matthias; Alshomary, Milad; Wachsmuth, Henning; Potthast, Martin; Stein, Benno; Völske, Michael
Description

A corpus of text reuse cases extracted from within Wikipedia and between Wikipedia and a sample of Common Crawl.