12 results found
  1. Webis-CPC-11

    • webis.de
    • temir.org
    Published 2011
  2. Webis Crowd Paraphrase Corpus 2011 (Webis-CPC-11)

    • zenodo.org
    Published Jun 1, 2013
  3. Kenya CT-OVC Program Evaluation

    • dataverse.unc.edu
    • datasearch.gesis.org
    Updated Oct 13, 2017
  4. Lesotho Child Grant Programme (CGP) Evaluation

    • dataverse.unc.edu
    Updated Jun 21, 2018
  5. National Longitudinal Study of Adolescent Health (Add Health), 1994-2008:...

    • www.icpsr.umich.edu
  6. Contract History

    • open.canada.ca
    Updated Jun 18, 2019
  7. Data from: XtalOpt version r11: An open–source evolutionary algorithm for...

    • www.narcis.nl
    Published Nov 6, 2017
  8. 2017 Conservative Party of Canada Leadership

    • www.kaggle.com
    Updated May 28, 2017
  9. Number of newsletters per week

    • www.getresponse.com
  10. Standardised Precipitation-Evapotranspiration Index

    • data.world
    Updated May 16, 2019
  11. X283-Y24 of North American Land Data Assimilation System (NLDAS) NASA...

    • datadiscoverystudio.org
    Published Jan 1, 2017
  12. X268-Y32 of North American Land Data Assimilation System (NLDAS) NASA...

    • datadiscoverystudio.org
    Published Jan 1, 2017

Webis-CPC-11

14 scholarly articles cite this dataset
  • Dataset published 2011
Dataset provided by
Bauhaus University, Weimar (http://www.uni-weimar.de/)
The Web Technology & Information Systems Network
Authors
Stein, Benno; Burrows, Steven; Potthast, Martin
License

Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Available download formats from providers
txt
Description

The Webis Crowd Paraphrase Corpus 2011 (Webis-CPC-11) contains 7,859 candidate paraphrases obtained through crowdsourcing on Mechanical Turk. The corpus consists of 4,067 accepted paraphrases, 3,792 rejected non-paraphrases, and the original texts. These samples formed part of the PAN 2010 international plagiarism detection competition but were not previously available separately from the rest of the competition data.
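
The counts in the description can be cross-checked with a few lines of Python. This is only a minimal sketch: the numbers are taken from the description, while the names `TOTAL_CANDIDATES`, `ACCEPTED`, `REJECTED`, and `class_shares` are hypothetical and not part of the corpus or any tooling shipped with it.

```python
# Counts as stated in the dataset description (not read from the corpus files).
TOTAL_CANDIDATES = 7859
ACCEPTED = 4067  # accepted paraphrases
REJECTED = 3792  # rejected non-paraphrases


def class_shares(accepted, rejected):
    """Return the fraction of accepted and rejected samples (hypothetical helper)."""
    total = accepted + rejected
    return accepted / total, rejected / total


# The two classes should add up to the stated number of candidates.
assert ACCEPTED + REJECTED == TOTAL_CANDIDATES

accepted_share, rejected_share = class_shares(ACCEPTED, REJECTED)
print(f"accepted: {accepted_share:.1%}, rejected: {rejected_share:.1%}")
```

The two classes are close to balanced, with slightly more accepted than rejected samples.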
