1 dataset found

W
Webis-CPC-11
webis.de
anthology.aicmu.ac.cn
3251771
Updated 2011
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Steven Burrows; Martin Potthast; Benno Stein (2011). Webis-CPC-11 [Dataset]. http://doi.org/10.5281/zenodo.3251771
Explore at:
3251771Available download formats
Unique identifier
https://doi.org/10.5281/zenodo.3251771
Dataset updated
2011
Dataset provided by
Bauhaus-Universität Weimar
Computer Power Institute, Melbourne, Australia
The Web Technology & Information Systems Network
University of Kassel, hessian.AI, and ScaDS.AI
Authors
Steven Burrows; Martin Potthast; Benno Stein
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The Webis Crowd Paraphrase Corpus 2011 (Webis-CPC-11) contains 7,859 candidate paraphrases obtained from Mechanical Turk crowdsourcing. The corpus is made up of 4,067 accepted paraphrases, 3,792 rejected non-paraphrases, and the original texts. These samples have formed part of PAN 2010 international plagiarism detection competition, but were not previously available separate to rest of the competition data.
Not seeing a result you expected?
Learn how you can add new datasets to our index.

Facebook

Twitter

Click to copy link

Link copied

Cite

Steven Burrows; Martin Potthast; Benno Stein (2011). Webis-CPC-11 [Dataset]. http://doi.org/10.5281/zenodo.3251771

Webis-CPC-11

Explore at:

26 scholarly articles cite this dataset (View in Google Scholar)

3251771Available download formats

Unique identifier

https://doi.org/10.5281/zenodo.3251771

Dataset updated

2011

Dataset provided by

Bauhaus-Universität Weimar
Computer Power Institute, Melbourne, Australia
The Web Technology & Information Systems Network
University of Kassel, hessian.AI, and ScaDS.AI

Authors

Steven Burrows; Martin Potthast; Benno Stein

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

The Webis Crowd Paraphrase Corpus 2011 (Webis-CPC-11) contains 7,859 candidate paraphrases obtained from Mechanical Turk crowdsourcing. The corpus is made up of 4,067 accepted paraphrases, 3,792 rejected non-paraphrases, and the original texts. These samples have formed part of PAN 2010 international plagiarism detection competition, but were not previously available separate to rest of the competition data.

Webis-CPC-11

Webis-CPC-11See More Versions

Webis-CPC-11