Search
Clear search
Close search
Main menu
Google apps
1 dataset found
  1. Z

    PAN12 Originality: Source Retrieval

    • data.niaid.nih.gov
    • zenodo.org
    Updated Jun 11, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stein, Benno (2022). PAN12 Originality: Source Retrieval [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_3713287
    Explore at:
    Dataset updated
    Jun 11, 2022
    Dataset provided by
    Stein, Benno
    Oberländer, Arnd
    Gupta, Parth
    Rosso, Paolo
    Potthast, Martin
    Kiesel, Johannes
    Barrón-Cedeño, Alberto
    Tippmann, Martin
    Hagen, Matthias
    Graßegger, Jan
    Michel, Maximilian
    Gollub, Tim
    Description

    We provide you with a training corpus that consists of suspicious documents. Each suspicious document is about a specific topic and may consist of plagiarized passages obtained from web pages on that topic found in the ClueWeb09 corpus.

  2. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Stein, Benno (2022). PAN12 Originality: Source Retrieval [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_3713287

PAN12 Originality: Source Retrieval

Explore at:
Dataset updated
Jun 11, 2022
Dataset provided by
Stein, Benno
Oberländer, Arnd
Gupta, Parth
Rosso, Paolo
Potthast, Martin
Kiesel, Johannes
Barrón-Cedeño, Alberto
Tippmann, Martin
Hagen, Matthias
Graßegger, Jan
Michel, Maximilian
Gollub, Tim
Description

We provide you with a training corpus that consists of suspicious documents. Each suspicious document is about a specific topic and may consist of plagiarized passages obtained from web pages on that topic found in the ClueWeb09 corpus.