3 results found
  1. Genre-KI-04

    • webis.de
    Published 2004
  2. Bollywood Movie Dataset

    • www.kaggle.com
    Updated Feb 4, 2018
  3. Data from: List of Bacterial Names with Standing in Nomenclature

    • cmr.earthdata.nasa.gov
    Updated Apr 21, 2017

Genre-KI-04

2 scholarly articles cite this dataset
  • Dataset published 2004
Dataset provided by
Bauhaus University, Weimar (http://www.uni-weimar.de/)
The Web Technology & Information Systems Network
Authors
Stein, Benno; Meyer zu Eissen, Sven
Description

The web genre corpus 2004 (Genre-KI-04) is designed for evaluating genre classification techniques. It consists of 1239 web documents classified into 8 genres, along with basic metadata for each file.
