Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Webis Open Directory Project Corpus 2010 (Webis-ODP-10) is a corpus for evaluating cluster labeling algorithms. It contains about 5,000 web pages, grouped into 4 main categories and 12 subcategories according to the human-made Open Directory Project (ODP) classification. The web pages were collected in May 2010.