Saved datasets
1 dataset found
  1. Webis TripAdvisor Corpus 2014 (Webis-Tripad-14)

    • zenodo.org
    • explore.openaire.eu
    application/gzip
    Updated Jan 24, 2020
    + more versions
  2. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Martin Trenkmann; Katharina Spiel; Katharina Spiel; Martin Trenkmann (2020). Webis TripAdvisor Corpus 2014 (Webis-Tripad-14) [Dataset]. http://doi.org/10.5281/zenodo.3266882
Organization logo

Webis TripAdvisor Corpus 2014 (Webis-Tripad-14)

Explore at:
application/gzipAvailable download formats
Dataset updated
Jan 24, 2020
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Martin Trenkmann; Katharina Spiel; Katharina Spiel; Martin Trenkmann
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Crawled over 2 weeks in January 2014, the Webis TripAdvisor Corpus 2014 (Webis-Tripad-14) consists of 266 061 reviews on 12 044 hotels by 208 785 users. Additionally, there is meta data about the hotels (such as location or overall ratings), the users (such as gender and age range) and the reviews itself (such as date posted and rating) available. We offer a download in json format: one file per hotel and one file containing all the user information.

The Webis TripAdvisor Corpus 2014 (Webis-Tripad-14) is designed in such a way that several different tasks can be performed on it, such as sentiment analysis, author profiling or usefulness detection.

The json-corpus consists of 12 045 files, where one of them contains all the user data and the others are one for each of the hotels in the data set. A detailed description of the data and the key/value pairs can be found as a README.txt in the download folder.

Search
Clear search
Close search
Google apps
Main menu