Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Webis-SameSentiment-21 dataset is a collection of sentiment review pairs for Same Sentiment Classification. The dataset contains only the pair IDs (business and review ID) to allow recreation of the dataset. The actual review text has to be downloaded from Yelp.
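Because only IDs are distributed, recreating the pairs amounts to a join between the ID pairs and the review records obtained from Yelp. The following is a minimal sketch of that join, not the dataset's official tooling. It assumes the Yelp reviews are available as JSON lines with `review_id` and `text` fields, and that the pairs have already been read into tuples of two review IDs; the function names and the pair layout are hypothetical.

```python
import json
from typing import Dict, Iterable, List, Tuple


def load_review_texts(lines: Iterable[str]) -> Dict[str, str]:
    """Map review_id -> text from JSON-lines records (one review per line).

    Assumption: each record has "review_id" and "text" fields.
    """
    texts: Dict[str, str] = {}
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        texts[record["review_id"]] = record["text"]
    return texts


def rebuild_pairs(
    pairs: Iterable[Tuple[str, str]], texts: Dict[str, str]
) -> List[Tuple[str, str]]:
    """Attach review text to each (review_id_a, review_id_b) pair.

    Pairs for which either review is missing are skipped.
    """
    rebuilt = []
    for id_a, id_b in pairs:
        if id_a in texts and id_b in texts:
            rebuilt.append((texts[id_a], texts[id_b]))
    return rebuilt
```

In practice `lines` would be an open file handle over the downloaded review file, so the reviews are streamed rather than loaded as one JSON document.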
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Cross-Lingual Sentiment (CLS) dataset comprises about 800,000 Amazon product reviews in four languages: English, German, French, and Japanese.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Webis-Dataset-Reviews-21 corpus comprises a curated list of 13,372 NLP-related datasets and their 539,411 mentions extracted from all the publications available in the ACL Anthology corpus.
Data on public websites maintained by or on behalf of the city agencies.
Web traffic statistics for the top 2000 most visited pages on nyc.gov by month.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Webis-CMV-20 dataset comprises all available posts and comments in the ChangeMyView subreddit from the foundation of the subreddit in 2005 until September 2017. From these, we have derived two sub-datasets for the tasks of persuasiveness prediction and opinion malleability prediction. In addition, the corpus comprises historical posts by CMV authors and derived personal characteristics.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Webis-SameSide-21 dataset is a resampled dataset based on the Same Side Stance Classification shared task dataset.
This data about nola.gov provides a window into how people are interacting with the City of New Orleans online. The data comes from a unified Google Analytics account for New Orleans. We do not track individuals and we anonymize the IP addresses of all visitors.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This corpus is outdated. Please use its successor PAN-PC-11.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Webis-STEREO-21 is a massive collection of scientific text reuse in open-access publications. It contains more than 91 million cases of reused text passages found in 4.2 million unique open-access publications, with a high coverage of scientific disciplines and varieties of reuse, as well as comprehensive metadata to contextualize each case.
Daily utilization metrics for data.lacity.org and geohub.lacity.org. Updated monthly.
The Office of the Chief Human Capital Officer (OCHCO) provides effective leadership on policies, programs, and partnerships related to all aspects of human capital management. We support the Department in achieving its mission by proactively planning, recruiting, developing, and retaining the best workforce possible.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The dataset contains argument pairs that are sampled from the args.me dataset and cover two topics: abortion and gay marriage. The dataset is used in the Same Side Stance Classification challenge, which consists of two experiments (cross-topic and within-topic).
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Webis Netspeak Instant Search Log 2021 (Webis-NIL-21) is an excerpt of the log of the Netspeak search engine. The dataset contains about 37,000 log entries, which correspond to the keystroke interactions that Netspeak users made with its search interface while entering their queries. This enables the study of instant search logs in general, and that of identifying keystroke interactions belonging to the same query in particular. The latter is annotated in the log.
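To illustrate what grouping keystroke interactions by query can mean, here is a naive baseline sketch. It is not the method used to annotate the log: it simply assumes the interactions are given as query strings in log order and treats consecutive entries as the same query when one is a prefix of the other (typing or deleting characters at the end). The function name and input layout are hypothetical.

```python
from typing import List


def group_keystrokes(entries: List[str]) -> List[List[str]]:
    """Group consecutive query strings that look like edits of one query.

    Heuristic (an assumption, not the dataset's annotation scheme):
    an entry continues the current group if it extends the previous
    entry or is a prefix of it (i.e., characters were deleted).
    """
    groups: List[List[str]] = []
    for text in entries:
        if groups:
            prev = groups[-1][-1]
            if text.startswith(prev) or prev.startswith(text):
                groups[-1].append(text)
                continue
        groups.append([text])  # start a new query group
    return groups
```

A heuristic like this could serve as a simple baseline to compare against the annotations provided in the log; it will miss edits made in the middle of a query.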
National Veterans Small Business Engagement homepage
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This corpus provides the simulation data mining community with a collection of 14,641 bridge models and their simulated behavior.