This archive contains the training corpus for the "Sexual Predator Identification" task of the PAN 2012 Lab, held in conjunction with the CLEF 2012 conference.
This dataset enables research on early sexual predator detection (eSPD) in chats. It is built from the PAN12 sexual predator identification dataset and the ChatCoder2 dataset. It provides full-length predator chats from PervertedJustice and short segments of non-predator chats. Together these can be used to evaluate eSPD systems.
Dataset Card for Dataset Name
A part of the PAN-2012 dataset, translated into Romanian using automated tools, for use in predator detection and automated translation comparison.
Dataset Details
Dataset Description
These datasets were created from the training and testing datasets presented at the PAN12 competition (https://pan.webis.de/clef12/pan12-web/sexual-predator-identification.html), which was centered around sexual harassment… See the full description on the dataset page: https://huggingface.co/datasets/CristinaMierla/PAN12_predatorTask_romanianTranslation.