A collection of full-text documents from various sources including the Financial Times Limited (1991, 1992, 1993, 1994), the Congressional Record of the 103rd Congress (1993), the Federal Register (1994), the Foreign Broadcast Information Service (1996), and the Los Angeles Times (1989, 1990). These documents are the document set for several TREC information retrieval test collections. (Data contains document text only.)
Dataset Card for disks45/nocr/trec-robust-2004
The disks45/nocr/trec-robust-2004 dataset, provided by the ir-datasets package. For more information about the dataset, see the documentation.
Data
This dataset provides:
queries (i.e., topics); count=250
qrels: (relevance assessments); count=311,410
For docs, use irds/disks45_nocr
Usage
from datasets import load_dataset
queries = load_dataset('irds/disks45_nocr_trec-robust-2004', 'queries') for… See the full description on the dataset page: https://huggingface.co/datasets/irds/disks45_nocr_trec-robust-2004.
A collection of full-text English documents from various sources including the Foreign Broadcast Information Service (1996) and the Los Angeles Times (1989, 1990). These documents make up part of the document set for several TREC information retrieval test collections.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
A collection of full-text documents from various sources including the Financial Times Limited (1991, 1992, 1993, 1994), the Congressional Record of the 103rd Congress (1993), the Federal Register (1994), the Foreign Broadcast Information Service (1996), and the Los Angeles Times (1989, 1990). These documents are the document set for several TREC information retrieval test collections. (Data contains document text only.)