-
Notifications
You must be signed in to change notification settings - Fork 47
Open
Labels
Description
Dataset Information:
This is the new corpus for the upcoming 2025 TREC tip-of-the-tongue shared task.
The corpus is updated and already contains train and dev queries + qrels.
The 2025 test queries are still in preparation, but when we get the corpus into ir-datasets adding the test queries later would likely be easy.
Links to Resources:
- Task description and details: https://trec-tot.github.io/
- Data: https://zenodo.org/records/15356599
Dataset ID(s) & supported entities:
- trec-tot/2025/train
- trec-tot/2025/dev1
- trec-tot/2025/dev2
- trec-tot/2025/dev3
Checklist
Mark each task once completed. All should be checked prior to merging a new dataset.
- Dataset definition (in
ir_datasets/datasets/[topid].py) - Tests (in
tests/integration/[topid].py) - Metadata generated (using
ir_datasets generate_metadatacommand, should appear inir_datasets/etc/metadata.json) - Documentation (in
ir_datasets/etc/[topid].yaml)- Documentation generated in https://github.com/seanmacavaney/ir-datasets.com/
- Downloadable content (in
ir_datasets/etc/downloads.json)
Additional comments/concerns/ideas/etc.