Published May 20, 2019
| Version v1
Dataset
Open
Webis Query-Task-Mapping Corpus 2019 (Webis-QTM-19)
- 1. Bauhaus-Universität Weimar
- 2. Martin-Luther-Universität Halle-Wittenberg
Description
The Webis Query-Task-Mapping Corpus 2019 (Webis-QTM-19) comprises three benchmark datasets on the query-task-mapping problem, which consists of finding the correct task for a new query in a given task-split background query log.
It comprises three subdatasets in separate CSV files, each of which has three columns:
- Query. The query string.
- Source. The source of the query. In all datasets, a source field with value 'google' or 'bing' indicates that the query was derived from query suggestions from the respective search engine; otherwise, the query is from one of the underlying base corpora:
- 'lucc' : lucchese:2011
- 'webis' : stein:2013b
- 'trc' : stein:2016a
- 'trec' : various collections of TREC queries
- 'wikihow' : based on titles of wikiHow questions
- Task. The ID of the ground-truth task for the corresponding query.
Further details can be found in reference:
Michael Völske, Ehsan Fatehifar, Benno Stein, and Matthias Hagen. Query-Task Mapping. In 42nd International ACM Conference on Research and Development in Information Retrieval (SIGIR 2019), July 2019. ACM.
http://doi.acm.org/10.1145/3331184.3331286
Files
webis-qtm-19.zip
Files
(1.9 MB)
Name | Size | Download all |
---|---|---|
md5:d39718b62ec203f556d82e41b0c075c0
|
1.9 MB | Preview Download |
Additional details
References
- Michael Völske, Ehsan Fatehifar, Benno Stein, and Matthias Hagen. Query-Task Mapping. In 42nd International ACM Conference on Research and Development in Information Retrieval (SIGIR 2019), July 2019. ACM.