Published August 11, 2017 | Version 2 incl. error annotations
Dataset
Open
Webis Query Spelling Corpus 2017 (Webis-QSpell-17)
- 1. Bauhaus-Universität Weimar
Description
The Webis Query Spelling Corpus 2017 (Webis-QSpell-17) contains 54,772 web queries that were manually spell-checked; for 9,171 of these queries, alternative spelling variants are included.
For segmentations of many of the queries (i.e., tagged concepts and phrases), please refer to the companion corpus Webis-QSeC-10.
Files
Name | Size | Checksum |
---|---|---|
corpus-webis-qspell-17.zip | 2.3 MB | md5:d5dc918fa0be3f321b547a765e6e2956 |
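The MD5 checksum listed for the archive can be used to confirm that a download is complete and unmodified. The following is a minimal sketch, not part of the corpus distribution: the helper names `md5_of_file` and `verify_download` are hypothetical, and the local path assumes the archive was saved under its original file name in the current working directory.

```python
import hashlib
from pathlib import Path

# Checksum as listed for corpus-webis-qspell-17.zip on this page.
EXPECTED_MD5 = "d5dc918fa0be3f321b547a765e6e2956"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Hypothetical local path: assumes the archive sits in the current directory.
    archive = Path("corpus-webis-qspell-17.zip")
    if archive.exists():
        print("checksum OK" if verify_download(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found")
```

A mismatch usually indicates a truncated or corrupted download; fetching the archive again is the simplest remedy.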
Additional details
References
- Matthias Hagen, Martin Potthast, Marcel Gohsen, Anja Rathgeber, and Benno Stein. A Large-Scale Query Spelling Correction Corpus. In Noriko Kando et al., editors, 40th International ACM Conference on Research and Development in Information Retrieval (SIGIR 2017), pages 1261-1264, August 2017. ACM. ISBN 978-1-4503-5022-8.