Dataset Open Access

Webis Query Spelling Corpus 2017 (Webis-QSpell-17)

Hagen, Matthias; Potthast, Martin; Stein, Benno; Gohsen, Marcel; Rathgeber, Anja

The Webis Query Spelling Corpus 2017 (Webis-QSpell-17) contains 54,772 web queries that were manually spell-checked; for 9,171 queries alternative spelling variants are contained.

As for segmentations of many of the queries (i.e., tagged concepts and phrases), please refer to the companion corpus Webis-QSeC-10.

Files (1.1 MB)
Name Size
corpus-webis-qspell-17.zip
md5:cda2d9bf28ff94a2f17dc97a642b8b84
1.1 MB Download
  • Matthias Hagen, Martin Potthast, Marcel Gohsen, Anja Rathgeber, and Benno Stein. A Large-Scale Query Spelling Correction Corpus. In Noriko Kando et al, editors, 40th International ACM Conference on Research and Development in Information Retrieval (SIGIR 2017), pages 1261-1264, August 2017. ACM. ISBN 978-1-4503-5022-8.

15
3
views
downloads
All versions This version
Views 1515
Downloads 33
Data volume 3.2 MB3.2 MB
Unique views 1313
Unique downloads 33

Share

Cite as