KeySearchWiki
- 1. Heinz Nixdorf Chair for Distributed Information Systems, Friedrich Schiller University Jena, Jena, Germany
- 2. Institute of Data Science, German Aerospace Center DLR, Jena, Germany
Description
KeySearchWiki is a dataset for evaluating keyword search systems over Wikidata.
The dataset was automatically generated by leveraging Wikidata and Wikipedia set categories (e.g., Category:American television directors) as data sources for both relevant entities and queries.
Relevant entities are gathered by carefully navigating the Wikipedia set categories hierarchy in all available languages. Furthermore, those categories are refined and combined to derive more complex queries.
Detailed information about KeySearchWiki and its generation can be found on the Github page.
Files
KeySearchWiki-dataset.zip
Files
(187.0 MB)
Name | Size | Download all |
---|---|---|
md5:381262fb10f9f4985a852d04d2e4a478
|
187.0 MB | Preview Download |
Additional details
Related works
- Is supplemented by
- Dataset: 10.5281/zenodo.4965398 (DOI)
- Software: 10.5281/zenodo.4968550 (DOI)