Dataset Open Access
Heindorf, Stefan;
Potthast, Martin;
Stein, Benno;
Engels, Gregor
The Wikidata vandalism corpus 2015 (WDVC-15) is a corpus for the evaluation of automatic vandalism detectors for Wikidata. For research purposes the corpus can be used free of charge.
Name | Size | |
---|---|---|
wikidata-vandalism-corpus-2015.tar.bz2
md5:34a68c8bb6023911d71539beeae001fa |
4.8 GB | Download |
Stefan Heindorf, Martin Potthast, Benno Stein, and Gregor Engels. Towards Vandalism Detection in Knowledge Bases: Corpus Construction and Analysis. In Ricardo Baeza-Yates, Mounia Lalmas, Alistair Moffat, and Berthier Ribeiro-Neto, editors, 38th International ACM Conference on Research and Development in Information Retrieval (SIGIR 2015), pages 831-834, August 2015. ACM. ISBN 978-1-4503-3621-5
All versions | This version | |
---|---|---|
Views | 416 | 416 |
Downloads | 52 | 52 |
Data volume | 250.4 GB | 250.4 GB |
Unique views | 353 | 353 |
Unique downloads | 42 | 42 |