Published August 13, 2015 | Version v1
Dataset Open

Wikidata Vandalism Corpus 2015 (WDVC-15)

  • 1. Universität Paderborn
  • 2. Bauhaus-Universität Weimar


The Wikidata vandalism corpus 2015 (WDVC-15) is a corpus for the evaluation of automatic vandalism detectors for Wikidata. For research purposes the corpus can be used free of charge.


Files (4.8 GB)

Name Size Download all
4.8 GB Download

Additional details


  • Stefan Heindorf, Martin Potthast, Benno Stein, and Gregor Engels. Towards Vandalism Detection in Knowledge Bases: Corpus Construction and Analysis. In Ricardo Baeza-Yates, Mounia Lalmas, Alistair Moffat, and Berthier Ribeiro-Neto, editors, 38th International ACM Conference on Research and Development in Information Retrieval (SIGIR 2015), pages 831-834, August 2015. ACM. ISBN 978-1-4503-3621-5