Dataset Open Access

Wikidata Vandalism Corpus 2015 (WDVC-15)

Stein, Benno; Potthast, Martin; Heindorf, Stefan; Engels, Gregor

The Wikidata vandalism corpus 2015 (WDVC-15) is a corpus for the evaluation of automatic vandalism detectors for Wikidata. For research purposes the corpus can be used free of charge.

Files (4.8 GB)
Name Size
wikidata-vandalism-corpus-2015.tar.bz2
md5:34a68c8bb6023911d71539beeae001fa
4.8 GB Download
  • Stefan Heindorf, Martin Potthast, Benno Stein, and Gregor Engels. Towards Vandalism Detection in Knowledge Bases: Corpus Construction and Analysis. In Ricardo Baeza-Yates, Mounia Lalmas, Alistair Moffat, and Berthier Ribeiro-Neto, editors, 38th International ACM Conference on Research and Development in Information Retrieval (SIGIR 2015), pages 831-834, August 2015. ACM. ISBN 978-1-4503-3621-5

32
7
views
downloads
All versions This version
Views 3232
Downloads 77
Data volume 33.7 GB33.7 GB
Unique views 1717
Unique downloads 33

Share

Cite as