Dataset Open Access

Wikidata Vandalism Corpus 2015 (WDVC-15)

Heindorf, Stefan; Potthast, Martin; Stein, Benno; Engels, Gregor

The Wikidata Vandalism Corpus 2015 (WDVC-15) is a corpus for evaluating automatic vandalism detectors for Wikidata. The corpus can be used free of charge for research purposes.

Files (4.8 GB)
Name: wikidata-vandalism-corpus-2015.tar.bz2
Size: 4.8 GB
MD5: 34a68c8bb6023911d71539beeae001fa
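
Because the archive is large, it may be worth checking a downloaded copy against the MD5 checksum listed with the file before unpacking it. The following is a minimal sketch, not part of the corpus distribution. The function names `md5_of_file` and `verify_archive` are hypothetical, and the script assumes the archive was saved under its original name in the current directory.

```python
import hashlib
from pathlib import Path

# File name and checksum copied from the file listing of this record.
ARCHIVE_NAME = "wikidata-vandalism-corpus-2015.tar.bz2"
EXPECTED_MD5 = "34a68c8bb6023911d71539beeae001fa"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Chunked reads keep memory use flat even for a multi-gigabyte file.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path, expected=EXPECTED_MD5):
    """Return True if the file at `path` matches the expected MD5 digest."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumption: the archive sits in the current working directory.
    archive = Path(ARCHIVE_NAME)
    if archive.exists():
        print("checksum OK" if verify_archive(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

A mismatch usually points to an incomplete or corrupted download, in which case the file should be fetched again.
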
  • Stefan Heindorf, Martin Potthast, Benno Stein, and Gregor Engels. Towards Vandalism Detection in Knowledge Bases: Corpus Construction and Analysis. In Ricardo Baeza-Yates, Mounia Lalmas, Alistair Moffat, and Berthier Ribeiro-Neto, editors, 38th International ACM Conference on Research and Development in Information Retrieval (SIGIR 2015), pages 831-834, August 2015. ACM. ISBN 978-1-4503-3621-5
