Dataset Open Access

Wikidata Vandalism Corpus 2015 (WDVC-15)

Heindorf, Stefan; Potthast, Martin; Stein, Benno; Engels, Gregor

The Wikidata Vandalism Corpus 2015 (WDVC-15) is a corpus for evaluating automatic vandalism detectors for Wikidata. The corpus can be used free of charge for research purposes.

Files (4.8 GB)

Name: wikidata-vandalism-corpus-2015.tar.bz2
Size: 4.8 GB
MD5: 34a68c8bb6023911d71539beeae001fa
  • Stefan Heindorf, Martin Potthast, Benno Stein, and Gregor Engels. Towards Vandalism Detection in Knowledge Bases: Corpus Construction and Analysis. In Ricardo Baeza-Yates, Mounia Lalmas, Alistair Moffat, and Berthier Ribeiro-Neto, editors, 38th International ACM Conference on Research and Development in Information Retrieval (SIGIR 2015), pages 831-834, August 2015. ACM. ISBN 978-1-4503-3621-5

                   All versions   This version
Views              243            243
Downloads          39             39
Data volume        187.8 GB       187.8 GB
Unique views       192            192
Unique downloads   29             29
