Published July 27, 2022 | Version 1.0.0
Dataset Open

wiki-category-consistency-dataset

  • 1. Heinz Nixdorf Chair for Distributed Information Systems, Friedrich Schiller University Jena, Jena, Germany
  • 2. Institute of Data Science, German Aerospace Center DLR, Jena, Germany

Description

This dataset contains the candidate generation and cleaning results produced while analyzing the consistency between Wikipedia and Wikidata categories. The analysis used the Wikidata JSON dump of 2022-05-02 and the Wikipedia SQL dumps of 2022-05-01.

Detailed information can be found on the GitHub page.

Files

Files (363.0 MB)

Name: wiki-category-consistency-dataset.zip
Size: 363.0 MB
md5: 0d2febf4a123d2ecbd73da355613ca6e
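
The md5 checksum listed above can be used to confirm that the archive downloaded intact. The following is a minimal sketch, assuming the file was saved locally under its original name; the helper names `md5_of_file` and `verify_download` are illustrative and not part of the dataset or its accompanying software.

```python
import hashlib

# Checksum as listed in this record for wiki-category-consistency-dataset.zip
EXPECTED_MD5 = "0d2febf4a123d2ecbd73da355613ca6e"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex md5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read until an empty bytes object signals end of file
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True if the file's md5 digest matches the expected checksum."""
    return md5_of_file(path) == expected.lower()
```

Calling `verify_download("wiki-category-consistency-dataset.zip")` (the local path is an assumption) returns `True` only when the digest matches the listed checksum; a mismatch usually indicates an incomplete or corrupted download.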

Additional details

Related works

Is supplement to
Software: 10.5281/zenodo.6963599 (DOI)
Conference paper: https://ceur-ws.org/Vol-3262/paper4.pdf (URL)