Datasets for Non-Parametric Class Completeness Estimators for Collaborative Knowledge Graphs
Description
Non-Parametric Class Completeness Estimators for Collaborative Knowledge Graphs
This are intermediary datasets used for the calculation of the Class Completeness Estimators on Wikidata. For more information see: https://github.com/eXascaleInfolab/cardinal/
edits_wikidatawiki-20181001-pages.csv
This is an extract from wikidatawiki-20181001-pages-meta-history (All pages with complete page edit history (.bz2)) found at https://dumps.wikimedia.org/wikidatawiki/.
The extract was created by the following SQL query:
SELECT page_title, rev_comment, rev_user_text, rev_timestamp FROM revisions WHERE rev_comment LIKE '%[[Property:%]]%[[Q%' ORDER BY rev_id INTO OUTFILE 'edits_wikidatawiki-20181001-pages.csv';
wikidata-20180813-all.json.bz2.universe.noattr.gt.bz2
This is a graph-tool representation of the WikiData graph. Output of https://github.com/eXascaleInfolab/cardinal/blob/master/1_create_inmemory_graph.py.
observations_wikidatawiki-20181001-pages.pickle
Extracted observations. Output of https://github.com/eXascaleInfolab/cardinal/blob/master/2_extract_observations.py.
estimates_wikidatawiki-20181001-pages.pickle
Extracted estimates. Output of https://github.com/eXascaleInfolab/cardinal/blob/master/3_calculate_estimates.py
results_wikidatawiki-20181001-pages.pickle
Results. Output of https://github.com/eXascaleInfolab/cardinal/blob/master/4_draw_graphs.py
Files
edits_wikidatawiki-20181001-pages.csv
Files
(37.2 GB)
Additional details
Related works
- Is supplement to
- 10.1007/978-3-030-30793-6_26 (DOI)
- References
- 10.5281/zenodo.3268725 (DOI)