Published October 21, 2022 | Version 2.0
Dataset Open

Inductive Freebase and Wikidata for KG Completion

  • 1. Mila, McGill University

Description

Update (version 2.0): the datasets were regenerated to remove potential test set leakage.

This repository contains 10 inductive link prediction datasets (graphs only) published in "Inductive Logical Query Answering in Knowledge Graphs" (NeurIPS 2022). Nine datasets (106-550) were created from FB15k-237; the wikikg dataset was created from the OGB WikiKG 2 graph. In every dataset, the inference graph extends the training graph with new nodes and edges. The dataset number indicates the size of the inference graph relative to the training graph: for example, in 175, the inference graph has 175% as many nodes as the training graph. The higher the ratio, the more unseen nodes appear at inference time and the harder the task. The wikikg split has a fixed 133% ratio.
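To illustrate how the dataset number relates to graph sizes, the following minimal sketch computes the node ratio between an inference graph and a training graph. The helper names `node_set` and `node_ratio_percent` and the toy triples are hypothetical and only serve the example; they are not part of the released code.

```python
def node_set(triples):
    """Collect every entity that appears as head or tail of a (head, relation, tail) triple."""
    nodes = set()
    for head, _relation, tail in triples:
        nodes.add(head)
        nodes.add(tail)
    return nodes


def node_ratio_percent(train_triples, inference_triples):
    """Number of inference-graph nodes as a percentage of training-graph nodes."""
    train_nodes = node_set(train_triples)
    inference_nodes = node_set(inference_triples)
    return 100.0 * len(inference_nodes) / len(train_nodes)


# Toy data (made up): the inference graph contains the training graph
# plus three new nodes, so 4 training nodes grow to 7 inference nodes.
train = [("a", "r1", "b"), ("b", "r2", "c"), ("c", "r1", "d")]
inference = train + [("d", "r2", "e"), ("e", "r1", "f"), ("f", "r2", "g")]
ratio = node_ratio_percent(train, inference)
```

In this toy case the ratio is 175, which would correspond to the naming of the 175 dataset.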

Each dataset is a zip archive containing 5 files:

  • train_graph.txt (pt for wikikg) - original training graph
  • val_inference.txt (pt) - inference graph (validation split); new nodes in validation are disjoint from those in the test inference graph
  • val_predict.txt (pt) - missing edges in the validation inference graph, to be predicted
  • test_inference.txt (pt) - inference graph (test split); new nodes in test are disjoint from those in the validation inference graph
  • test_predict.txt (pt) - missing edges in the test inference graph, to be predicted
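One possible way to read a text graph straight from a downloaded archive is sketched below. It rests on assumptions not stated in the description: that each line of a `.txt` file holds one tab-separated (head, relation, tail) triple, and that the file may sit in a subfolder of the archive, so it is matched by base name. The function name `read_triples_from_zip` is hypothetical. The `.pt` files of the wikikg split are not covered here.

```python
import io
import zipfile
from pathlib import PurePosixPath


def read_triples_from_zip(zip_source, file_name):
    """Read (head, relation, tail) triples from a text file inside a zip archive.

    zip_source may be a path or a binary file-like object.
    Assumption: one tab-separated triple per line.
    """
    with zipfile.ZipFile(zip_source) as archive:
        # Match by base name because the folder layout inside the archive is unknown.
        matches = [name for name in archive.namelist()
                   if PurePosixPath(name).name == file_name]
        if not matches:
            raise FileNotFoundError(f"{file_name} not found in archive")
        triples = []
        with archive.open(matches[0]) as raw:
            for line in io.TextIOWrapper(raw, encoding="utf-8"):
                parts = line.strip().split("\t")
                if len(parts) == 3:  # skip blank or malformed lines
                    triples.append(tuple(parts))
        return triples


# Toy demonstration with an in-memory archive (made-up data).
buffer = io.BytesIO()
with zipfile.ZipFile(buffer, "w") as toy_archive:
    toy_archive.writestr("toy/train_graph.txt", "a\tr1\tb\nb\tr2\tc\n\n")
buffer.seek(0)
toy_triples = read_triples_from_zip(buffer, "train_graph.txt")
```

With a real download, the first argument would be the path to the local zip file and the second one of the file names listed above.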

This is a lightweight version of the full datasets for inductive query answering, which are published at https://zenodo.org/record/7231344

Here, we only provide graph data for training inductive link prediction models.

Paper pre-print: https://arxiv.org/abs/2210.08008

The full source code for model training and inference is available at https://github.com/DeepGraphLearning/InductiveQE

Files

Files (145.7 MB)

Checksum and size of each file:

  • md5:c50dfd54468d8fe201a9b134bd91dc9c - 1.6 MB
  • md5:988834178228705461003653c9ace306 - 1.6 MB
  • md5:45390c537a8a00642d7f9414a00b247a - 1.6 MB
  • md5:4d4c5a3db8b13f560a0641664a33c9b2 - 1.5 MB
  • md5:910cf5a6af861f06252fc5f77c343305 - 1.4 MB
  • md5:9511585e9488c0367b93b33f903bcc93 - 1.3 MB
  • md5:e3c99cddfe8e7e2276a4592aecb051ad - 1.2 MB
  • md5:e0fedd89784b133036b85c93b55b2848 - 1.0 MB
  • md5:1d86fc686774a6ee14d7c401f520a62b - 806.6 kB
  • md5:3fad5736b016a31009158b29461ca329 - 133.7 MB
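A downloaded archive can be checked against the md5 values listed above. The sketch below uses only the Python standard library; the helper names `normalize_checksum`, `md5_of_stream`, and `file_matches_checksum` are hypothetical, and the local file path is whatever name the download was saved under.

```python
import hashlib
import io


def normalize_checksum(expected):
    """Accept 'md5:<hex>' or plain hex and return lowercase hex digits."""
    return expected.split(":", 1)[-1].strip().lower()


def md5_of_stream(stream, chunk_size=1 << 20):
    """Compute the md5 hex digest of a binary stream, reading it in chunks."""
    digest = hashlib.md5()
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


def file_matches_checksum(path, expected):
    """Return True if the file at `path` has the expected md5 checksum."""
    with open(path, "rb") as handle:
        return md5_of_stream(handle) == normalize_checksum(expected)


# Self-check on in-memory bytes: this is the well-known md5 digest of b"abc".
abc_digest = md5_of_stream(io.BytesIO(b"abc"))
```

Reading in chunks keeps memory use low, which matters mostly for the largest archive.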

Additional details

References

  • Galkin et al. Inductive Logical Query Answering in Knowledge Graphs. NeurIPS 2022