Inductive Freebase and Wikidata for KG Completion
Description
Update 2.0: the datasets were regenerated to remove potential test-set leakage.
This repository contains 10 inductive link prediction datasets (graphs only) published in "Inductive Logical Query Answering in Knowledge Graphs" (NeurIPS 2022). Nine datasets (106-550) were created from FB15k-237; the wikikg dataset was created from the OGB WikiKG 2 graph. In every dataset, the inference graph extends the training graph with new nodes and edges. A dataset's number indicates the size of the inference graph relative to the training graph: in 175, for example, the inference graph has 175% as many nodes as the training graph. The higher the ratio, the more unseen nodes appear at inference time and the harder the task. The wikikg split has a fixed 133% ratio.
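The naming convention can be illustrated with a small sketch. It assumes a graph is held in memory as a list of (head, relation, tail) tuples; the function name `node_ratio` and the toy triples are hypothetical and are not taken from the datasets.

```python
def node_ratio(train_triples, inference_triples):
    """Return the inference-to-training node count ratio as a percentage."""
    def nodes(triples):
        # A node is anything that appears as a head or a tail.
        return {h for h, _, _ in triples} | {t for _, _, t in triples}

    return 100.0 * len(nodes(inference_triples)) / len(nodes(train_triples))


# Toy training graph with 4 nodes (made-up identifiers).
train = [("a", "r1", "b"), ("b", "r2", "c"), ("c", "r1", "d")]

# The inference graph extends the training graph with 3 unseen nodes (7 in total).
inference = train + [("d", "r2", "e"), ("e", "r1", "f"), ("f", "r1", "g")]

ratio = node_ratio(train, inference)
print(ratio)  # 175.0
```

Under this reading, a toy graph like the one above would correspond to the naming of the 175 dataset.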
Each dataset is a zip archive containing 5 files:
- train_graph.txt (pt for wikikg) - original training graph
- val_inference.txt (pt) - inference graph (validation split); new nodes in validation are disjoint from those in the test inference graph
- val_predict.txt (pt) - missing edges in the validation inference graph, to be predicted
- test_inference.txt (pt) - inference graph (test split); new nodes in test are disjoint from those in the validation inference graph
- test_predict.txt (pt) - missing edges in the test inference graph, to be predicted
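A minimal sketch of reading the text files follows. The on-disk layout is not described here, so the code assumes one whitespace-separated (head, relation, tail) triple per line; that layout, the helper names `read_triples` and `entities`, and the toy identifiers are all assumptions. The .pt files of the wikikg dataset are not covered.

```python
import tempfile
from pathlib import Path


def read_triples(path):
    """Read one whitespace-separated (head, relation, tail) triple per line.

    The three-field layout is an assumption, not a documented format.
    """
    triples = []
    with open(path, encoding="utf-8") as handle:
        for line_no, line in enumerate(handle, start=1):
            fields = line.split()
            if not fields:
                continue  # skip blank lines
            if len(fields) != 3:
                raise ValueError(
                    f"{path}:{line_no}: expected 3 fields, got {len(fields)}"
                )
            triples.append(tuple(fields))
    return triples


def entities(triples):
    """Collect every node that appears as a head or a tail."""
    return {h for h, _, _ in triples} | {t for _, _, t in triples}


# Toy stand-ins for two files of an extracted archive (identifiers are made up).
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    (root / "train_graph.txt").write_text(
        "e1\tr1\te2\ne2\tr2\te3\n", encoding="utf-8"
    )
    (root / "val_inference.txt").write_text(
        "e1\tr1\te2\ne2\tr2\te3\ne3\tr1\te4\n", encoding="utf-8"
    )
    train = read_triples(root / "train_graph.txt")
    val_inference = read_triples(root / "val_inference.txt")

# Nodes that appear only at inference time.
unseen = entities(val_inference) - entities(train)
print(sorted(unseen))  # ['e4']
```

Raising on malformed lines rather than skipping them is a deliberate choice in this sketch: if the real files use a different layout, the mismatch surfaces immediately.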
This is a light-weight version of the full datasets for inductive query answering published here: https://zenodo.org/record/7231344
Here, we only provide graph data for training inductive link prediction models.
Paper pre-print: https://arxiv.org/abs/2210.08008
The full source code of training/inference models is available at https://github.com/DeepGraphLearning/InductiveQE
Files
Total size: 145.7 MB
| MD5 checksum | Size |
|---|---|
| c50dfd54468d8fe201a9b134bd91dc9c | 1.6 MB |
| 988834178228705461003653c9ace306 | 1.6 MB |
| 45390c537a8a00642d7f9414a00b247a | 1.6 MB |
| 4d4c5a3db8b13f560a0641664a33c9b2 | 1.5 MB |
| 910cf5a6af861f06252fc5f77c343305 | 1.4 MB |
| 9511585e9488c0367b93b33f903bcc93 | 1.3 MB |
| e3c99cddfe8e7e2276a4592aecb051ad | 1.2 MB |
| e0fedd89784b133036b85c93b55b2848 | 1.0 MB |
| 1d86fc686774a6ee14d7c401f520a62b | 806.6 kB |
| 3fad5736b016a31009158b29461ca329 | 133.7 MB |
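The MD5 checksums listed above can be compared against a downloaded archive. The sketch below uses Python's standard `hashlib` module; the helper names `md5_of_file` and `matches_checksum` and the sample file are hypothetical, and the demo hashes a throwaway file rather than a real archive.

```python
import hashlib
import tempfile
from pathlib import Path


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected_md5):
    """Return True if the file's MD5 digest equals the expected hex string."""
    return md5_of_file(path) == expected_md5.strip().lower()


# Demo on a throwaway file; a real check would pass the downloaded archive
# and the checksum published for it.
with tempfile.TemporaryDirectory() as tmp:
    sample = Path(tmp) / "sample.zip"  # hypothetical file name
    sample.write_bytes(b"not a real archive")
    expected = hashlib.md5(b"not a real archive").hexdigest()
    ok = matches_checksum(sample, expected)
    bad = matches_checksum(sample, "0" * 32)

print(ok, bad)  # True False
```

Reading in chunks keeps memory use flat, which matters for the largest archive (133.7 MB).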
Additional details
References
- Galkin et al. Inductive Logical Query Answering in Knowledge Graphs. NeurIPS 2022