Published April 24, 2020 | Version 1
Dataset Open

Lifelong Learning of Graph Neural Networks for Open-World Node Classification

  • 1. Kiel University, Germany
  • 2. Ulm University, Germany

Contributors

Data collector:

  • 1. Kiel University

Description

Three temporal graph datasets for node classification under distribution shift.

DBLP-Easy and DBLP-Hard are citation graph datasets. PharmaBio is a collaboration graph dataset.

Vertices are scientific publications, edges are either citations (DBLP) or at-least-one-common-author relationships (PharmaBio).

The task is to classify the vertices of the graph into the respective conference/journal venues (DBLP) or journal categories (PharmaBio). In the DBLP datasets, new classes may appear over time.

Each dataset follows the structure:

- adjlist.txt -- the graph structure encoded as adjacency lists: in each row, the first entry is the source vertex, the remaining entries are adjacent vertices

- X.npy -- numpy serialized format for node features indexed by node id corresponding to adjlist.txt

- y.npy -- numpy serialized format for node labels indexed by node id corresponding to adjlist.txt

- t.npy -- numpy serialized format for time steps indexed by node id corresponding to adjlist.txt

A paper describing our incremental training and evaluation framework is published in IJCNN 2021 (Pre-print on arXiv: https://arxiv.org/abs/2006.14422).

If you use these datasets in your research, please cite the corresponding paper:

@inproceedings{galke2021lifelong,
  author={Galke, Lukas and Franke, Benedikt and Zielke, Tobias and Scherp, Ansgar},
  booktitle={2021 International Joint Conference on Neural Networks (IJCNN)},
  title={Lifelong Learning of Graph Neural Networks for Open-World Node Classification},
  year={2021},
  volume={},
  number={},
  pages={1-8},
  doi={10.1109/IJCNN52387.2021.9533412}
}


 

Files

dblp-easy-citation-graph.zip

Files (45.4 MB)

Name Size Download all
md5:399e63ff863bcfa573fa08da990bf3a3
5.1 MB Preview Download
md5:e7ee66a6fe33cba2e79b489ac80b04a1
26.7 MB Preview Download
md5:46dbea08c0c64c751250fab1633ad693
13.6 MB Preview Download

Additional details

Related works

Is compiled by
Preprint: https://arxiv.org/abs/2006.14422 (URL)
Conference paper: 10.1109/IJCNN52387.2021.9533412 (DOI)
Is documented by
Conference paper: 10.1109/IJCNN52387.2021.9533412 (DOI)
Is required by
Conference paper: 10.1109/IJCNN52387.2021.9533412 (DOI)
Is supplement to
Conference paper: 10.1109/IJCNN52387.2021.9533412 (DOI)