Dataset Open Access

SeminalSurveyDBLP Dataset for Classification of Seminal and Survey Publications

Kreutz, Christin Katharina; Sahitaj, Premtim; Schenkel, Ralf

This data set contains citation network data for 1320 publications from dblp, enriched with data from AMiner, for classification of seminal and survey publications.

Citations and references are included for every publication. For each of the 121,084 papers, the dblp key, the publication year, and stemmed and unstemmed concatenations of its title and abstract are given. Seminal papers come from A* conferences; surveys were extracted from venues that specialize in publishing reviews.
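As a rough illustration of how one record per paper could be held in memory, here is a minimal Python sketch. The class name `Paper`, all of its field names, the `label` values, and the helper `citation_counts` are assumptions made for this example; they do not describe the actual file layout of the data set, which is not specified here. The keys and texts are toy values, not taken from the data.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Paper:
    """Hypothetical record; field names are illustrative, not the dataset's schema."""
    dblp_key: str
    year: int
    text: str            # unstemmed concatenation of title and abstract
    text_stemmed: str    # stemmed concatenation of title and abstract
    references: List[str] = field(default_factory=list)  # keys this paper cites
    citations: List[str] = field(default_factory=list)   # keys of papers citing it
    label: Optional[str] = None  # assumed: "seminal", "survey", or None if unlabeled


def citation_counts(papers: Dict[str, Paper]) -> Dict[str, int]:
    """For each paper, count the citing papers that are present in the collection."""
    return {
        key: sum(1 for citing in paper.citations if citing in papers)
        for key, paper in papers.items()
    }


# Toy records with made-up keys; "conf/demo/C07" is deliberately absent.
papers = {
    "conf/demo/A01": Paper(
        dblp_key="conf/demo/A01",
        year=2001,
        text="Indexing methods",
        text_stemmed="index method",
        citations=["journals/demo/B05", "conf/demo/C07"],
        label="seminal",
    ),
    "journals/demo/B05": Paper(
        dblp_key="journals/demo/B05",
        year=2005,
        text="A survey of indexing",
        text_stemmed="survey index",
        references=["conf/demo/A01"],
        label="survey",
    ),
}

counts = citation_counts(papers)
```

Keying the dictionary by dblp key makes it cheap to check whether a cited or citing paper is itself part of the collection, which is why the helper only counts citing papers it can find.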

For details, see "Revaluating Semantometrics from Computer Science Publications" by Christin Katharina Kreutz, Premtim Sahitaj, and Ralf Schenkel, 2019, submitted to BIRNDL@SIGIR.

Files (90.1 MB)

