SUSdblp Dataset for Classification of Seminal, Uninfluential and Survey Publications
Description
This data set contains citation network data for 1980 publications from dblp (https://dblp.uni-trier.de/) enriched with data from AMiner (https://aminer.org/) for classification of seminal and survey publications. It is an extension of the SeminalSurveyDBLP dataset (https://zenodo.org/record/3258164#.XlztuUoxmUm).
Citations and references are contained for every publication. For each of the 129,442 papers, dblp key, publication year as well as stemmed and unstemmed concatenations of its title and abstract are given. For citing and referenced papers, their number of citations as well as their field and time normalised citation count are contained. Seminal papers come from A* conferences, surveys were extracted from venues specialized in publishing reviews. Uninfluential publications come from C conferences and obtained less than ten citations.
For details, see Evaluating Semantometrics from Computer Science Publications, Christin Katharina Kreutz, Premtim Sahitaj, and Ralf Schenkel, to appear in Scientometrics.
Files
SUSdblp.zip
Files
(103.1 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:ba4aa81e5a1dd9fcbd1c3ceee9d55d0b
|
103.1 MB | Preview Download |