Dataset Open Access
Following, and in parallel with, the recently released CORD-19 dataset of scholarly articles, we provide the literature graph LG-covid19-HOTP. The graph comprises not only articles (graph nodes) relevant to the study of coronavirus but also citation links (graph edges) that facilitate navigation and search among the articles. The article records are thus connected rather than isolated, in the same spirit as other existing literature graphs, and are focused on the particular theme of COVID-19 research. The graph nodes include 2,754 hot-off-the-press (HOTP) articles published since January 2020. In total, the graph contains about one hundred thousand articles and nearly one million links. In addition to the dataset, we provide basic metadata analysis and visualization of publication growth over time, ranking by citation, similarity in co-citation, and similarity in co-reference, available at lg-covid-19-hotp.cs.duke.edu.
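To make the three citation-based measures concrete, the following is a minimal sketch on a toy graph. It assumes the citation links are available as (citing, cited) pairs; the edge list, the function names, and the use of raw shared-neighbor counts are all illustrative assumptions, not the dataset's actual file format or the exact similarity measure used on the website.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical toy edge list: (citing_article, cited_article).
# The real dataset's identifiers and file layout may differ.
edges = [
    ("A", "C"), ("A", "D"),
    ("B", "C"), ("B", "D"), ("B", "E"),
    ("F", "C"),
]

def build_adjacency(edges):
    """Return (references, cited_by) maps from article to neighbor set."""
    references = defaultdict(set)  # article -> articles it cites
    cited_by = defaultdict(set)    # article -> articles that cite it
    for citing, cited in edges:
        references[citing].add(cited)
        cited_by[cited].add(citing)
    return references, cited_by

def shared_neighbor_counts(neighbor_sets):
    """Count shared neighbors for every unordered pair of articles."""
    counts = {}
    for a, b in combinations(sorted(neighbor_sets), 2):
        shared = len(neighbor_sets[a] & neighbor_sets[b])
        if shared:
            counts[(a, b)] = shared
    return counts

references, cited_by = build_adjacency(edges)

# Ranking by citation: articles ordered by how often they are cited.
citation_rank = sorted(cited_by, key=lambda n: (-len(cited_by[n]), n))

# Co-citation: two articles are similar if the same articles cite both.
co_citation = shared_neighbor_counts(cited_by)

# Co-reference: two articles are similar if they cite the same articles.
co_reference = shared_neighbor_counts(references)
```

In this toy graph, C and D are co-cited by both A and B, while A and B share the references C and D. In practice such raw counts are often normalized (for example, by a Jaccard or cosine measure) so that heavily cited articles do not dominate.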
COVID-19 Open Research Dataset (CORD-19). 2020. Version 2020-03-20. Retrieved from https://pages.semanticscholar.org/coronavirus-research. Accessed 2020-03-26. DOI: 10.5281/zenodo.3727291.
Crossref REST API. Available at www.crossref.org. Accessed 2020-03-25.
Elsevier Scopus Citation Overview API. Accessed 2020-03-25.
Semantic Scholar Open Research Corpus. 2019. Version 2019-11-01. Retrieved from http://s2-public-api-prod.us-west-2.elasticbeanstalk.com/corpus/download/. Accessed 2019-12-06.