Dataset Open Access

The growth of COVID-19 scientific literature: A forecast analysis of different daily time series in specific settings

Torres-Salinas, Daniel; Robinson-García, Nicolás; van Schalkwyk, François; Nane, Gabriela F.; Castillo-Valdivieso, Pedro

Submitted to The ISSI 2021 Conference. The conference is organised by KU Leuven in close collaboration with the university of Antwerp under the auspices of ISSI – the International Society for Informetrics and Scientometrics ( 

We present a forecasting analysis on the growth of scientific literature related to COVID-19 expected for 2021. Considering the paramount scientific and financial efforts made by the research community to find solutions to end the COVID-19 pandemic, an unprecedented volume of scientific outputs is being produced. This questions the capacity of scientists, politicians and citizens to maintain infrastructure, digest content and take scientifically informed decisions. A crucial aspect is to make predictions to prepare for such a large corpus of scientific literature. Here we base our predictions on the ARIMA model and use two different data sources: the Dimensions and World Health Organization COVID-19 databases. These two sources have the particularity of including in the metadata information on the date in which papers were indexed.  We present global predictions, plus predictions in three specific settings: by type of access (Open Access), by NLM source (PubMed and PMC), and by domain-specific repository (SSRN and MedRxiv). We conclude by discussing our findings.

Files (224.2 kB)
Name Size
Open data2 - The growth of COVID-19 scientific literature.xlsx
224.2 kB Download
All versions This version
Views 206206
Downloads 3232
Data volume 7.2 MB7.2 MB
Unique views 170170
Unique downloads 3131


Cite as