Dataset Open Access
This dataset is associated with the publication "G.Coro, (2020), A global-scale ecological niche model to predict SARS-CoV-2 coronavirus infection rate, Ecological Modelling, Volume 431, 109187, https://doi.org/10.1016/j.ecolmodel.2020.109187"
This image reports a Maximum Entropy model that estimates suitable locations for COVID-19 spread, i.e. places that could favour the spread of the virus just in terms of environmental parameters.
The model was trained just on locations in Italy that have reported a rate of new infections higher than the geometric mean of all Italian infection rates. The following environmental parameters were used, which are correlated to those used by other studies:
A higher resolution map, the model file (in ASC format) and all parameters used are also attached.
The model indicates highest correlation with infection rate for CO2 around 0.03 gCm^−2day^−1, for Temperature around 11.8 °C, and for Precipitation around 0.3 kg m^-2 s^-1, whereas Elevation and Population density are poorly correlated with infection rate.
One interesting result is that the model indicates, among others, the Hubei region in China as a high-probability location, and Iran (around Teheran) as a suited location for virus' spread, but the model was not trained on these regions, i.e. it did not know about the actual spread in these regions.
A risk score was calculated for each country/region reported by the JHU monitoring system (https://gisanddata.maps.arcgis.com/apps/opsdashboard/index.html#/bda7594740fd40299423467b48e9ecf6). This score is calculated as the summed normalised probability in the populated locations divided by their total surface. This score represents how much the zone would potentially foster the virus' spread.
We assessed the reliability of this score, by selecting the country/regions that reported the highest rates of infection. These zones were selected as those with a rate higher than the upper confidence of a log-normal distribution of the rates.
The agreement between the two maps (covid_high_rate_vs_high_risk.png, where violet dots indicate high infection rates and countries' colours indicate estimated high risk score) is the following:
Accuracy (overall percentage of correctly predicted high-rate zones): 77.25%
Kappa (agreement between the two maps): 0.46 (Good, according to Fleiss' intepretation of the score)
This assessment demonstrates that our map can be used to estimate the risk of a certain country to have a high rate of infection, and indicates that the influence of environmental parameters on virus's spread should be further investigated.
Gianpaolo Coro, A global-scale ecological niche model to predict SARS-CoV-2 coronavirus infection rate, Ecological Modelling, Volume 431, 2020, 109187, ISSN 0304-3800, https://doi.org/10.1016/j.ecolmodel.2020.109187. (http://www.sciencedirect.com/science/article/pii/S0304380020302581)
Coro, G., Panichi, G., Scarponi, P., & Pagano, P. (2017). Cloud computing in a distributed e‐infrastructure using the web processing service standard. Concurrency and Computation: Practice and Experience, 29(18), e4219.
|All versions||This version|
|Data volume||17.5 GB||2.1 GB|