Published January 26, 2023 | Version v1
Dataset Open

The benefit of augmenting open data with clinical data-warehouse EHR for forecasting SARS-CoV-2 hospitalizations in Bordeaux area, France

  • 1. Centre Hospitalier Universitaire de Bordeaux
  • 2. Bordeaux Population Health

Description

Objective

The aim of this study was to develop an accurate regional forecast algorithm to predict the number of hospitalized patients and to assess the benefit of the Electronic Health Records (EHR) information to perform those predictions. Materials and Methods Aggregated data from SARS-CoV-2 and weather public database and data warehouse of the Bordeaux hospital were extracted from May 16, 2020, to January 17, 2022. The outcomes were the number of hospitalized patients in the Bordeaux Hospital at 7 and 14 days. We compared the performance of different data sources, feature engineering, and machine learning models.

Results

During the period of 88 weeks, 2561 hospitalizations due to COVID-19 were recorded at the Bordeaux Hospital. The model achieving the best performance was an elastic-net penalized linear regression using all available data with a median relative error at 7 and 14 days of 0.136 [0.063; 0.223] and 0.198 [0.105; 0.302] hospitalizations, respectively. Electronic health records (EHRs) from the hospital data warehouse improved median relative error at 7 and 14 days by 10.9% and 19.8%, respectively. Graphical evaluation showed remaining forecast error was mainly due to delay in slope shift detection.

Discussion

Forecast models showed overall good performance both at 7 and 14 days which was improved by the addition of the data from Bordeaux Hospital data warehouse.

Conclusions

The development of hospital data warehouses might help to get more specific and faster information than traditional surveillance systems, which in turn will help to improve epidemic forecasting at a larger and finer scale.

Notes

Data are stored in a .rdata file. Please use R (https://www.r-project.org/) software to open the data.

Funding provided by: Institut national de recherche en informatique et en automatique (INRIA)
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100012950
Award Number: Mission COVID19, GESTEPID project

Funding provided by: Conseil Régional Aquitaine
Crossref Funder Registry ID: http://dx.doi.org/10.13039/501100009468
Award Number: Prediction territorial COVID N°1333140

Funding provided by: Mission COVID19
Award Number: 1333140

Files

README.md

Files (153.6 kB)

Name Size Download all
md5:ac59eccb526e5b74942d5f2f86dd0d08
142.3 kB Download
md5:fbef82a3721d8ac466f242551e58ea8a
11.3 kB Preview Download

Additional details

Related works

Is cited by
10.1093/jamiaopen/ooac086 (DOI)
Is derived from
10.5281/zenodo.6595011 (DOI)