Published January 9, 2022 | Version v1
Dataset Open

Generalizable EHR-R-REDCap pipeline for a national multi-institutional rare tumor patient registry

  • 1. Harvard Medical School*
  • 2. Massachusetts General Hospital

Description

Objective: To develop a clinical informatics pipeline designed to capture large-scale structured EHR data for a national patient registry.

Materials and Methods: The EHR-R-REDCap pipeline is implemented using R-statistical software to remap and import structured EHR data into the REDCap-based multi-institutional Merkel Cell Carcinoma (MCC) Patient Registry using an adaptable data dictionary.

Results: Clinical laboratory data were extracted from EPIC Clarity across several participating institutions. Labs were transformed, remapped and imported into the MCC registry using the EHR labs abstraction (eLAB) pipeline. Forty-nine clinical tests encompassing 482,450 results were imported into the registry for 1,109 enrolled MCC patients. Data-quality assessment revealed highly accurate, valid labs. Univariate modeling was performed for labs at baseline on overall survival (N=176) using this clinical informatics pipeline.

Conclusion: We demonstrate feasibility of the facile eLAB workflow. EHR data is successfully transformed, and bulk-loaded/imported into a REDCap-based national registry to execute real-world data analysis and interoperability.

Files

DataDictionary_eLAB.csv

Files (467.2 kB)

Name Size Download all
md5:cd8b797459a9976c470909eaebe84bc6
29.5 kB Preview Download
md5:5b8c391ca251908a96bea44044cd8ab2
19.3 kB Download
md5:b3af080eff18496ae7f14afc37ad78ec
12.5 kB Download
md5:c0c6d4684c35b7ef45856484508f24d6
23.1 kB Download
md5:8d36f997ea96716c11256a577ed28846
382.9 kB Download

Additional details

Related works

Is source of
10.5281/zenodo.5822605 (DOI)