Ontologizing Health Systems Data at Scale: Making Translational Discovery a Reality
Creators
- Callahan, Tiffany J1
- Stefanski, Adrianne L2
- Wyrwa, Jordan M2
- Zeng, Chenjie3
- Ostropolets, Anna4
- Banda, Juan M5
- Baumgartner Jr, William A2
- Boyce, Richard D6
- Casiraghi, Elena7
- Coleman, B8
- Collins, Janine H9
- Deakyne Davies, Sara J10
- Feinstein, James A11
- Haendel, Melissa A2
- Lin, Asiyah Y3
- Martin, Blake12
- Matentzoglu, Nico13
- Meeker, Daniella14
- Reese, Justin15
- Sinclair, Jessica16
- Taneja, Sanya B17
- Trinkley, Katy E18
- Vasilevsky, Nicole A2
- Williams, Andrew19
- Zhang, Xingman A20
- Robinson, Peter N8
- Ryan, Patrick B21
- Hripcsak, George4
- Bennett, Tellen D12
- Hunter, Lawrence E12
- Kahn, Michael G12
- 1. University of Colorado Anschutz Medical Campus; Columbia University Irving Medical Center
- 2. University of Colorado Anschutz Medical Campus
- 3. National Institutes of Health
- 4. Columbia University Irving Medical Center
- 5. Georgia State University
- 6. University of Pittsburgh School of Medicine
- 7. Università degli Studi di Milano
- 8. The Jackson Laboratory for Genomic Medicine
- 9. University of Cambridge
- 10. Children's Hospital Colorado
- 11. University of Colorado Anschutz School of Medicine
- 12. University of Colorado School of Medicine
- 13. Semanticly
- 14. University of Southern California
- 15. Lawrence Berkeley National Laboratory
- 16. HealthLinc
- 17. University of Pittsburgh
- 18. University of Colorado Anschutz Skaggs School of Pharmacy and Pharmaceutical Sciences and School of Medicine
- 19. Tufts University
- 20. Sema4
- 21. Janssen Research and Development
Description
Common data models solve many challenges of standardizing electronic health record (EHR) data, but are unable to semantically integrate the resources needed for deep phenotyping. Open Biological and Biomedical Ontology (OBO) Foundry ontologies provide semantically computable representations of biological knowledge and enable the integration of a variety of biomedical data. However, mapping EHR data to OBO Foundry ontologies requires significant manual curation and domain expertise. We introduce a framework for mapping Observational Medical Outcomes Partnership (OMOP) standard vocabularies to OBO Foundry ontologies. Using this framework, we produced mappings for 92,367 conditions, 8,615 drug ingredients, and 10,673 measurement results. Mapping accuracy was verified by domain experts and when examined across 24 hospitals, the mappings covered 99% of conditions and drug ingredients and 68% of measurements. Finally, we demonstrate that OMOP2OBO mappings can aid in the systematic identification of undiagnosed rare disease patients who might benefit from genetic testing.
Notes
Files
OMOP2OBO_Manuscript_v2.pdf
Additional details
Related works
- References
- Software: https://github.com/callahantiff/OMOP2OBO (URL)