Published July 29, 2022 | Version v2
Dataset Open

Adapting the Harmonized Data Quality Framework for Ontology Quality Assessment

Description

Ontologies play an important role in the representation, standardization, and integration of biomedical data, but are known to have data quality (DQ) issues. We aimed to understand whether the Harmonized Data Quality Framework (HDQF), developed to standardize electronic health record DQ assessment strategies, could be used to improve ontology quality assessment. A novel set of 14 ontology checks was developed. These DQ checks were aligned to the HDQF and examined by HDQF developers. The ontology checks were evaluated using 11 Open Biomedical Ontology Foundry ontologies. Of the 14 ontology checks, 12 (85.7%) were successfully aligned to at least one HDQF category. Accommodating the two unmapped DQ checks required modifying an original HDQF category and adding a new Data Dependency category. With these changes, all of the ontology checks were mapped to an HDQF category, but not all HDQF categories were represented by an ontology check, presenting opportunities to strategically develop new ontology checks. The HDQF is a valuable resource, and this work demonstrates its ability to categorize ontology quality assessment strategies.

Files

Callahan_2022ISMB_Poster.pdf

Files (28.5 MB)

Checksum Size
md5:25b69d26b53aa91993ed843c173698ee 220.0 kB
md5:b09db2dd359fa089c7a5e41251182677 185.3 kB
md5:63e67c8b58519fb3255f1fc0328abfa2 13.9 MB
md5:1070f1fd5102657beffc4ff8ac2093df 12.7 MB
md5:887853c62cba7898dc576cd902593aab 31.3 kB
md5:59c3a347cc1a91314280f2eb0905c028 33.1 kB
md5:aaf3cf23757300ef8edebc68eeafc02a 1.4 MB
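
The MD5 checksums listed above can be used to confirm that a downloaded file arrived intact. The following is a minimal sketch, not part of the original record: the helper names `md5_of_file` and `matches_record` are hypothetical, and it assumes the file has already been saved locally and that the checksum is copied from the listing in its `md5:<hex>` form.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_record(path, listed_checksum):
    """Compare a local file against a checksum written as 'md5:<hex>' or plain hex."""
    expected = listed_checksum.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

For example, `matches_record(local_path, "md5:25b69d26b53aa91993ed843c173698ee")` would return `True` only if `local_path` (a placeholder for wherever the file was saved) holds the file that this checksum belongs to. MD5 is adequate here for detecting transfer corruption, but it should not be relied on as protection against deliberate tampering.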