Published December 17, 2025 | Version v1
Publication Open

Artificial Intelligence Based Data Governance for Chinese Electronic Health Record Analysis

Authors/Creators

  • 1. Inspur USA Inc

Description

Electronic health record (EHR) analysis can yield valuable insights for improving the quality of healthcare. However, data quality problems such as missing values, inconsistencies, and errors severely hinder the building of robust machine learning models for data analysis. In this paper, we develop an artificial intelligence (AI)-based data governance methodology that predicts missing values and verifies whether existing values are correct, suggesting what they should be when they are wrong. We demonstrate the performance of this methodology through a case study of patient gender prediction and verification. Experimental results show that a convolutional neural network (CNN), a deep learning algorithm, performs very well on the test data as measured by F1-score, and that it outperforms support vector machine (SVM) models using different vector representations of documents.
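As an illustration only, the predict-or-verify idea and the F1-score metric mentioned in the description can be sketched in a few lines of Python. This is a minimal sketch, not the paper's implementation: the function names `govern_field` and `f1_score`, the `confidence` input, and the `threshold` of 0.9 are all hypothetical, and the classifier that produces the prediction (a CNN or SVM in the paper) is assumed to exist elsewhere.

```python
from typing import Optional, Sequence, Tuple


def govern_field(recorded: Optional[str], predicted: str,
                 confidence: float, threshold: float = 0.9) -> Tuple[str, str]:
    """Decide what to do with one field of one record.

    `predicted` and `confidence` are assumed to come from some trained
    classifier; `threshold` is an illustrative cut-off, not a value
    taken from the paper.
    """
    if recorded is None or recorded == "":
        # Missing value: fill it with the model's prediction.
        return predicted, "filled"
    if recorded == predicted:
        # Recorded value agrees with the model.
        return recorded, "verified"
    if confidence >= threshold:
        # Confident disagreement: propose the predicted value instead.
        return predicted, "corrected"
    # Uncertain disagreement: keep the record but mark it for review.
    return recorded, "flagged"


def f1_score(y_true: Sequence[str], y_pred: Sequence[str], positive: str) -> float:
    """F1-score for one class: harmonic mean of precision and recall."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)
```

For example, `govern_field(None, "F", 0.6)` fills the missing value, while `govern_field("M", "F", 0.5)` keeps the recorded value and flags it for review. How disagreements are actually resolved in the paper is not stated in this record, so the threshold rule here is purely an assumption.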

Files

8318ijdkp03.pdf

Files (332.6 kB)

md5:2227dc36f57a33f5decf95195c21afc5
332.6 kB

Additional details

Related works

Is published in
Publication: 10.5121/ijdkp.2018.8303 (DOI)

Dates

Available
2025-12-17