Published May 31, 2018 | Version v1
Journal article Open

Artificial Intelligence Based Data Governance for Chinese Electronic Health Record Analysis

Authors/Creators

  • 1. Inspur USA Inc 2010 156th Ave NE Bellevue, WA 98052

Description

ABSTRACT

Electronic health record (EHR) analysis can leverage great insights to improve the quality of human healthcare. However, the low data quality problems of missing values, inconsistency, and errors in the data setseverely hinder buildingrobust machine learning models for data analysis. In this paper, we develop a methodology ofartificial intelligence (AI)-based data governance to predict the missing values or verify if the existing values are correct and what they should be when they are wrong. We demonstrate the performance of this methodology through a case study ofpatient gender prediction and verification. Experimental resultsshow that the deep learning algorithm of convolutional neural network (CNN) works very wellaccording to the testing performance measured by the quantitative metric of F1-Score, and it outperformsthe support vector machine (SVM) models with different vector representations for documents.

KEYWORDS

EHR Analysis, Data Governance, Vector Space Model, Word Embeddings, Machine Learning, Convolutional Neural Networks, Deep Learning.

Original Source URL: http://aircconline.com/ijdkp/V8N3/8318ijdkp03.pdf

For more details...

Files

8318ijdkp03.pdf

Files (332.6 kB)

Name Size Download all
md5:2227dc36f57a33f5decf95195c21afc5
332.6 kB Preview Download