Published September 24, 2021 | Version v1
Conference paper Open

Investigating Presence of Ethnoracial Bias in Clinical Data using Machine Learning

  • 1. Ss. Cyril and Methodius University in Skopje
  • 2. Harvard-MIT Health Sciences and Technology
  • 3. Massachusetts Institute of Technology
  • 4. Fondazione Bruno Kessler

Description

An important target for machine learning research is obtaining unbiased results, which require addressing bias that might be present in the data as well as the methodology. This is of utmost importance in medical applications of machine learning, where trained models should be unbiased so as to result in systems that are widely applicable, reliable and fair. Since bias can sometimes be introduced through the data itself, in this paper we investigate the presence of ethnoracial bias in patients' clinical data. We focus primarily on vital signs and demographic information and classify patient ethnoraces in subsets of two from the three ethnoracial groups (African Americans, Caucasians, and Hispanics). Our results show that ethnorace can be identified in two out of three patients, setting the initial base for further investigation of the complex issue of ehtnoracial bias.

Files

04_ETAI21_WH.pdf

Files (519.0 kB)

Name Size Download all
md5:bc9ec1d9f5eb9ecea67560ea820e176d
519.0 kB Preview Download

Additional details

Funding

European Commission
WideHealth - Widening Research on Pervasive and eHealth - WideHealth 952279