Published March 11, 2020 | Version v1
Journal article Open

INVESTIGATING THE SYNTHETIC MINORITY CLASS OVERSAMPLING TECHNIQUE (SMOTE) ON AN IMBALANCED CARDIOVASCULAR DISEASE (CVD) DATASET

  • 1. University of Patras

Description

 In this work, we employ the Synthetic Minority
Oversampling Technique (SMOTE) to generate instances
of the minority class in an imbalanced Coronary Artery
Disease dataset. We firstly analyze the public dataset Z –
Alizadeh sani, a dataset used for non-invasive prediction of
CAD. We perform feature selection to exclude attributes
unrelated to Coronary Artery Disease risk. The generation
of new samples is performed using SMOTE, a technique
commonly employed in machine learning tasks. We design
Artificial Neural Networks, Decision Trees, and Support
Vector Machines to classify both the original dataset and
the augmented. The results demonstrate that data
augmentation may be beneficial in specific cases, but it is
not a panacea, and its application in a specific dataset
should be carefully examined. 

Files

431-434,Tesma409,IJEAST.pdf

Files (362.5 kB)

Name Size Download all
md5:0715c7bb9ad8247cdf8deb2a2b720076
362.5 kB Preview Download