INVESTIGATING THE SYNTHETIC MINORITY CLASS OVERSAMPLING TECHNIQUE (SMOTE) ON AN IMBALANCED CARDIOVASCULAR DISEASE (CVD) DATASET
Description
In this work, we employ the Synthetic Minority
Oversampling Technique (SMOTE) to generate instances
of the minority class in an imbalanced Coronary Artery
Disease dataset. We firstly analyze the public dataset Z –
Alizadeh sani, a dataset used for non-invasive prediction of
CAD. We perform feature selection to exclude attributes
unrelated to Coronary Artery Disease risk. The generation
of new samples is performed using SMOTE, a technique
commonly employed in machine learning tasks. We design
Artificial Neural Networks, Decision Trees, and Support
Vector Machines to classify both the original dataset and
the augmented. The results demonstrate that data
augmentation may be beneficial in specific cases, but it is
not a panacea, and its application in a specific dataset
should be carefully examined.
Files
431-434,Tesma409,IJEAST.pdf
Files
(362.5 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:0715c7bb9ad8247cdf8deb2a2b720076
|
362.5 kB | Preview Download |