Evaluation of feature scaling for improving the performance of supervised learning methods
Authors/Creators
- 1. Department of Computer Science, College of Engineering and Technology, Injibara University, Injibara, Ethiopia
- 2. Department of of Electrical and Electronics Engineering, St. Peter's Institute of Higher Education and Research, Avadi, India
- 3. Department of Artificial Intelligence and Data Science, R.M.K College of Engineering and Technology, Chennai, India
- 4. Department of Computer Science and Engineering, Vel Tech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology, Chennai, India
- 5. Department of Computer Science and Engineering, Velammal Engineering College, Chennai, India
Description
This article evaluates the performance of the support vector machine (SVM), decision tree (DT), and random forest (RF) on the dataset that contains the medical records of 299 patients with heart failure (HF) collected at the Faisalabad Institute of Cardiology and the Allied hospital in Pakistan. The dataset contains 13 descriptive features of physical, clinical, and lifestyle information. The study compared the performance of three classification algorithms employing pre-processing techniques such as min-max scaling, and principal component analysis (PCA). The simulation result shows that the performance of the DT, and RF decreased with dimensionality reduction while the SVM improved with dimensionality reduction. The SVM achieved 84.44%. Thus, feature scaling improves the performance of the SVM. The RF performs at 82.22%, the DT at 81.11%, and the SVM shows an improvement of 1.64% with scaled features, compared to the original dataset.
Files
63-5170.pdf
Files
(279.0 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:3d2165b708f738caec17311651829d89
|
279.0 kB | Preview Download |