A predictive analysis framework of heart disease using machine learning approaches
Creators
- 1. Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh
- 2. Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh
- 3. Department of Aeronautical Engineering, Military Institute of Science &Technology, Dhaka, Bangladesh
Description
Heart disease is among the leading causes for death globally. Thus, early identification and treatment are indispensable to prevent the disease. In this work, we propose a framework based on machine learning algorithms to tackle such problems through the identification of risk variables associated to this disease. To ensure the success of our proposed model, influential data pre-processing and data transformation strategies are used to generate accurate data for the training model that utilizes the five most popular datasets (Hungarian, Stat log, Switzerland, Long Beach VA, and Cleveland) from UCI. The univariate feature selection technique is applied to identify essential features and during the training phase, classifiers, namely extreme gradient boosting (XGBoost), support vector machine (SVM), random forest (RF), gradient boosting (GB), and decision tree (DT), are deployed. Subsequently, various performance evaluations are measured to demonstrate accurate predictions using the introduced algorithms. The inclusion of Univariate results indicated that the DT classifier achieves a comparatively higher accuracy of around 97.75% than others. Thus, a machine learning approach is recognize, that can predict heart disease with high accuracy. Furthermore, the 10 attributes chosen are used to analyze the model's outcomes explainability, indicating which attributes are more significant in the model's outcome.
Files
36-3942.pdf
Files
(658.8 kB)
Name | Size | Download all |
---|---|---|
md5:1da94e983cac17373f72001229905ec1
|
658.8 kB | Preview Download |