» Articles » PMID: 39397946

Early Heart Disease Prediction Using Feature Engineering and Machine Learning Algorithms

Overview
Journal Heliyon
Specialty Social Sciences
Date 2024 Oct 14
PMID 39397946
Authors
Affiliations
Soon will be listed here.
Abstract

Heart disease is one of the most widespread global health issues, it is the reason behind around 32 % of deaths worldwide every year. The early prediction and diagnosis of heart diseases are critical for effective treatment and sickness management. Despite the efforts of healthcare professionals, cardiovascular surgeons and cardiologists' misdiagnosis and misinterpretation of test results may happen every day. This study addresses the growing global health challenge raised by Cardiovascular Diseases (CVDs), which account for 32 % of all deaths worldwide, according to the World Health Organization (WHO). With the progress of Machine Learning (ML) and Deep Learning (DL) techniques as part of Artificial Intelligence (AI), these technologies have become crucial for predicting and diagnosing CVDs. This research aims to develop an ML system for the early prediction of cardiovascular diseases by choosing one of the powerful existing ML algorithms after a deep comparative analysis of several. To achieve this work, the Cleveland and Statlog heart datasets from international platforms are used in this study to evaluate and validate the system's performance. The Cleveland dataset is categorized and used to train various ML algorithms, including decision tree, random forest, support vector machine, logistic regression, adaptive boosting, and K-nearest neighbors. The performance of each algorithm is assessed based on accuracy, precision, recall, F1 score, and the Area Under the Curve metrics. Hyperparameter tuning approaches have been employed to find the best hyperparameters that reflect the optimal performance of the used algorithms based on different evaluation approaches including 10-fold cross-validation with a 95 % confidence interval. The study's findings highlight the potential of ML in improving the early prediction and diagnosis of cardiovascular diseases. By comparing and analyzing the performance of the applied algorithms on both the Cleveland and Statlog heart datasets, this research contributes to the advancement of ML techniques in the medical field. The developed ML system offers a valuable tool for healthcare professionals in the early prediction and diagnosis of cardiovascular diseases, with implications for the prediction and diagnosis of other diseases as well.

Citing Articles

Development and validation of a machine learning approach for screening new leprosy cases based on the leprosy suspicion questionnaire.

Mendonca Ramos Simoes M, Rocha Lima F, Barbosa Lugao H, de Paula N, Lincoln Silva C, Ramos A Sci Rep. 2025; 15(1):6912.

PMID: 40011614 PMC: 11865526. DOI: 10.1038/s41598-025-91462-6.

References
1.
Schober P, Vetter T . Logistic Regression in Medical Research. Anesth Analg. 2021; 132(2):365-366. PMC: 7785709. DOI: 10.1213/ANE.0000000000005247. View

2.
Bjurgert J, Valenzuela P, Rojas C . On Adaptive Boosting for System Identification. IEEE Trans Neural Netw Learn Syst. 2017; 29(9):4510-4514. DOI: 10.1109/TNNLS.2017.2754319. View

3.
Islam M, Raiyan K, Mitra S, Mannan M, Tasnim T, Putul A . Predictis: an IoT and machine learning-based system to predict risk level of cardio-vascular diseases. BMC Health Serv Res. 2023; 23(1):171. PMC: 9940443. DOI: 10.1186/s12913-023-09104-4. View

4.
Zobair K, Houghton L, Tjondronegoro D, Sanzogni L, Islam M, Sarker T . Systematic review of Internet of medical things for cardiovascular disease prevention among Australian first nations. Heliyon. 2023; 9(11):e22420. PMC: 10700651. DOI: 10.1016/j.heliyon.2023.e22420. View

5.
Agnese M, Toia P, Sollami G, Militello C, Rundo L, Vitabile S . Epicardial and thoracic subcutaneous fat texture analysis in patients undergoing cardiac CT. Heliyon. 2023; 9(5):e15984. PMC: 10196784. DOI: 10.1016/j.heliyon.2023.e15984. View