» Articles » PMID: 34966445

Score and Correlation Coefficient-Based Feature Selection for Predicting Heart Failure Diagnosis by Using Machine Learning Algorithms

Overview
Publisher Hindawi
Date 2021 Dec 30
PMID 34966445
Citations 21
Authors
Affiliations
Soon will be listed here.
Abstract

Cardiovascular disease (CVD) is one of the most common causes of death that kills approximately 17 million people annually. The main reasons behind CVD are myocardial infarction and the failure of the heart to pump blood normally. Doctors could diagnose heart failure (HF) through electronic medical records on the basis of patient's symptoms and clinical laboratory investigations. However, accurate diagnosis of HF requires medical resources and expert practitioners that are not always available, thus making the diagnosing challengeable. Therefore, predicting the patients' condition by using machine learning algorithms is a necessity to save time and efforts. This paper proposed a machine-learning-based approach that distinguishes the most important correlated features amongst patients' electronic clinical records. The SelectKBest function was applied with chi-squared statistical method to determine the most important features, and then feature engineering method has been applied to create new features correlated strongly in order to train machine learning models and obtain promising results. Optimised hyperparameter classification algorithms SVM, KNN, Decision Tree, Random Forest, and Logistic Regression were used to train two different datasets. The first dataset, called Cleveland, consisted of 303 records. The second dataset, which was used for predicting HF, consisted of 299 records. Experimental results showed that the Random Forest algorithm achieved accuracy, precision, recall, and F1 scores of 95%, 97.62%, 95.35%, and 96.47%, respectively, during the test phase for the second dataset. The same algorithm achieved accuracy scores of 100% for the first dataset and 97.68% for the second dataset, while 100% precision, recall, and F1 scores were reached for both datasets.

Citing Articles

Chaotic gradient based optimization with fuzzy temporal optimized CNN for heart failure prediction.

Kumar G, Muthurajkumar S Sci Rep. 2025; 15(1):3867.

PMID: 39890898 PMC: 11785986. DOI: 10.1038/s41598-025-88277-w.


Predictive Analytics in Heart Failure Risk, Readmission, and Mortality Prediction: A Review.

Hidayaturrohman Q, Hanada E Cureus. 2024; 16(11):e73876.

PMID: 39697926 PMC: 11652958. DOI: 10.7759/cureus.73876.


Cardiovascular risk factors and development of nomograms in an Italian cohort of patients with suspected coronary artery disease undergoing SPECT or PET stress myocardial perfusion imaging.

Megna R, Petretta M, Nappi C, Assante R, Zampella E, Gaudieri V Front Nucl Med. 2024; 4:1232135.

PMID: 39355219 PMC: 11440955. DOI: 10.3389/fnume.2024.1232135.


Classifying breast cancer subtypes on multi-omics data via sparse canonical correlation analysis and deep learning.

Huang Y, Zeng P, Zhong C BMC Bioinformatics. 2024; 25(1):132.

PMID: 38539064 PMC: 10967155. DOI: 10.1186/s12859-024-05749-y.


Feature selection and association rule learning identify risk factors of malnutrition among Ethiopian schoolchildren.

Russel W, Perry J, Bonzani C, Dontino A, Mekonnen Z, Ay A Front Epidemiol. 2024; 3:1150619.

PMID: 38455884 PMC: 10910994. DOI: 10.3389/fepid.2023.1150619.


References
1.
Chapman B, DeVore A, Mentz R, Metra M . Clinical profiles in acute heart failure: an urgent need for a new approach. ESC Heart Fail. 2019; 6(3):464-474. PMC: 6487835. DOI: 10.1002/ehf2.12439. View

2.
Gallagher J, McCormack D, Zhou S, Ryan F, Watson C, McDonald K . A systematic review of clinical prediction rules for the diagnosis of chronic heart failure. ESC Heart Fail. 2019; 6(3):499-508. PMC: 6487728. DOI: 10.1002/ehf2.12426. View

3.
Tan J, Hagiwara Y, Pang W, Lim I, Oh S, Adam M . Application of stacked convolutional and long short-term memory network for accurate identification of CAD ECG signals. Comput Biol Med. 2018; 94:19-26. DOI: 10.1016/j.compbiomed.2017.12.023. View

4.
Honda S, Kataoka Y, Kanaya T, Noguchi T, Ogawa H, Yasuda S . Characterization of coronary atherosclerosis by intravascular imaging modalities. Cardiovasc Diagn Ther. 2016; 6(4):368-81. PMC: 4960070. DOI: 10.21037/cdt.2015.12.05. View

5.
Lewis G, Schelbert E, Williams S, Cunnington C, Ahmed F, McDonagh T . Biological Phenotypes of Heart Failure With Preserved Ejection Fraction. J Am Coll Cardiol. 2017; 70(17):2186-2200. DOI: 10.1016/j.jacc.2017.09.006. View