
A Machine-Learning-Based Prediction Method for Hypertension Outcomes Based on Medical Data

Overview
Specialty Radiology
Date 2019 Nov 10
PMID 31703364
Citations 53
Abstract

Hypertension outcomes are the deaths or serious complications (such as myocardial infarction or stroke) that may occur in patients with hypertension. These outcomes are a major concern for patients and doctors and should be avoided where possible, yet there is no satisfactory method for predicting them. This paper therefore proposes a method for predicting outcomes from the physical examination indicators of hypertension patients. We divide outcome prediction into two steps: first, extract the key features from the patients' many physical examination indicators; second, use those key features to predict the patients' outcomes. To this end, we propose a model that combines recursive feature elimination with cross-validation and a classification algorithm. In the first step, the recursive feature elimination algorithm ranks all features by importance, and cross-validation then selects the optimal feature subset. In the second step, four classification algorithms (support vector machine (SVM), C4.5 decision tree, random forest (RF), and extreme gradient boosting (XGBoost)) predict patient outcomes from the optimal feature subset. Prediction performance is evaluated by accuracy, F1 measure, and area under the receiver operating characteristic curve (AUC). Ten-fold cross-validation shows that C4.5, RF, and XGBoost achieve very good prediction results with a small number of features, and that classifiers perform better after feature selection by recursive feature elimination with cross-validation. Among the four classifiers, XGBoost performs best: using the optimal feature subset, its accuracy, F1, and AUC are 94.36%, 0.875, and 0.927, respectively. This prediction of hypertension outcomes contributes to the in-depth study of hypertension complications and has strong practical significance.
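The two-step procedure described in the abstract (rank features by recursive elimination, then pick the subset size by cross-validation) can be illustrated with a minimal sketch. This is not the authors' implementation: the paper's four classifiers are replaced here by a small hand-written logistic regression so that the example needs only the Python standard library, the data are synthetic, and every function name (`train_logreg`, `cv_accuracy`, `rfe_ranking`, `select_features`) is hypothetical. For brevity the ranking is computed once on all rows instead of inside each fold, which a careful evaluation would avoid.

```python
import math
import random


def train_logreg(X, y, epochs=100, lr=0.1):
    """Fit logistic regression by batch gradient descent; return (weights, bias)."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            z = b + sum(wj * xj for wj, xj in zip(w, xi))
            z = max(-30.0, min(30.0, z))  # clamp to keep exp() well-behaved
            err = 1.0 / (1.0 + math.exp(-z)) - yi
            for j in range(d):
                grad_w[j] += err * xi[j]
            grad_b += err
        w = [wj - lr * g / n for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def predict(model, X):
    """Predict class 1 when the linear score is positive, else class 0."""
    w, b = model
    return [1 if b + sum(wj * xj for wj, xj in zip(w, xi)) > 0 else 0 for xi in X]


def cv_accuracy(X, y, cols, k=5):
    """Mean k-fold accuracy using only the feature columns in `cols`."""
    Xs = [[row[c] for c in cols] for row in X]
    scores = []
    for fold in range(k):
        held_out = set(range(fold, len(Xs), k))  # every k-th row is held out
        X_tr = [r for i, r in enumerate(Xs) if i not in held_out]
        y_tr = [v for i, v in enumerate(y) if i not in held_out]
        X_te = [r for i, r in enumerate(Xs) if i in held_out]
        y_te = [v for i, v in enumerate(y) if i in held_out]
        pred = predict(train_logreg(X_tr, y_tr), X_te)
        scores.append(sum(p == t for p, t in zip(pred, y_te)) / len(y_te))
    return sum(scores) / k


def rfe_ranking(X, y):
    """Step 1a: repeatedly drop the feature with the smallest |weight|.

    Returns the column indices ordered from most to least important.
    """
    remaining = list(range(len(X[0])))
    eliminated = []
    while len(remaining) > 1:
        w, _ = train_logreg([[row[c] for c in remaining] for row in X], y)
        worst = min(range(len(remaining)), key=lambda j: abs(w[j]))
        eliminated.append(remaining.pop(worst))
    return remaining + eliminated[::-1]


def select_features(X, y, k=5):
    """Step 1b: choose the top-ranked subset with the best cross-validated accuracy."""
    ranking = rfe_ranking(X, y)
    best_cols, best_score = None, -1.0
    for size in range(1, len(ranking) + 1):
        score = cv_accuracy(X, y, ranking[:size], k)
        if score > best_score:  # strict '>' keeps the smallest subset on ties
            best_cols, best_score = ranking[:size], score
    return best_cols, best_score


# Synthetic stand-in for examination indicators: column 0 carries the signal,
# columns 1-4 are pure noise.
rng = random.Random(0)
y = [i % 2 for i in range(120)]
X = [[(2.0 if label else -2.0) + rng.gauss(0, 0.5)] + [rng.gauss(0, 1) for _ in range(4)]
     for label in y]

selected, score = select_features(X, y)
print("selected columns:", selected, "cross-validated accuracy:", round(score, 3))
```

In step 2, the selected columns would be passed to the final classifier (SVM, C4.5, RF, or XGBoost in the paper) and scored with accuracy, F1, and AUC; the sketch reports accuracy only.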

Citing Articles

A Study on Prevalence and Factors Affecting Hypertension in an Iranian Population: Results from the Fasa Cohort Study.

Taheri Ghaleno S, Safari A, Homayounfar R, Farjam M, Rezaeian M, Asadi F. Med J Islam Repub Iran. 2025; 38:123.

PMID: 39968471 PMC: 11835403. DOI: 10.47176/mjiri.38.123.


Using a machine learning algorithm and clinical data to predict the risk factors of disease recurrence after adjuvant treatment of advanced-stage oral cavity cancer.

Huang S, Hsu R, Liu D, Hsu W. Tzu Chi Med J. 2025; 37(1):91-98.

PMID: 39850389 PMC: 11753515. DOI: 10.4103/tcmj.tcmj_56_24.


A machine learning tool for early identification of celiac disease autoimmunity.

Dreyfuss M, Getz B, Lebwohl B, Ramni O, Underberger D, Ber T. Sci Rep. 2024; 14(1):30760.

PMID: 39730479 PMC: 11681168. DOI: 10.1038/s41598-024-80817-0.


Optimizing hypertension prediction using ensemble learning approaches.

Sifat I, Kibria M. PLoS One. 2024; 19(12):e0315865.

PMID: 39715219 PMC: 11666061. DOI: 10.1371/journal.pone.0315865.


Prevalence of childhood hypertension and associated factors in Zhejiang Province: a cross-sectional analysis based on random forest model and logistic regression.

Zhou J, Sun W, Zhang C, Hou L, Luo Z, Jiang D. BMC Public Health. 2024; 24(1):2101.

PMID: 39097727 PMC: 11298091. DOI: 10.1186/s12889-024-19630-3.

