» Articles » PMID: 38792496

Using Machine Learning to Evaluate the Value of Genetic Liabilities in the Classification of Hypertension Within the UK Biobank

Overview
Journal J Clin Med
Specialty General Medicine
Date 2024 May 25
PMID 38792496
Authors
Affiliations
Soon will be listed here.
Abstract

Hypertension increases the risk of cardiovascular diseases (CVD) such as stroke, heart attack, heart failure, and kidney disease, contributing to global disease burden and premature mortality. Previous studies have utilized statistical and machine learning techniques to develop hypertension prediction models. Only a few have included genetic liabilities and evaluated their predictive values. This study aimed to develop an effective hypertension classification model and investigate the potential influence of genetic liability for multiple risk factors linked to CVD on hypertension risk using the random forest and the neural network. The study involved 244,718 European participants, who were divided into training and testing sets. Genetic liabilities were constructed using genetic variants associated with CVD risk factors obtained from genome-wide association studies (GWAS). Various combinations of machine learning models before and after feature selection were tested to develop the best classification model. The models were evaluated using area under the curve (AUC), calibration, and net reclassification improvement in the testing set. The models without genetic liabilities achieved AUCs of 0.70 and 0.72 using the random forest and the neural network methods, respectively. Adding genetic liabilities improved the AUC for the random forest but not for the neural network. The best classification model was achieved when feature selection and classification were performed using random forest (AUC = 0.71, Spiegelhalter z score = 0.10, -value = 0.92, calibration slope = 0.99). This model included genetic liabilities for total cholesterol and low-density lipoprotein (LDL). The study highlighted that incorporating genetic liabilities for lipids in a machine learning model may provide incremental value for hypertension classification beyond baseline characteristics.

References
1.
Biau D, Kerneis S, Porcher R . Statistics in brief: the importance of sample size in the planning and interpretation of medical research. Clin Orthop Relat Res. 2008; 466(9):2282-8. PMC: 2493004. DOI: 10.1007/s11999-008-0346-9. View

2.
Ehret G . Genome-wide association studies: contribution of genomics to understanding blood pressure and essential hypertension. Curr Hypertens Rep. 2010; 12(1):17-25. PMC: 2865585. DOI: 10.1007/s11906-009-0086-6. View

3.
Giontella A, Lotta L, Overton J, Baras A, Minuz P, Melander O . Causal Effect of Adiposity Measures on Blood Pressure Traits in 2 Urban Swedish Cohorts: A Mendelian Randomization Study. J Am Heart Assoc. 2021; 10(13):e020405. PMC: 8403279. DOI: 10.1161/JAHA.120.020405. View

4.
Rajula H, Verlato G, Manchia M, Antonucci N, Fanos V . Comparison of Conventional Statistical Methods with Machine Learning in Medicine: Diagnosis, Drug Development, and Treatment. Medicina (Kaunas). 2020; 56(9). PMC: 7560135. DOI: 10.3390/medicina56090455. View

5.
Winkler T, Justice A, Graff M, Barata L, Feitosa M, Chu S . Correction: The Influence of Age and Sex on Genetic Associations with Adult Body Size and Shape: A Large-Scale Genome-Wide Interaction Study. PLoS Genet. 2016; 12(6):e1006166. PMC: 4927064. DOI: 10.1371/journal.pgen.1006166. View