Machine Learning-based Predictive Modeling of Depression in Hypertensive Populations
Overview
Authors
Affiliations
We aimed to develop prediction models for depression among U.S. adults with hypertension using various machine learning (ML) approaches. Moreover, we analyzed the mechanisms of the developed models. This cross-sectional study included 8,628 adults with hypertension (11.3% with depression) from the National Health and Nutrition Examination Survey (2011-2020). We selected several significant features using feature selection methods to build the models. Data imbalance was managed with random down-sampling. Six different ML classification methods implemented in the R package caret-artificial neural network, random forest, AdaBoost, stochastic gradient boosting, XGBoost, and support vector machine-were employed with 10-fold cross-validation for predictions. Model performance was assessed by examining the area under the receiver operating characteristic curve (AUC), accuracy, precision, sensitivity, specificity, and F1-score. For an interpretable algorithm, we used the variable importance evaluation function in caret. Of all classification models, artificial neural network trained with selected features (n = 30) achieved the highest AUC (0.813) and specificity (0.780) in predicting depression. Support vector machine predicted depression with the highest accuracy (0.771), precision (0.969), sensitivity (0.774), and F1-score (0.860). The most frequent and important features contributing to the models included the ratio of family income to poverty, triglyceride level, white blood cell count, age, sleep disorder status, the presence of arthritis, hemoglobin level, marital status, and education level. In conclusion, ML algorithms performed comparably in predicting depression among hypertensive populations. Furthermore, the developed models shed light on variables' relative importance, paving the way for further clinical research.
Guo J, Zhao J, Han P, Wu Y, Zheng K, Huang C Front Psychiatry. 2024; 15:1370602.
PMID: 38993388 PMC: 11236531. DOI: 10.3389/fpsyt.2024.1370602.
Machine Learning-Based Etiologic Subtyping of Ischemic Stroke Using Circulating Exosomal microRNAs.
Bang J, Kim E, Kim H, Chung J, Seo W, Kim G Int J Mol Sci. 2024; 25(12).
PMID: 38928481 PMC: 11203849. DOI: 10.3390/ijms25126761.
Application of machine learning algorithms to identify people with low bone density.
Xu R, Chen Y, Yao Z, Wu W, Cui J, Wang R Front Public Health. 2024; 12:1347219.
PMID: 38726233 PMC: 11080984. DOI: 10.3389/fpubh.2024.1347219.
Zhou Y, Zhang Z, Li Q, Mao G, Zhou Z BMC Psychol. 2024; 12(1):230.
PMID: 38659077 PMC: 11044386. DOI: 10.1186/s40359-024-01696-8.
Shi H, Yuan X, Yang X, Huang R, Fan W, Liu G BMC Genomics. 2024; 25(1):125.
PMID: 38287255 PMC: 10826017. DOI: 10.1186/s12864-024-10038-2.