» Articles » PMID: 39776620

Development and Validation of Computable Social Phenotypes for Health-related Social Needs

Overview
Journal JAMIA Open
Date 2025 Jan 8
PMID 39776620
Authors
Affiliations
Soon will be listed here.
Abstract

Objective: Measurement of health-related social needs (HRSNs) is complex. We sought to develop and validate computable phenotypes (CPs) using structured electronic health record (EHR) data for food insecurity, housing instability, financial insecurity, transportation barriers, and a composite-type measure of these, using human-defined rule-based and machine learning (ML) classifier approaches.

Materials And Methods: We collected HRSN surveys as the reference standard and obtained EHR data from 1550 patients in 3 health systems from 2 states. We followed a Delphi-like approach to develop the human-defined rule-based CP. For the ML classifier approach, we trained supervised ML (XGBoost) models using 78 features. Using surveys as the reference standard, we calculated sensitivity, specificity, positive predictive values, and area under the curve (AUC). We compared AUCs using the Delong test and other performance measures using McNemar's test, and checked for differential performance.

Results: Most patients (63%) reported at least one HRSN on the reference standard survey. Human-defined rule-based CPs exhibited poor performance (AUCs=.52 to .68). ML classifier CPs performed significantly better, but still poor-to-fair (AUCs = .68 to .75). Significant differences for race/ethnicity were found for ML classifier CPs (higher AUCs for White non-Hispanic patients). Important features included number of encounters and Medicaid insurance.

Discussion: Using a supervised ML classifier approach, HRSN CPs approached thresholds of fair performance, but exhibited differential performance by race/ethnicity.

Conclusion: CPs may help to identify patients who may benefit from additional social needs screening. Future work should explore the use of area-level features via geospatial data and natural language processing to improve model performance.

References
1.
Wang M, Pantell M, Gottlieb L, Adler-Milstein J . Documentation and review of social determinants of health data in the EHR: measures and associated insights. J Am Med Inform Assoc. 2021; 28(12):2608-2616. PMC: 8633631. DOI: 10.1093/jamia/ocab194. View

2.
Chartash D, Paek H, Dziura J, Ross B, Nogee D, Boccio E . Identifying Opioid Use Disorder in the Emergency Department: Multi-System Electronic Health Record-Based Computable Phenotype Derivation and Validation Study. JMIR Med Inform. 2019; 7(4):e15794. PMC: 6913746. DOI: 10.2196/15794. View

3.
Feller D, Bear Dont Walk Iv O, Zucker J, Yin M, Gordon P, Elhadad N . Detecting Social and Behavioral Determinants of Health with Structured and Free-Text Clinical Data. Appl Clin Inform. 2020; 11(1):172-181. PMC: 7056402. DOI: 10.1055/s-0040-1702214. View

4.
Savitz S, Nyman M, Kaduk A, Loftus C, Phelan S, Barry B . Association of Patient and System-Level Factors With Social Determinants of Health Screening. Med Care. 2022; 60(9):700-708. DOI: 10.1097/MLR.0000000000001754. View

5.
Richesson R, Hammond W, Nahm M, Wixted D, Simon G, Robinson J . Electronic health records based phenotyping in next-generation clinical trials: a perspective from the NIH Health Care Systems Collaboratory. J Am Med Inform Assoc. 2013; 20(e2):e226-31. PMC: 3861929. DOI: 10.1136/amiajnl-2013-001926. View