» Articles » PMID: 38817450

Machine Learning Data Sources in Pediatric Sleep Research: Assessing Racial/ethnic Differences in Electronic Health Record-based Clinical Notes Prior to Model Training

Overview
Journal Front Sleep
Date 2024 May 31
PMID 38817450
Authors
Affiliations
Soon will be listed here.
Abstract

Introduction: Pediatric sleep problems can be detected across racial/ethnic subpopulations in primary care settings. However, the electronic health record (EHR) data documentation that describes patients' sleep problems may be inherently biased due to both historical biases and informed presence. This study assessed racial/ethnic differences in natural language processing (NLP) training data (e.g., pediatric sleep-related keywords in primary care clinical notes) prior to model training.

Methods: We used a predefined keyword features set containing 178 Peds B-SATED keywords. We then queried all the clinical notes from patients seen in pediatric primary care between the ages of 5 and 18 from January 2018 to December 2021. A least absolute shrinkage and selection operator (LASSO) regression model was used to investigate whether there were racial/ethnic differences in the documentation of Peds B-SATED keywords. Then, mixed-effects logistic regression was used to determine whether the odds of the presence of global Peds B-SATED dimensions also differed across racial/ethnic subpopulations.

Results: Using both LASSO and multilevel modeling approaches, the current study found that there were racial/ethnic differences in providers' documentation of Peds B-SATED keywords and global dimensions. In addition, the most frequently documented Peds B-SATED keyword rankings qualitatively differed across racial/ethnic subpopulations.

Conclusion: This study revealed providers' differential patterns of documenting Peds B-SATED keywords and global dimensions that may account for the under-detection of pediatric sleep problems among racial/ethnic subpopulations. In research, these findings have important implications for the equitable clinical documentation of sleep problems in pediatric primary care settings and extend prior retrospective work in pediatric sleep specialty settings.

References
1.
Reynolds A, Spaeth A, Hale L, Williamson A, LeBourgeois M, Wong S . Pediatric sleep: current knowledge, gaps, and opportunities for the future. Sleep. 2023; 46(7). PMC: 10334737. DOI: 10.1093/sleep/zsad060. View

2.
Kang J, Hanif M, Mirza E, Khan M, Malik M . Machine learning in primary care: potential to improve public health. J Med Eng Technol. 2020; 45(1):75-80. DOI: 10.1080/03091902.2020.1853839. View

3.
Harada T, Miyagami T, Kunitomo K, Shimizu T . Clinical Decision Support Systems for Diagnosis in Primary Care: A Scoping Review. Int J Environ Res Public Health. 2021; 18(16). PMC: 8391274. DOI: 10.3390/ijerph18168435. View

4.
Honaker S, Dugan T, Daftary A, Davis S, Saha C, Baye F . Unexplained Practice Variation in Primary Care Providers' Concern for Pediatric Obstructive Sleep Apnea. Acad Pediatr. 2018; 18(4):418-424. DOI: 10.1016/j.acap.2018.01.011. View

5.
Honaker S, Street A, Daftary A, Downs S . The Use of Computer Decision Support for Pediatric Obstructive Sleep Apnea Detection in Primary Care. J Clin Sleep Med. 2019; 15(3):453-462. PMC: 6411194. DOI: 10.5664/jcsm.7674. View