» Articles » PMID: 35476787

Machine Learning for Passive Mental Health Symptom Prediction: Generalization Across Different Longitudinal Mobile Sensing Studies

Overview
Journal PLoS One
Date 2022 Apr 27
PMID 35476787
Authors
Affiliations
Soon will be listed here.
Abstract

Mobile sensing data processed using machine learning models can passively and remotely assess mental health symptoms from the context of patients' lives. Prior work has trained models using data from single longitudinal studies, collected from demographically homogeneous populations, over short time periods, using a single data collection platform or mobile application. The generalizability of model performance across studies has not been assessed. This study presents a first analysis to understand if models trained using combined longitudinal study data to predict mental health symptoms generalize across current publicly available data. We combined data from the CrossCheck (individuals living with schizophrenia) and StudentLife (university students) studies. In addition to assessing generalizability, we explored if personalizing models to align mobile sensing data, and oversampling less-represented severe symptoms, improved model performance. Leave-one-subject-out cross-validation (LOSO-CV) results were reported. Two symptoms (sleep quality and stress) had similar question-response structures across studies and were used as outcomes to explore cross-dataset prediction. Models trained with combined data were more likely to be predictive (significant improvement over predicting training data mean) than models trained with single-study data. Expected model performance improved if the distance between training and validation feature distributions decreased using combined versus single-study data. Personalization aligned each LOSO-CV participant with training data, but only improved predicting CrossCheck stress. Oversampling significantly improved severe symptom classification sensitivity and positive predictive value, but decreased model specificity. Taken together, these results show that machine learning models trained on combined longitudinal study data may generalize across heterogeneous datasets. We encourage researchers to disseminate collected de-identified mobile sensing and mental health symptom data, and further standardize data types collected across studies to enable better assessment of model generalizability.

Citing Articles

A systematic review of passive data for remote monitoring in psychosis and schizophrenia.

Bladon S, Eisner E, Bucci S, Oluwatayo A, Martin G, Sperrin M NPJ Digit Med. 2025; 8(1):62.

PMID: 39870797 PMC: 11772847. DOI: 10.1038/s41746-025-01451-2.


Beyond Detection: Towards Actionable Sensing Research in Clinical Mental Healthcare.

Adler D, Yang Y, Viranda T, Xu X, Mohr D, VAN Meter A Proc ACM Interact Mob Wearable Ubiquitous Technol. 2024; 8(4).

PMID: 39639863 PMC: 11620792. DOI: 10.1145/3699755.


Design Guidelines for Improving Mobile Sensing Data Collection: Prospective Mixed Methods Study.

Slade C, Benzo R, Washington P J Med Internet Res. 2024; 26:e55694.

PMID: 39556828 PMC: 11632896. DOI: 10.2196/55694.


Passive sensing data predicts stress in university students: a supervised machine learning method for digital phenotyping.

Shvetcov A, Funke Kupper J, Zheng W, Slade A, Han J, Whitton A Front Psychiatry. 2024; 15:1422027.

PMID: 39252756 PMC: 11381371. DOI: 10.3389/fpsyt.2024.1422027.


Capturing the College Experience: A Four-Year Mobile Sensing Study of Mental Health, Resilience and Behavior of College Students during the Pandemic.

Nepal S, Liu W, Pillai A, Wang W, Vojdanovski V, Huckins J Proc ACM Interact Mob Wearable Ubiquitous Technol. 2024; 8(1).

PMID: 39086982 PMC: 11290409. DOI: 10.1145/3643501.


References
1.
Tseng V, Sano A, Ben-Zeev D, Brian R, Campbell A, Hauser M . Using behavioral rhythms and multi-task learning to predict fine-grained symptoms of schizophrenia. Sci Rep. 2020; 10(1):15100. PMC: 7492221. DOI: 10.1038/s41598-020-71689-1. View

2.
Birnbaum M, Ernala S, Rizvi A, Arenare E, R Van Meter A, De Choudhury M . Detecting relapse in youth with psychotic disorders utilizing patient-generated and patient-contributed digital data from Facebook. NPJ Schizophr. 2019; 5(1):17. PMC: 6779748. DOI: 10.1038/s41537-019-0085-9. View

3.
Adler D, Ben-Zeev D, Tseng V, Kane J, Brian R, Campbell A . Predicting Early Warning Signs of Psychotic Relapse From Passive Sensing Data: An Approach Using Encoder-Decoder Neural Networks. JMIR Mhealth Uhealth. 2020; 8(8):e19962. PMC: 7490673. DOI: 10.2196/19962. View

4.
Wiens J, Guttag J, Horvitz E . A study in transfer learning: leveraging data from multiple hospitals to enhance hospital-specific predictions. J Am Med Inform Assoc. 2014; 21(4):699-706. PMC: 4078276. DOI: 10.1136/amiajnl-2013-002162. View

5.
Rosner B, Glynn R, Lee M . The Wilcoxon signed rank test for paired comparisons of clustered data. Biometrics. 2006; 62(1):185-92. DOI: 10.1111/j.1541-0420.2005.00389.x. View