» Articles » PMID: 37767217

A Multimodal Dialog Approach to Mental State Characterization in Clinically Depressed, Anxious, and Suicidal Populations

Overview
Journal Front Psychol
Date 2023 Sep 28
PMID 37767217
Authors
Affiliations
Soon will be listed here.
Abstract

Background: The rise of depression, anxiety, and suicide rates has led to increased demand for telemedicine-based mental health screening and remote patient monitoring (RPM) solutions to alleviate the burden on, and enhance the efficiency of, mental health practitioners. Multimodal dialog systems (MDS) that conduct on-demand, structured interviews offer a scalable and cost-effective solution to address this need.

Objective: This study evaluates the feasibility of a cloud based MDS agent, Tina, for mental state characterization in participants with depression, anxiety, and suicide risk.

Method: Sixty-eight participants were recruited through an online health registry and completed 73 sessions, with 15 (20.6%), 21 (28.8%), and 26 (35.6%) sessions screening positive for depression, anxiety, and suicide risk, respectively using conventional screening instruments. Participants then interacted with Tina as they completed a structured interview designed to elicit calibrated, open-ended responses regarding the participants' feelings and emotional state. Simultaneously, the platform streamed their speech and video recordings in real-time to a HIPAA-compliant cloud server, to compute speech, language, and facial movement-based biomarkers. After their sessions, participants completed user experience surveys. Machine learning models were developed using extracted features and evaluated with the area under the receiver operating characteristic curve (AUC).

Results: For both depression and suicide risk, affected individuals tended to have a higher percent pause time, while those positive for anxiety showed reduced lip movement relative to healthy controls. In terms of single-modality classification models, speech features performed best for depression (AUC = 0.64; 95% CI = 0.51-0.78), facial features for anxiety (AUC = 0.57; 95% CI = 0.43-0.71), and text features for suicide risk (AUC = 0.65; 95% CI = 0.52-0.78). Best overall performance was achieved by decision fusion of all models in identifying suicide risk (AUC = 0.76; 95% CI = 0.65-0.87). Participants reported the experience comfortable and shared their feelings.

Conclusion: MDS is a feasible, useful, effective, and interpretable solution for RPM in real-world clinical depression, anxiety, and suicidal populations. Facial information is more informative for anxiety classification, while speech and language are more discriminative of depression and suicidality markers. In general, combining speech, language, and facial information improved model performance on all classification tasks.

Citing Articles

Multimodal speech biomarkers for remote monitoring of ALS disease progression.

Neumann M, Kothare H, Ramanarayanan V Comput Biol Med. 2024; 180:108949.

PMID: 39126786 PMC: 11357899. DOI: 10.1016/j.compbiomed.2024.108949.


Multimodal Speech Biomarkers for Remote Monitoring of ALS Disease Progression.

Neumann M, Kothare H, Ramanarayanan V medRxiv. 2024; .

PMID: 38978682 PMC: 11230328. DOI: 10.1101/2024.06.26.24308811.

References
1.
Posner K, Brown G, Stanley B, Brent D, Yershova K, Oquendo M . The Columbia-Suicide Severity Rating Scale: initial validity and internal consistency findings from three multisite studies with adolescents and adults. Am J Psychiatry. 2011; 168(12):1266-77. PMC: 3893686. DOI: 10.1176/appi.ajp.2011.10111704. View

2.
Mundt J, Vogel A, Feltner D, Lenderking W . Vocal acoustic biomarkers of depression severity and treatment response. Biol Psychiatry. 2012; 72(7):580-7. PMC: 3409931. DOI: 10.1016/j.biopsych.2012.03.015. View

3.
Wright-Berryman J, Cohen J, Haq A, Black D, Pease J . Virtually screening adults for depression, anxiety, and suicide risk using machine learning and language from an open-ended interview. Front Psychiatry. 2023; 14:1143175. PMC: 10291825. DOI: 10.3389/fpsyt.2023.1143175. View

4.
Murray I, Arnott J . Toward the simulation of emotion in synthetic speech: a review of the literature on human vocal emotion. J Acoust Soc Am. 1993; 93(2):1097-108. DOI: 10.1121/1.405558. View

5.
Cannizzaro M, Harel B, Reilly N, Chappell P, Snyder P . Voice acoustical measurement of the severity of major depression. Brain Cogn. 2004; 56(1):30-5. DOI: 10.1016/j.bandc.2004.05.003. View