Predicting Satisfaction With Chat-Counseling at a 24/7 Chat Hotline for the Youth: Natural Language Processing Study

Overview

Journal JMIR AI

Publisher JMIR Publications

Specialty Biomedical Engineering

Date 2025 Feb 18

PMID 39965198

Authors

Silvan Hornstein

Ulrike Lueken

Richard Wundrack

Kevin Hilbert

Affiliations

Soon will be listed here.

Abstract

Background: Chat-based counseling services are popular for the low-threshold provision of mental health support to youth. In addition, they are particularly suitable for the utilization of natural language processing (NLP) for improved provision of care.

Objective: Consequently, this paper evaluates the feasibility of such a use case, namely, the NLP-based automated evaluation of satisfaction with the chat interaction. This preregistered approach could be used for evaluation and quality control procedures, as it is particularly relevant for those services.

Methods: The consultations of 2609 young chatters (around 140,000 messages) and corresponding feedback were used to train and evaluate classifiers to predict whether a chat was perceived as helpful or not. On the one hand, we trained a word vectorizer in combination with an extreme gradient boosting (XGBoost) classifier, applying cross-validation and extensive hyperparameter tuning. On the other hand, we trained several transformer-based models, comparing model types, preprocessing, and over- and undersampling techniques. For both model types, we selected the best-performing approach on the training set for a final performance evaluation on the 522 users in the final test set.

Results: The fine-tuned XGBoost classifier achieved an area under the receiver operating characteristic score of 0.69 (P<.001), as well as a Matthews correlation coefficient of 0.25 on the previously unseen test set. The selected Longformer-based model did not outperform this baseline, scoring 0.68 (P=.69). A Shapley additive explanations explainability approach suggested that help seekers rating a consultation as helpful commonly expressed their satisfaction already within the conversation. In contrast, the rejection of offered exercises predicted perceived unhelpfulness.

Conclusions: Chat conversations include relevant information regarding the perceived quality of an interaction that can be used by NLP-based prediction approaches. However, to determine if the moderate predictive performance translates into meaningful service improvements requires randomized trials. Further, our results highlight the relevance of contrasting pretrained models with simpler baselines to avoid the implementation of unnecessarily complex models.

Trial Registration: Open Science Framework SR4Q9; https://osf.io/sr4q9.

References

Chicco D, Totsch N, Jurman G . The Matthews correlation coefficient (MCC) is more reliable than balanced accuracy, bookmaker informedness, and markedness in two-class confusion matrix evaluation. BioData Min. 2021; 14(1):13. PMC: 7863449. DOI: 10.1186/s13040-021-00244-z. View

Xu Z, Xu Y, Cheung F, Cheng M, Lung D, Law Y . Detecting suicide risk using knowledge-aware natural language processing and counseling service data. Soc Sci Med. 2021; 283:114176. DOI: 10.1016/j.socscimed.2021.114176. View

Eckert M, Efe Z, Guenthner L, Baldofski S, Kuehne K, Wundrack R . Acceptability and feasibility of a messenger-based psychological chat counselling service for children and young adults ("krisenchat"): A cross-sectional study. Internet Interv. 2022; 27:100508. PMC: 8857586. DOI: 10.1016/j.invent.2022.100508. View

Dwyer D, Falkai P, Koutsouleris N . Machine Learning Approaches for Clinical Psychology and Psychiatry. Annu Rev Clin Psychol. 2018; 14:91-118. DOI: 10.1146/annurev-clinpsy-032816-045037. View

DeLong E, Delong D, Clarke-Pearson D . Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988; 44(3):837-45. View

Thompson L, Sugg M, Runkle J . Adolescents in crisis: A geographic exploration of help-seeking behavior using data from Crisis Text Line. Soc Sci Med. 2018; 215:69-79. DOI: 10.1016/j.socscimed.2018.08.025. View

Kohls E, Guenthner L, Baldofski S, Eckert M, Efe Z, Kuehne K . Suicidal Ideation Among Children and Young Adults in a 24/7 Messenger-Based Psychological Chat Counseling Service. Front Psychiatry. 2022; 13:862298. PMC: 8995430. DOI: 10.3389/fpsyt.2022.862298. View

Mathieu S, Uddin R, Brady M, Batchelor S, Ross V, Spence S . Systematic Review: The State of Research Into Youth Helplines. J Am Acad Child Adolesc Psychiatry. 2020; 60(10):1190-1233. DOI: 10.1016/j.jaac.2020.12.028. View

McGorry P, Mei C . Early intervention in youth mental health: progress and future directions. Evid Based Ment Health. 2018; 21(4):182-184. PMC: 10270418. DOI: 10.1136/ebmental-2018-300060. View

10.

Kessler R, Angermeyer M, Anthony J, de Graaf R, Demyttenaere K, Gasquet I . Lifetime prevalence and age-of-onset distributions of mental disorders in the World Health Organization's World Mental Health Survey Initiative. World Psychiatry. 2008; 6(3):168-76. PMC: 2174588. View

11.

de Winter A, Oldehinkel A, Veenstra R, Brunnekreef J, Verhulst F, Ormel J . Evaluation of non-response bias in mental health determinants and outcomes in a large sample of pre-adolescents. Eur J Epidemiol. 2005; 20(2):173-81. DOI: 10.1007/s10654-004-4948-6. View

12.

Christensen M, Lim C, Saha S, Plana-Ripoll O, Cannon D, Presley F . The cost of mental disorders: a systematic review. Epidemiol Psychiatr Sci. 2020; 29:e161. PMC: 7443800. DOI: 10.1017/S204579602000075X. View

13.

Chekroud A, Hawrilenko M, Loho H, Bondar J, Gueorguieva R, Hasan A . Illusory generalizability of clinical prediction models. Science. 2024; 383(6679):164-167. DOI: 10.1126/science.adg8538. View

14.

McGorry P, Mei C, Chanen A, Hodges C, Alvarez-Jimenez M, Killackey E . Designing and scaling up integrated youth mental health care. World Psychiatry. 2022; 21(1):61-76. PMC: 8751571. DOI: 10.1002/wps.20938. View

15.

Klement W, El Emam K . Consolidated Reporting Guidelines for Prognostic and Diagnostic Machine Learning Modeling Studies: Development and Validation. J Med Internet Res. 2023; 25:e48763. PMC: 10502599. DOI: 10.2196/48763. View

16.

Broadbent M, Medina Grespan M, Axford K, Zhang X, Srikumar V, Kious B . A machine learning approach to identifying suicide risk among text-based crisis counseling encounters. Front Psychiatry. 2023; 14:1110527. PMC: 10076638. DOI: 10.3389/fpsyt.2023.1110527. View

17.

Tibbs M, OReilly A, OReilly M, Fitzgerald A . Online synchronous chat counselling for young people aged 12-25: a mixed methods systematic review protocol. BMJ Open. 2022; 12(4):e061084. PMC: 9039377. DOI: 10.1136/bmjopen-2022-061084. View

18.

Swaminathan A, Lopez I, Mar R, Heist T, McClintock T, Caoili K . Natural language processing system for rapid detection and intervention of mental health crisis chat messages. NPJ Digit Med. 2023; 6(1):213. PMC: 10663535. DOI: 10.1038/s41746-023-00951-3. View

19.

Wang S, Dang Y, Sun Z, Ding Y, Pathak J, Tao C . An NLP approach to identify SDoH-related circumstance and suicide crisis from death investigation narratives. J Am Med Inform Assoc. 2023; 30(8):1408-1417. PMC: 10354765. DOI: 10.1093/jamia/ocad068. View

20.

Colizzi M, Lasalvia A, Ruggeri M . Prevention and early intervention in youth mental health: is it time for a multidisciplinary and trans-diagnostic model for care?. Int J Ment Health Syst. 2020; 14:23. PMC: 7092613. DOI: 10.1186/s13033-020-00356-9. View