» Articles » PMID: 37350929

TRESTLE: Toolkit for Reproducible Execution of Speech, Text and Language Experiments

Overview
Specialty Biology
Date 2023 Jun 23
PMID 37350929
Authors
Affiliations
Soon will be listed here.
Abstract

The evidence is growing that machine and deep learning methods can learn the subtle differences between the language produced by people with various forms of cognitive impairment such as dementia and cognitively healthy individuals. Valuable public data repositories such as TalkBank have made it possible for researchers in the computational community to join forces and learn from each other to make significant advances in this area. However, due to variability in approaches and data selection strategies used by various researchers, results obtained by different groups have been difficult to compare directly. In this paper, we present TRESTLE (oolkit for eproducible xecution of peech ext and anguage xperiments), an open source platform that focuses on two datasets from the TalkBank repository with dementia detection as an illustrative domain. Successfully deployed in the hackallenge (Hackathon/Challenge) of the International Workshop on Health Intelligence at AAAI 2022, TRESTLE provides a precise digital blueprint of the data pre-processing and selection strategies that can be reused via TRESTLE by other researchers seeking comparable results with their peers and current state-of-the-art (SOTA) approaches.

Citing Articles

A curious case of retrogenesis in language: Automated analysis of language patterns observed in dementia patients and young children.

Li C, Solinsky J, Cohen T, Pakhomov S Neurosci Inform. 2024; 4(1).

PMID: 38433986 PMC: 10907010. DOI: 10.1016/j.neuri.2023.100155.


Useful blunders: Can automated speech recognition errors improve downstream dementia classification?.

Li C, Xu W, Cohen T, Pakhomov S J Biomed Inform. 2024; 150:104598.

PMID: 38253228 PMC: 10922372. DOI: 10.1016/j.jbi.2024.104598.

References
1.
Petti U, Baker S, Korhonen A . A systematic literature review of automatic Alzheimer's disease detection from speech and language. J Am Med Inform Assoc. 2020; 27(11):1784-1797. PMC: 7671617. DOI: 10.1093/jamia/ocaa174. View

2.
Digan W, Neveol A, Neuraz A, Wack M, Baudoin D, Burgun A . Can reproducibility be improved in clinical natural language processing? A study of 7 clinical NLP suites. J Am Med Inform Assoc. 2020; 28(3):504-515. PMC: 7936396. DOI: 10.1093/jamia/ocaa261. View

3.
Herd P, Carr D, Roan C . Cohort profile: Wisconsin longitudinal study (WLS). Int J Epidemiol. 2014; 43(1):34-41. PMC: 3937969. DOI: 10.1093/ije/dys194. View

4.
Kapoor S, Narayanan A . Leakage and the reproducibility crisis in machine-learning-based science. Patterns (N Y). 2023; 4(9):100804. PMC: 10499856. DOI: 10.1016/j.patter.2023.100804. View

5.
Canning S, Leach L, Stuss D, Ngo L, Black S . Diagnostic utility of abbreviated fluency measures in Alzheimer disease and vascular dementia. Neurology. 2004; 62(4):556-62. DOI: 10.1212/wnl.62.4.556. View