Laryngeal Pressure Estimation With a Recurrent Neural Network
Overview
Affiliations
Quantifying the physical parameters of voice production is essential for understanding the process of phonation and can aid in voice research and diagnosis. As an alternative to invasive measurements, they can be estimated by formulating an inverse problem using a numerical forward model. However, high-fidelity numerical models are often computationally too expensive for this. This paper presents a novel approach to train a long short-term memory network to estimate the subglottal pressure in the larynx at massively reduced computational cost using solely synthetic training data. We train the network on synthetic data from a numerical two-mass model and validate it on experimental data from 288 high-speed video recordings of porcine vocal folds from a previous study. The training requires significantly fewer model evaluations compared with the previous optimization approach. On the test set, we maintain a comparable performance of 21.2% versus previous 17.7% mean absolute percentage error in estimating the subglottal pressure. The evaluation of one sample requires a vanishingly small amount of computation time. The presented approach is able to maintain estimation accuracy of the subglottal pressure at significantly reduced computational cost. The methodology is likely transferable to estimate other parameters and training with other numerical models. This improvement should allow the adoption of more sophisticated, high-fidelity numerical models of the larynx. The vast speedup is a critical step to enable a future clinical application and knowledge of parameters such as the subglottal pressure will aid in diagnosis and treatment selection.
Veltrup R, Angerer S, Gessner E, Matheis F, Summerer E, Henningson J Bioengineering (Basel). 2024; 11(10).
PMID: 39451353 PMC: 11505270. DOI: 10.3390/bioengineering11100977.
Neural network-based estimation of biomechanical vocal fold parameters.
Donhauser J, Tur B, Dollinger M Front Physiol. 2024; 15:1282574.
PMID: 38449783 PMC: 10916882. DOI: 10.3389/fphys.2024.1282574.
Ghasemzadeh H, Hillman R, Mehta D J Speech Lang Hear Res. 2024; 67(3):753-781.
PMID: 38386017 PMC: 11005022. DOI: 10.1044/2023_JSLHR-23-00273.
Zhang Z J Acoust Soc Am. 2022; 151(2):1337.
PMID: 35232110 PMC: 9013286. DOI: 10.1121/10.0009616.
Ibarra E, Parra J, Alzamendi G, Cortes J, Espinoza V, Mehta D Front Physiol. 2021; 12:732244.
PMID: 34539451 PMC: 8440844. DOI: 10.3389/fphys.2021.732244.