» Articles » PMID: 30680252

Laryngeal Pressure Estimation With a Recurrent Neural Network

Overview
Date 2019 Jan 26
PMID 30680252
Citations 11
Authors
Affiliations
Soon will be listed here.
Abstract

Quantifying the physical parameters of voice production is essential for understanding the process of phonation and can aid in voice research and diagnosis. As an alternative to invasive measurements, they can be estimated by formulating an inverse problem using a numerical forward model. However, high-fidelity numerical models are often computationally too expensive for this. This paper presents a novel approach to train a long short-term memory network to estimate the subglottal pressure in the larynx at massively reduced computational cost using solely synthetic training data. We train the network on synthetic data from a numerical two-mass model and validate it on experimental data from 288 high-speed video recordings of porcine vocal folds from a previous study. The training requires significantly fewer model evaluations compared with the previous optimization approach. On the test set, we maintain a comparable performance of 21.2% versus previous 17.7% mean absolute percentage error in estimating the subglottal pressure. The evaluation of one sample requires a vanishingly small amount of computation time. The presented approach is able to maintain estimation accuracy of the subglottal pressure at significantly reduced computational cost. The methodology is likely transferable to estimate other parameters and training with other numerical models. This improvement should allow the adoption of more sophisticated, high-fidelity numerical models of the larynx. The vast speedup is a critical step to enable a future clinical application and knowledge of parameters such as the subglottal pressure will aid in diagnosis and treatment selection.

Citing Articles

Three-Dimensional Analysis of Vocal Fold Oscillations: Correlating Superior and Medial Surface Dynamics Using Ex Vivo Human Hemilarynges.

Veltrup R, Angerer S, Gessner E, Matheis F, Summerer E, Henningson J Bioengineering (Basel). 2024; 11(10).

PMID: 39451353 PMC: 11505270. DOI: 10.3390/bioengineering11100977.


Neural network-based estimation of biomechanical vocal fold parameters.

Donhauser J, Tur B, Dollinger M Front Physiol. 2024; 15:1282574.

PMID: 38449783 PMC: 10916882. DOI: 10.3389/fphys.2024.1282574.


Toward Generalizable Machine Learning Models in Speech, Language, and Hearing Sciences: Estimating Sample Size and Reducing Overfitting.

Ghasemzadeh H, Hillman R, Mehta D J Speech Lang Hear Res. 2024; 67(3):753-781.

PMID: 38386017 PMC: 11005022. DOI: 10.1044/2023_JSLHR-23-00273.


Estimating subglottal pressure and vocal fold adduction from the produced voice in a single-subject study (L).

Zhang Z J Acoust Soc Am. 2022; 151(2):1337.

PMID: 35232110 PMC: 9013286. DOI: 10.1121/10.0009616.


Estimation of Subglottal Pressure, Vocal Fold Collision Pressure, and Intrinsic Laryngeal Muscle Activation From Neck-Surface Vibration Using a Neural Network Framework and a Voice Production Model.

Ibarra E, Parra J, Alzamendi G, Cortes J, Espinoza V, Mehta D Front Physiol. 2021; 12:732244.

PMID: 34539451 PMC: 8440844. DOI: 10.3389/fphys.2021.732244.


References
1.
Zhuang P, Swinarska J, Robieux C, Hoffman M, Lin S, Jiang J . Measurement of phonation threshold power in normal and disordered voice production. Ann Otol Rhinol Laryngol. 2013; 122(9):555-60. PMC: 4583103. DOI: 10.1177/000348941312200904. View

2.
Mills R, Dodd K, Ablavsky A, Devine E, Jiang J . Parameters From the Complete Phonatory Range of an Excised Rabbit Larynx. J Voice. 2017; 31(4):517.e9-517.e17. DOI: 10.1016/j.jvoice.2016.12.018. View

3.
Dollinger M, Hoppe U, Hettlich F, Lohscheller J, Schuberth S, Eysholdt U . Vibration parameter extraction from endoscopic image series of the vocal folds. IEEE Trans Biomed Eng. 2002; 49(8):773-81. DOI: 10.1109/TBME.2002.800755. View

4.
Alipour F, Jaiswal S . Phonatory characteristics of excised pig, sheep, and cow larynges. J Acoust Soc Am. 2008; 123(6):4572-81. PMC: 2468220. DOI: 10.1121/1.2908289. View

5.
Dollinger M, Gomez P, Patel R, Alexiou C, Bohr C, Schutzenberger A . Biomechanical simulation of vocal fold dynamics in adults based on laryngeal high-speed videoendoscopy. PLoS One. 2017; 12(11):e0187486. PMC: 5679561. DOI: 10.1371/journal.pone.0187486. View