» Articles » PMID: 21895088

Predicting Speech Intelligibility Based on the Signal-to-noise Envelope Power Ratio After Modulation-frequency Selective Processing

Overview
Journal J Acoust Soc Am
Date 2011 Sep 8
PMID 21895088
Citations 53
Authors
Affiliations
Soon will be listed here.
Abstract

A model for predicting the intelligibility of processed noisy speech is proposed. The speech-based envelope power spectrum model has a similar structure as the model of Ewert and Dau [(2000). J. Acoust. Soc. Am. 108, 1181-1196], developed to account for modulation detection and masking data. The model estimates the speech-to-noise envelope power ratio, SNR(env), at the output of a modulation filterbank and relates this metric to speech intelligibility using the concept of an ideal observer. Predictions were compared to data on the intelligibility of speech presented in stationary speech-shaped noise. The model was further tested in conditions with noisy speech subjected to reverberation and spectral subtraction. Good agreement between predictions and data was found in all cases. For spectral subtraction, an analysis of the model's internal representation of the stimuli revealed that the predicted decrease of intelligibility was caused by the estimated noise envelope power exceeding that of the speech. The classical concept of the speech transmission index fails in this condition. The results strongly suggest that the signal-to-noise ratio at the output of a modulation frequency selective process provides a key measure of speech intelligibility.

Citing Articles

FrAMBI: A Software Framework for Auditory Modeling Based on Bayesian Inference.

Barumerli R, Majdak P Neuroinformatics. 2025; 23(2):20.

PMID: 39928214 DOI: 10.1007/s12021-024-09702-5.


Neurometric amplitude modulation detection in the inferior colliculus of Young and Aged rats.

Bartlett E, Han E, Parthasarathy A Hear Res. 2024; 447:109028.

PMID: 38733711 PMC: 11129790. DOI: 10.1016/j.heares.2024.109028.


Original speech and its echo are segregated and separately processed in the human brain.

Gao J, Chen H, Fang M, Ding N PLoS Biol. 2024; 22(2):e3002498.

PMID: 38358954 PMC: 10868781. DOI: 10.1371/journal.pbio.3002498.


Neural Fluctuation Contrast as a Code for Complex Sounds: The Role and Control of Peripheral Nonlinearities.

Carney L Hear Res. 2024; 443:108966.

PMID: 38310710 PMC: 10923127. DOI: 10.1016/j.heares.2024.108966.


Sentence recognition with modulation-filtered speech segments for younger and older adults: Effects of hearing impairment and cognition.

Fogerty D, Ahlstrom J, Dubno J J Acoust Soc Am. 2023; 154(5):3328-3343.

PMID: 37983296 PMC: 10663055. DOI: 10.1121/10.0022445.