
Linking Speech Perception and Neurophysiology: Speech Decoding Guided by Cascaded Oscillators Locked to the Input Rhythm

Overview
Journal: Front Psychol
Date: 2011 Jul 12
PMID: 21743809
Citations: 142
Abstract

The premise of this study is that current models of speech perception, which are driven by acoustic features alone, are incomplete, and that the role of decoding time during memory access must be incorporated to account for the observed patterns of recognition phenomena. It is postulated that decoding time is governed by a cascade of neuronal oscillators, which guide template-matching operations at a hierarchy of temporal scales. Cascaded cortical oscillations in the theta, beta, and gamma frequency bands are argued to be crucial for speech intelligibility. Intelligibility is high so long as these oscillations remain phase-locked to the auditory input rhythm. A model (Tempo) is presented which is capable of emulating recent psychophysical data on the intelligibility of spoken sentences as a function of "packaging" rate (Ghitza and Greenberg, 2009). The data show that intelligibility of speech that is time-compressed by a factor of 3 (i.e., a high syllabic rate) is poor (above 50% word error rate), but is substantially restored when the information stream is re-packaged by the insertion of silent gaps in between successive compressed-signal intervals. This finding is counterintuitive and difficult to explain using classical models of speech perception, but it emerges naturally from the Tempo architecture.
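
The re-packaging manipulation mentioned in the abstract (silent gaps inserted between successive intervals of a time-compressed signal) can be sketched roughly as follows. This is only an illustration of the idea, not the procedure used in the study: the function names `repackage` and `packaging_rate_hz`, and every interval length, gap length, and sampling rate shown, are hypothetical choices made for the example.

```python
def repackage(samples, interval_len, gap_len):
    """Split `samples` into consecutive intervals of `interval_len` samples
    and insert `gap_len` zero-valued samples (silence) between successive
    intervals. All parameter values are assumptions for illustration."""
    if interval_len <= 0 or gap_len < 0:
        raise ValueError("interval_len must be > 0 and gap_len must be >= 0")
    out = []
    for start in range(0, len(samples), interval_len):
        if start > 0:
            # Silence goes between intervals only, not before the first one.
            out.extend([0.0] * gap_len)
        out.extend(samples[start:start + interval_len])
    return out


def packaging_rate_hz(interval_len, gap_len, sample_rate):
    """Number of (interval + gap) packets per second."""
    return sample_rate / (interval_len + gap_len)


# Toy signal standing in for a time-compressed waveform.
compressed = [1.0, 2.0, 3.0, 4.0, 5.0]
repackaged = repackage(compressed, interval_len=2, gap_len=1)
```

With a hypothetical 16 kHz sampling rate, 640-sample intervals separated by 1280-sample gaps would give `packaging_rate_hz(640, 1280, 16000)`, about 8.3 packets per second. Varying `gap_len` while holding `interval_len` fixed is one way to sweep the packaging rate.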

Citing Articles

Physiological Entrainment: A Key Mind-Body Mechanism for Cognitive, Motor and Affective Functioning, and Well-Being.

Barbaresi M, Nardo D, Fagioli S. Brain Sci. 2025; 15(1).

PMID: 39851371 PMC: 11763407. DOI: 10.3390/brainsci15010003.


The human auditory cortex concurrently tracks syllabic and phonemic timescales via acoustic spectral flux.

Giroud J, Trebuchon A, Mercier M, Davis M, Morillon B. Sci Adv. 2024; 10(51):eado8915.

PMID: 39705351 PMC: 11661434. DOI: 10.1126/sciadv.ado8915.


Concurrent processing of the prosodic hierarchy is supported by cortical entrainment and phase-amplitude coupling.

Oderbolz C, Stark E, Sauppe S, Meyer M. Cereb Cortex. 2024; 34(12).

PMID: 39704246 PMC: 11659776. DOI: 10.1093/cercor/bhae479.


Exploring neural oscillations during speech perception via surrogate gradient spiking neural networks.

Bittar A, Garner P. Front Neurosci. 2024; 18:1449181.

PMID: 39385848 PMC: 11461475. DOI: 10.3389/fnins.2024.1449181.


Dog-human vocal interactions match dogs' sensory-motor tuning.

Deaux E, Piette T, Gaunet F, Legou T, Arnal L, Giraud A. PLoS Biol. 2024; 22(10):e3002789.

PMID: 39352912 PMC: 11444399. DOI: 10.1371/journal.pbio.3002789.


References
1. Pulvermuller F. Words in the brain's language. Behav Brain Sci. 2001; 22(2):253-79; discussion 280-336.

2. Garvey W. The intelligibility of speeded speech. J Exp Psychol. 1953; 45(2):102-8. DOI: 10.1037/h0054381.

3. Lakatos P, Shah A, Knuth K, Ulbert I, Karmos G, Schroeder C. An oscillatory hierarchy controlling neuronal excitability and stimulus processing in the auditory cortex. J Neurophysiol. 2005; 94(3):1904-11. DOI: 10.1152/jn.00263.2005.

4. Zatorre R, Belin P, Penhune V. Structure and function of auditory cortex: music and speech. Trends Cogn Sci. 2002; 6(1):37-46. DOI: 10.1016/s1364-6613(00)01816-7.

5. Palva J, Monto S, Kulashekhar S, Palva S. Neuronal synchrony reveals working memory networks and predicts individual memory capacity. Proc Natl Acad Sci U S A. 2010; 107(16):7580-5. PMC: 2867688. DOI: 10.1073/pnas.0913113107.