Reconstructing the Spectrotemporal Modulations of Real-life Sounds from FMRI Response Patterns
Overview
Authors
Affiliations
Ethological views of brain functioning suggest that sound representations and computations in the auditory neural system are optimized finely to process and discriminate behaviorally relevant acoustic features and sounds (e.g., spectrotemporal modulations in the songs of zebra finches). Here, we show that modeling of neural sound representations in terms of frequency-specific spectrotemporal modulations enables accurate and specific reconstruction of real-life sounds from high-resolution functional magnetic resonance imaging (fMRI) response patterns in the human auditory cortex. Region-based analyses indicated that response patterns in separate portions of the auditory cortex are informative of distinctive sets of spectrotemporal modulations. Most relevantly, results revealed that in early auditory regions, and progressively more in surrounding regions, temporal modulations in a range relevant for speech analysis (∼2-4 Hz) were reconstructed more faithfully than other temporal modulations. In early auditory regions, this effect was frequency-dependent and only present for lower frequencies (<∼2 kHz), whereas for higher frequencies, reconstruction accuracy was higher for faster temporal modulations. Further analyses suggested that auditory cortical processing optimized for the fine-grained discrimination of speech and vocal sounds underlies this enhanced reconstruction accuracy. In sum, the present study introduces an approach to embed models of neural sound representations in the analysis of fMRI response patterns. Furthermore, it reveals that, in the human brain, even general purpose and fundamental neural processing mechanisms are shaped by the physical features of real-world stimuli that are most relevant for behavior (i.e., speech, voice).
Markow Z, Trobaugh J, Richter E, Tripathy K, Rafferty S, Svoboda A Sci Rep. 2025; 15(1):3175.
PMID: 39863633 PMC: 11762274. DOI: 10.1038/s41598-025-85858-7.
Speech prosody enhances the neural processing of syntax.
Degano G, Donhauser P, Gwilliams L, Merlo P, Golestani N Commun Biol. 2024; 7(1):748.
PMID: 38902370 PMC: 11190187. DOI: 10.1038/s42003-024-06444-7.
A hierarchy of processing complexity and timescales for natural sounds in human auditory cortex.
Rupp K, Hect J, Harford E, Holt L, Ghuman A, Abel T bioRxiv. 2024; .
PMID: 38826304 PMC: 11142240. DOI: 10.1101/2024.05.24.595822.
The human auditory system uses amplitude modulation to distinguish music from speech.
Chang A, Teng X, Assaneo M, Poeppel D PLoS Biol. 2024; 22(5):e3002631.
PMID: 38805517 PMC: 11132470. DOI: 10.1371/journal.pbio.3002631.
Linguistic modulation of the neural encoding of phonemes.
Kim S, De Martino F, Overath T Cereb Cortex. 2024; 34(4.
PMID: 38687241 PMC: 11059272. DOI: 10.1093/cercor/bhae155.