» Articles » PMID: 37005064

Humans Predict the Forest, Not the Trees: Statistical Learning of Spatiotemporal Structure in Visual Scenes

Overview
Journal Cereb Cortex
Specialty Neurology
Date 2023 Apr 2
PMID 37005064
Authors
Affiliations
Soon will be listed here.
Abstract

The human brain is capable of using statistical regularities to predict future inputs. In the real world, such inputs typically comprise a collection of objects (e.g. a forest constitutes numerous trees). The present study aimed to investigate whether perceptual anticipation relies on lower-level or higher-level information. Specifically, we examined whether the human brain anticipates each object in a scene individually or anticipates the scene as a whole. To explore this issue, we first trained participants to associate co-occurring objects within fixed spatial arrangements. Meanwhile, participants implicitly learned temporal regularities between these displays. We then tested how spatial and temporal violations of the structure modulated behavior and neural activity in the visual system using fMRI. We found that participants only showed a behavioral advantage of temporal regularities when the displays conformed to their previously learned spatial structure, demonstrating that humans form configuration-specific temporal expectations instead of predicting individual objects. Similarly, we found suppression of neural responses for temporally expected compared with temporally unexpected objects in lateral occipital cortex only when the objects were embedded within expected configurations. Overall, our findings indicate that humans form expectations about object configurations, demonstrating the prioritization of higher-level over lower-level information in temporal expectation.

Citing Articles

The representation of contextual cue is stimulus-specific yet its expression is flexible.

Chen X, Bai S, Ren Q, Chen Y, Long F, Jiang Y PeerJ. 2024; 12:e17318.

PMID: 38708357 PMC: 11067901. DOI: 10.7717/peerj.17318.


Location-specific deviant responses to object sequences in macaque inferior temporal cortex.

Esmailpour H, Vogels R Sci Rep. 2024; 14(1):3757.

PMID: 38355712 PMC: 10866936. DOI: 10.1038/s41598-024-54298-0.

References
1.
Alvarez G, Oliva A . Spatial ensemble statistics are efficient codes that can be represented with reduced attention. Proc Natl Acad Sci U S A. 2009; 106(18):7345-50. PMC: 2670879. DOI: 10.1073/pnas.0808981106. View

2.
Julian J, Fedorenko E, Webster J, Kanwisher N . An algorithmic method for functionally defining regions of interest in the ventral visual pathway. Neuroimage. 2012; 60(4):2357-64. DOI: 10.1016/j.neuroimage.2012.02.055. View

3.
Brady T, Konkle T, Alvarez G, Oliva A . Visual long-term memory has a massive storage capacity for object details. Proc Natl Acad Sci U S A. 2008; 105(38):14325-9. PMC: 2533687. DOI: 10.1073/pnas.0803390105. View

4.
Hayworth K, Lescroart M, Biederman I . Neural encoding of relative position. J Exp Psychol Hum Percept Perform. 2011; 37(4):1032-50. DOI: 10.1037/a0022338. View

5.
Lengyel G, Zalalyte G, Pantelides A, Ingram J, Fiser J, Lengyel M . Unimodal statistical learning produces multimodal object-like representations. Elife. 2019; 8. PMC: 6529220. DOI: 10.7554/eLife.43942. View