Examining the Coding Strength of Object Identity and Nonidentity Features in Human Occipito-Temporal Cortex and Convolutional Neural Networks

Overview

Journal J Neurosci

Specialty Neurology

Date 2021 Apr 1

PMID 33789916

Citations 12

Authors

Yaoda Xu

Maryam Vaziri-Pashkam

Abstract

A visual object is characterized by multiple visual features, including its identity, position, and size. Despite the usefulness of identity and nonidentity features in vision and their joint coding throughout the primate ventral visual processing pathway, they have so far been studied relatively independently. Here, in both female and male human participants, the coding of identity and nonidentity features was examined together across the human ventral visual pathway. The nonidentity features tested included two Euclidean features (position and size) and two non-Euclidean features (image statistics and spatial frequency (SF) content of an image). Overall, identity representation increased and nonidentity feature representation decreased along the ventral visual pathway, with identity outweighing the non-Euclidean but not the Euclidean features at higher levels of visual processing. In 14 convolutional neural networks (CNNs) pretrained for object categorization with varying architecture, depth, and with/without recurrent processing, nonidentity feature representation showed an initial large increase from early to mid-stage of processing, followed by a decrease at later stages of processing, different from brain responses. Additionally, from lower to higher levels of visual processing, position became more underrepresented and image statistics and SF became more overrepresented compared with identity in CNNs than in the human brain. Similar results were obtained in a CNN trained with stylized images that emphasized shape representations. Overall, by measuring the coding strength of object identity and nonidentity features together, our approach provides a new tool for characterizing feature coding in the human brain and the correspondence between the brain and CNNs.

Significance Statement

This study examined the coding strength of object identity and four types of nonidentity features along the human ventral visual processing pathway and compared brain responses with those of 14 convolutional neural networks (CNNs) pretrained to perform object categorization. Overall, identity representation increased and nonidentity feature representation decreased along the ventral visual pathway, with some notable differences among the different nonidentity features. CNNs differed from the brain in a number of aspects in their representations of identity and nonidentity features over the course of visual processing. Our approach provides a new tool for characterizing feature coding in the human brain and the correspondence between the brain and CNNs.
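
The idea of measuring identity and nonidentity coding strength together can be made concrete with a toy sketch. The abstract does not specify the paper's actual metric, so everything below is an assumption made for illustration: coding strength is taken to be the mean Euclidean distance between response patterns that differ in only one factor, and the function names (`coding_strengths`, `identity_dominance`), the dominance index, and the synthetic "early" and "late" stages are all hypothetical.

```python
import numpy as np


def coding_strengths(patterns):
    """Return (identity_strength, nonidentity_strength).

    patterns has shape (n_identities, n_feature_values, n_units), e.g.
    objects x positions x voxels (or CNN units).
    Assumption for illustration only: "strength" is the mean Euclidean
    distance between patterns that differ in exactly one factor.
    """
    patterns = np.asarray(patterns, dtype=float)
    n_id, n_feat, _ = patterns.shape

    # Same nonidentity feature value, different object identity.
    id_dists = [
        np.linalg.norm(patterns[i, f] - patterns[j, f])
        for f in range(n_feat)
        for i in range(n_id)
        for j in range(i + 1, n_id)
    ]
    # Same object identity, different nonidentity feature value.
    feat_dists = [
        np.linalg.norm(patterns[i, f] - patterns[i, g])
        for i in range(n_id)
        for f in range(n_feat)
        for g in range(f + 1, n_feat)
    ]
    return float(np.mean(id_dists)), float(np.mean(feat_dists))


def identity_dominance(patterns):
    """Hypothetical index in [-1, 1]; positive means identity outweighs the feature."""
    ident, nonident = coding_strengths(patterns)
    return (ident - nonident) / (ident + nonident)


def synthetic_stage(id_scale, feat_scale, n_id=8, n_feat=2, n_units=50, seed=0):
    """Build fake patterns as a weighted sum of an identity and a feature component."""
    rng = np.random.default_rng(seed)
    id_part = rng.standard_normal((n_id, 1, n_units))
    feat_part = rng.standard_normal((1, n_feat, n_units))
    return id_scale * id_part + feat_scale * feat_part


# Synthetic data only: an "early" stage dominated by the nonidentity feature
# and a "late" stage dominated by identity.
early = synthetic_stage(id_scale=0.5, feat_scale=2.0)
late = synthetic_stage(id_scale=2.0, feat_scale=0.5)

print("early-stage dominance:", identity_dominance(early))
print("late-stage dominance:", identity_dominance(late))
```

Computing both distances from the same set of patterns is what allows the two kinds of information to be compared on a common scale at each processing stage, whether the units are voxels in a brain region or activations in a CNN layer.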

Citing Articles

The human posterior parietal cortices orthogonalize the representation of different streams of information concurrently coded in visual working memory.

Xu Y. PLoS Biol. 2024; 22(11):e3002915.

PMID: 39570984 PMC: 11620661. DOI: 10.1371/journal.pbio.3002915.

Bridging the gap between EEG and DCNNs reveals a fatigue mechanism of facial repetition suppression.

Lu Z, Ku Y. iScience. 2023; 26(12):108501.

PMID: 38089588 PMC: 10711494. DOI: 10.1016/j.isci.2023.108501.

Multiple visual objects are represented differently in the human brain and convolutional neural networks.

Mocz V, Jeong S, Chun M, Xu Y. Sci Rep. 2023; 13(1):9088.

PMID: 37277406 PMC: 10241785. DOI: 10.1038/s41598-023-36029-z.

Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks.

Mocz V, Jeong S, Chun M, Xu Y. bioRxiv. 2023; .

PMID: 36909506 PMC: 10002658. DOI: 10.1101/2023.02.28.530472.

Comparing the Dominance of Color and Form Information across the Human Ventral Visual Pathway and Convolutional Neural Networks.

Taylor J, Xu Y. J Cogn Neurosci. 2023; 35(5):816-840.

PMID: 36877074 PMC: 11283826. DOI: 10.1162/jocn_a_01979.
