» Articles » PMID: 33789916

Examining the Coding Strength of Object Identity and Nonidentity Features in Human Occipito-Temporal Cortex and Convolutional Neural Networks

Overview
Journal J Neurosci
Specialty Neurology
Date 2021 Apr 1
PMID 33789916
Citations 12
Authors
Affiliations
Soon will be listed here.
Abstract

A visual object is characterized by multiple visual features, including its identity, position and size. Despite the usefulness of identity and nonidentity features in vision and their joint coding throughout the primate ventral visual processing pathway, they have so far been studied relatively independently. Here in both female and male human participants, the coding of identity and nonidentity features was examined together across the human ventral visual pathway. The nonidentity features tested included two Euclidean features (position and size) and two non-Euclidean features (image statistics and spatial frequency (SF) content of an image). Overall, identity representation increased and nonidentity feature representation decreased along the ventral visual pathway, with identity outweighing the non-Euclidean but not the Euclidean features at higher levels of visual processing. In 14 convolutional neural networks (CNNs) pretrained for object categorization with varying architecture, depth, and with/without recurrent processing, nonidentity feature representation showed an initial large increase from early to mid-stage of processing, followed by a decrease at later stages of processing, different from brain responses. Additionally, from lower to higher levels of visual processing, position became more underrepresented and image statistics and SF became more overrepresented compared with identity in CNNs than in the human brain. Similar results were obtained in a CNN trained with stylized images that emphasized shape representations. Overall, by measuring the coding strength of object identity and nonidentity features together, our approach provides a new tool for characterizing feature coding in the human brain and the correspondence between the brain and CNNs. This study examined the coding strength of object identity and four types of nonidentity features along the human ventral visual processing pathway and compared brain responses with those of 14 convolutional neural networks (CNNs) pretrained to perform object categorization. Overall, identity representation increased and nonidentity feature representation decreased along the ventral visual pathway, with some notable differences among the different nonidentity features. CNNs differed from the brain in a number of aspects in their representations of identity and nonidentity features over the course of visual processing. Our approach provides a new tool for characterizing feature coding in the human brain and the correspondence between the brain and CNNs.

Citing Articles

The human posterior parietal cortices orthogonalize the representation of different streams of information concurrently coded in visual working memory.

Xu Y PLoS Biol. 2024; 22(11):e3002915.

PMID: 39570984 PMC: 11620661. DOI: 10.1371/journal.pbio.3002915.


Bridging the gap between EEG and DCNNs reveals a fatigue mechanism of facial repetition suppression.

Lu Z, Ku Y iScience. 2023; 26(12):108501.

PMID: 38089588 PMC: 10711494. DOI: 10.1016/j.isci.2023.108501.


Multiple visual objects are represented differently in the human brain and convolutional neural networks.

Mocz V, Jeong S, Chun M, Xu Y Sci Rep. 2023; 13(1):9088.

PMID: 37277406 PMC: 10241785. DOI: 10.1038/s41598-023-36029-z.


Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks.

Mocz V, Jeong S, Chun M, Xu Y bioRxiv. 2023; .

PMID: 36909506 PMC: 10002658. DOI: 10.1101/2023.02.28.530472.


Comparing the Dominance of Color and Form Information across the Human Ventral Visual Pathway and Convolutional Neural Networks.

Taylor J, Xu Y J Cogn Neurosci. 2023; 35(5):816-840.

PMID: 36877074 PMC: 11283826. DOI: 10.1162/jocn_a_01979.


References
1.
Sereno M, Dale A, Reppas J, Kwong K, Belliveau J, Brady T . Borders of multiple visual areas in humans revealed by functional magnetic resonance imaging. Science. 1995; 268(5212):889-93. DOI: 10.1126/science.7754376. View

2.
Eger E, Kell C, Kleinschmidt A . Graded size sensitivity of object-exemplar-evoked activity patterns within human LOC subregions. J Neurophysiol. 2008; 100(4):2038-47. DOI: 10.1152/jn.90305.2008. View

3.
Xu Y . A Tale of Two Visual Systems: Invariant and Adaptive Visual Information Representations in the Primate Brain. Annu Rev Vis Sci. 2018; 4:311-336. DOI: 10.1146/annurev-vision-091517-033954. View

4.
Kheradpisheh S, Ghodrati M, Ganjtabesh M, Masquelier T . Deep Networks Can Resemble Human Feed-forward Vision in Invariant Object Recognition. Sci Rep. 2016; 6:32672. PMC: 5013454. DOI: 10.1038/srep32672. View

5.
Kravitz D, Kriegeskorte N, Baker C . High-level visual object representations are constrained by position. Cereb Cortex. 2010; 20(12):2916-25. PMC: 2978243. DOI: 10.1093/cercor/bhq042. View