» Articles » PMID: 38781156

Diverse Task-driven Modeling of Macaque V4 Reveals Functional Specialization Towards Semantic Tasks

Overview
Specialty Biology
Date 2024 May 23
PMID 38781156
Authors
Affiliations
Soon will be listed here.
Abstract

Responses to natural stimuli in area V4-a mid-level area of the visual ventral stream-are well predicted by features from convolutional neural networks (CNNs) trained on image classification. This result has been taken as evidence for the functional role of V4 in object classification. However, we currently do not know if and to what extent V4 plays a role in solving other computational objectives. Here, we investigated normative accounts of V4 (and V1 for comparison) by predicting macaque single-neuron responses to natural images from the representations extracted by 23 CNNs trained on different computer vision tasks including semantic, geometric, 2D, and 3D types of tasks. We found that V4 was best predicted by semantic classification features and exhibited high task selectivity, while the choice of task was less consequential to V1 performance. Consistent with traditional characterizations of V4 function that show its high-dimensional tuning to various 2D and 3D stimulus directions, we found that diverse non-semantic tasks explained aspects of V4 function that are not captured by individual semantic tasks. Nevertheless, jointly considering the features of a pair of semantic classification tasks was sufficient to yield one of our top V4 models, solidifying V4's main functional role in semantic processing and suggesting that V4's selectivity to 2D or 3D stimulus properties found by electrophysiologists can result from semantic functional goals.

Citing Articles

A large-scale examination of inductive biases shaping high-level visual representation in brains and machines.

Conwell C, Prince J, Kay K, Alvarez G, Konkle T Nat Commun. 2024; 15(1):9383.

PMID: 39477923 PMC: 11526138. DOI: 10.1038/s41467-024-53147-y.


Heterogeneous orientation tuning in the primary visual cortex of mice diverges from Gabor-like receptive fields in primates.

Fu J, Pierzchlewicz P, Willeke K, Bashiri M, Muhammad T, Diamantaki M Cell Rep. 2024; 43(8):114639.

PMID: 39167488 PMC: 11463840. DOI: 10.1016/j.celrep.2024.114639.


How well do models of visual cortex generalize to out of distribution samples?.

Ren Y, Bashivan P PLoS Comput Biol. 2024; 20(5):e1011145.

PMID: 38820563 PMC: 11216589. DOI: 10.1371/journal.pcbi.1011145.


High-performing neural network models of visual cortex benefit from high latent dimensionality.

Elmoznino E, Bonner M PLoS Comput Biol. 2024; 20(1):e1011792.

PMID: 38198504 PMC: 10805290. DOI: 10.1371/journal.pcbi.1011792.


A Unifying Principle for the Functional Organization of Visual Cortex.

Margalit E, Lee H, Finzi D, DiCarlo J, Grill-Spector K, Yamins D bioRxiv. 2023; .

PMID: 37292946 PMC: 10245753. DOI: 10.1101/2023.05.18.541361.


References
1.
Franke K, Willeke K, Ponder K, Galdamez M, Zhou N, Muhammad T . State-dependent pupil dilation rapidly shifts visual feature selectivity. Nature. 2022; 610(7930):128-134. PMC: 10635574. DOI: 10.1038/s41586-022-05270-3. View

2.
Yamins D, DiCarlo J . Using goal-driven deep learning models to understand sensory cortex. Nat Neurosci. 2016; 19(3):356-65. DOI: 10.1038/nn.4244. View

3.
Willmore B, Prenger R, Wu M, Gallant J . The berkeley wavelet transform: a biologically inspired orthogonal wavelet transform. Neural Comput. 2008; 20(6):1537-64. PMC: 5464375. DOI: 10.1162/neco.2007.05-07-513. View

4.
Pasupathy A, CONNOR C . Shape representation in area V4: position-specific tuning for boundary conformation. J Neurophysiol. 2001; 86(5):2505-19. DOI: 10.1152/jn.2001.86.5.2505. View

5.
Quian Quiroga R, Reddy L, Koch C, Fried I . Decoding visual inputs from multiple neurons in the human temporal lobe. J Neurophysiol. 2007; 98(4):1997-2007. DOI: 10.1152/jn.00125.2007. View