Improved Modeling of Human Vision by Incorporating Robustness to Blur in Convolutional Neural Networks

Overview

Journal Nat Commun

Specialty Biology

Date 2024 Mar 5

PMID 38443349

Authors

Hojin Jang

Frank Tong


Abstract

Whenever a visual scene is cast onto the retina, much of it will appear degraded due to poor resolution in the periphery; moreover, optical defocus can cause blur in central vision. However, the pervasiveness of blurry or degraded input is typically overlooked in the training of convolutional neural networks (CNNs). We hypothesized that the absence of blurry training inputs may cause CNNs to rely excessively on high spatial frequency information for object recognition, thereby causing systematic deviations from biological vision. We evaluated this hypothesis by comparing standard CNNs with CNNs trained on a combination of clear and blurry images. We show that blur-trained CNNs outperform standard CNNs at predicting neural responses to objects across a variety of viewing conditions. Moreover, blur-trained CNNs acquire increased sensitivity to shape information and greater robustness to multiple forms of visual noise, leading to improved correspondence with human perception. Our results provide multi-faceted neurocomputational evidence that blurry visual experiences may be critical for conferring robustness to biological visual systems.
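The training manipulation described in the abstract, mixing clear and blurry images, can be illustrated with a small sketch. This is not the authors' pipeline: it assumes Gaussian blur as the degradation, and the blur levels in `sigmas`, the edge-replication padding, and the function names `gaussian_kernel`, `gaussian_blur`, and `maybe_blur` are all hypothetical choices made for illustration. Images are plain nested lists of grayscale values so the example runs without third-party libraries.

```python
import math
import random


def gaussian_kernel(sigma, radius=None):
    """Return a normalized 1-D Gaussian kernel (weights sum to 1)."""
    if radius is None:
        # Assumption: truncate the kernel at about 3 standard deviations.
        radius = max(1, int(math.ceil(3 * sigma)))
    weights = [math.exp(-(x * x) / (2.0 * sigma * sigma))
               for x in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def _blur_rows(image, kernel):
    """Convolve every row with the kernel, replicating edge pixels."""
    radius = len(kernel) // 2
    out = []
    for row in image:
        n = len(row)
        new_row = []
        for i in range(n):
            acc = 0.0
            for k, w in enumerate(kernel):
                j = min(max(i + k - radius, 0), n - 1)  # clamp to image bounds
                acc += w * row[j]
            new_row.append(acc)
        out.append(new_row)
    return out


def _transpose(image):
    return [list(col) for col in zip(*image)]


def gaussian_blur(image, sigma):
    """Separable Gaussian blur; sigma <= 0 returns an unblurred float copy."""
    if sigma <= 0:
        return [[float(v) for v in row] for row in image]
    kernel = gaussian_kernel(sigma)
    blurred = _blur_rows(image, kernel)                         # horizontal pass
    return _transpose(_blur_rows(_transpose(blurred), kernel))  # vertical pass


def maybe_blur(image, sigmas=(0.0, 1.0, 2.0, 4.0, 8.0), rng=random):
    """Pick one blur level at random so training sees clear and blurry inputs.

    The values in `sigmas` are illustrative placeholders, not values
    taken from the abstract.
    """
    sigma = rng.choice(sigmas)
    return gaussian_blur(image, sigma), sigma
```

In a training loop, `maybe_blur` would be applied to each image before it is passed to the network, so that a `sigma` of 0 yields a clear image and larger values remove progressively more high spatial frequency content.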

Citing Articles

Configural processing as an optimized strategy for robust object recognition in neural networks.

Jang H, Sinha P, Boix X. Commun Biol. 2025; 8(1):386.

PMID: 40055492 PMC: 11889204. DOI: 10.1038/s42003-025-07672-1.

Unraveling the complexity of rat object vision requires a full convolutional network and beyond.

Muratore P, Alemi A, Zoccolan D. Patterns (N Y). 2025; 6(2):101149.

PMID: 40041851 PMC: 11873012. DOI: 10.1016/j.patter.2024.101149.

Convolutional neural network models applied to neuronal responses in macaque V1 reveal limited nonlinear processing.

Miao H, Tong F. J Vis. 2024; 24(6):1.

PMID: 38829629 PMC: 11156204. DOI: 10.1167/jov.24.6.1.
