Learning the 3-D Structure of Objects from 2-D Views Depends on Shape, Not Format

Overview

Journal J Vis

Specialty Ophthalmology

Date 2016 May 7

PMID 27153196

Citations 4

Authors

Moqian Tian

Daniel Yamins

Kalanit Grill-Spector

Affiliations

Soon will be listed here.

Abstract

Humans can learn to recognize new objects just from observing example views. However, it is unknown what structural information enables this learning. To address this question, we manipulated the amount of structural information given to subjects during unsupervised learning by varying the format of the trained views. We then tested how format affected participants' ability to discriminate similar objects across views that were rotated 90° apart. We found that, after training, participants' performance increased and generalized to new views in the same format. Surprisingly, the improvement was similar across line drawings, shape from shading, and shape from shading + stereo even though the latter two formats provide richer depth information compared to line drawings. In contrast, participants' improvement was significantly lower when training used silhouettes, suggesting that silhouettes do not have enough information to generate a robust 3-D structure. To test whether the learned object representations were format-specific or format-invariant, we examined if learning novel objects from example views transfers across formats. We found that learning objects from example line drawings transferred to shape from shading and vice versa. These results have important implications for theories of object recognition because they suggest that (a) learning the 3-D structure of objects does not require rich structural cues during training as long as shape information of internal and external features is provided and (b) learning generates shape-based object representations independent of the training format.

Citing Articles

The analysis of the structural parameter influences on measurement errors in a binocular 3D reconstruction system: a portable 3D system.

Sha O, Zhang H, Bai J, Zhang Y, Yang J PeerJ Comput Sci. 2023; 9:e1610.

PMID: 37810332 PMC: 10557943. DOI: 10.7717/peerj-cs.1610.

Standardised images of novel objects created with generative adversarial networks.

Cooper P, Colton E, Bode S, Chong T Sci Data. 2023; 10(1):575.

PMID: 37660073 PMC: 10475029. DOI: 10.1038/s41597-023-02483-7.

Combined Neural Tuning in Human Ventral Temporal Cortex Resolves the Perceptual Ambiguity of Morphed 2D Images.

Rosenke M, Davidenko N, Grill-Spector K, Weiner K Cereb Cortex. 2020; 30(9):4882-4898.

PMID: 32372098 PMC: 7391265. DOI: 10.1093/cercor/bhaa081.

The functional neuroanatomy of face perception: from brain measurements to deep neural networks.

Grill-Spector K, Weiner K, Gomez J, Stigliani A, Natu V Interface Focus. 2018; 8(4):20180013.

PMID: 29951193 PMC: 6015811. DOI: 10.1098/rsfs.2018.0013.

References

Liu Z, Knill D, Kersten D . Object classification for human and ideal observers. Vision Res. 1995; 35(4):549-68. DOI: 10.1016/0042-6989(94)00150-k. View

Bulthoff H, Mallot H . Integration of depth modules: stereo and shading. J Opt Soc Am A. 1988; 5(10):1749-58. DOI: 10.1364/josaa.5.001749. View

Kellman P, Garrigan P, Shipley T . Object interpolation in three dimensions. Psychol Rev. 2005; 112(3):586-609. DOI: 10.1037/0033-295X.112.3.586. View

Marr D, Nishihara H . Representation and recognition of the spatial organization of three-dimensional shapes. Proc R Soc Lond B Biol Sci. 1978; 200(1140):269-94. DOI: 10.1098/rspb.1978.0020. View

Sary G, Vogels R, Orban G . Cue-invariant shape selectivity of macaque inferior temporal neurons. Science. 1993; 260(5110):995-7. DOI: 10.1126/science.8493538. View

Kourtzi Z, Kanwisher N . Cortical regions involved in perceiving object shape. J Neurosci. 2000; 20(9):3310-8. PMC: 6773111. View

Lloyd-Jones T, Luckhurst L . Outline shape is a mediator of object recognition that is particularly important for living things. Mem Cognit. 2002; 30(4):489-98. DOI: 10.3758/bf03194950. View

Edelman S, Bulthoff H . Orientation dependence in the recognition of familiar and novel views of three-dimensional objects. Vision Res. 1992; 32(12):2385-400. DOI: 10.1016/0042-6989(92)90102-o. View

Bulthoff H, Edelman S . Psychophysical support for a two-dimensional view interpolation theory of object recognition. Proc Natl Acad Sci U S A. 1992; 89(1):60-4. PMC: 48175. DOI: 10.1073/pnas.89.1.60. View

10.

Foldiak P . Learning Invariance from Transformation Sequences. Neural Comput. 2019; 3(2):194-200. DOI: 10.1162/neco.1991.3.2.194. View

11.

HAYWARD W, Tarr M . Testing conditions for viewpoint invariance in object recognition. J Exp Psychol Hum Percept Perform. 1997; 23(5):1511-21. DOI: 10.1037//0096-1523.23.5.1511. View

12.

Hubel D, Wiesel T . RECEPTIVE FIELDS AND FUNCTIONAL ARCHITECTURE IN TWO NONSTRIATE VISUAL AREAS (18 AND 19) OF THE CAT. J Neurophysiol. 1965; 28:229-89. DOI: 10.1152/jn.1965.28.2.229. View

13.

McMahon D, Bondar I, Afuwape O, Ide D, Leopold D . One month in the life of a neuron: longitudinal single-unit electrophysiology in the monkey visual system. J Neurophysiol. 2014; 112(7):1748-62. PMC: 4157170. DOI: 10.1152/jn.00052.2014. View

14.

Wallis G, Bulthoff H . Effects of temporal association on recognition memory. Proc Natl Acad Sci U S A. 2001; 98(8):4800-4. PMC: 31914. DOI: 10.1073/pnas.071028598. View

15.

Yamins D, Hong H, Cadieu C, Solomon E, Seibert D, DiCarlo J . Performance-optimized hierarchical models predict neural responses in higher visual cortex. Proc Natl Acad Sci U S A. 2014; 111(23):8619-24. PMC: 4060707. DOI: 10.1073/pnas.1403112111. View

16.

Lee Y, Saunders J . Stereo improves 3D shape discrimination even when rich monocular shape cues are available. J Vis. 2011; 11(9). DOI: 10.1167/11.9.6. View

17.

Appelbaum L, Wade A, Vildavski V, Pettet M, Norcia A . Cue-invariant networks for figure and background processing in human visual cortex. J Neurosci. 2006; 26(45):11695-708. PMC: 2711040. DOI: 10.1523/JNEUROSCI.2741-06.2006. View

18.

Kourtzi Z, Erb M, Grodd W, Bulthoff H . Representation of the perceived 3-D object shape in the human lateral occipital complex. Cereb Cortex. 2003; 13(9):911-20. DOI: 10.1093/cercor/13.9.911. View

19.

Nefs H, Harris J . Vergence effects on the perception of motion-in-depth. Exp Brain Res. 2007; 183(3):313-22. DOI: 10.1007/s00221-007-1046-5. View

20.

Mendola J, Dale A, Fischl B, Liu A, Tootell R . The representation of illusory and real contours in human cortical visual areas revealed by functional magnetic resonance imaging. J Neurosci. 1999; 19(19):8560-72. PMC: 6783043. View