» Articles » PMID: 39242968

Anchor Objects Drive Realism While Diagnostic Objects Drive Categorization in GAN Generated Scenes

Overview
Journal Commun Psychol
Publisher Nature Portfolio
Date 2024 Sep 6
PMID 39242968
Authors
Affiliations
Soon will be listed here.
Abstract

Our visual surroundings are highly complex. Despite this, we understand and navigate them effortlessly. This requires transforming incoming sensory information into representations that not only span low- to high-level visual features (e.g., edges, object parts, objects), but likely also reflect co-occurrence statistics of objects in real-world scenes. Here, so-called anchor objects are defined as being highly predictive of the location and identity of frequently co-occuring (usually smaller) objects, derived from object clustering statistics in real-world scenes, while so-called diagnostic objects are predictive of the larger semantic context (i.e., scene category). Across two studies (N = 50, N = 44), we investigate which of these properties underlie scene understanding across two dimensions - realism and categorisation - using scenes generated from Generative Adversarial Networks (GANs) which naturally vary along these dimensions. We show that anchor objects and mainly high-level features extracted from a range of pre-trained deep neural networks (DNNs) drove realism both at first glance and after initial processing. Categorisation performance was mainly determined by diagnostic objects, regardless of realism, at first glance and after initial processing. Our results are testament to the visual system's ability to pick up on reliable, category specific sources of information that are flexible towards disturbances across the visual feature-hierarchy.

References
1.
Kriegeskorte N, Mur M, Bandettini P . Representational similarity analysis - connecting the branches of systems neuroscience. Front Syst Neurosci. 2008; 2:4. PMC: 2605405. DOI: 10.3389/neuro.06.004.2008. View

2.
Greene M, Hansen B . Disentangling the Independent Contributions of Visual and Conceptual Features to the Spatiotemporal Dynamics of Scene Categorization. J Neurosci. 2020; 40(27):5283-5299. PMC: 7329300. DOI: 10.1523/JNEUROSCI.2088-19.2020. View

3.
Greene M . Statistics of high-level scene context. Front Psychol. 2013; 4:777. PMC: 3810604. DOI: 10.3389/fpsyg.2013.00777. View

4.
Wyatte D, Curran T, OReilly R . The limits of feedforward vision: recurrent processing promotes robust object recognition when objects are degraded. J Cogn Neurosci. 2012; 24(11):2248-61. DOI: 10.1162/jocn_a_00282. View

5.
Swets J . Indices of discrimination or diagnostic accuracy: their ROCs and implied models. Psychol Bull. 1986; 99(1):100-17. View