» Articles » PMID: 37310489

Extreme Image Transformations Affect Humans and Machines Differently

Overview
Journal Biol Cybern
Specialties Neurology
Physiology
Date 2023 Jun 13
PMID 37310489
Authors
Affiliations
Soon will be listed here.
Abstract

Some recent artificial neural networks (ANNs) claim to model aspects of primate neural and human performance data. Their success in object recognition is, however, dependent on exploiting low-level features for solving visual tasks in a way that humans do not. As a result, out-of-distribution or adversarial input is often challenging for ANNs. Humans instead learn abstract patterns and are mostly unaffected by many extreme image distortions. We introduce a set of novel image transforms inspired by neurophysiological findings and evaluate humans and ANNs on an object recognition task. We show that machines perform better than humans for certain transforms and struggle to perform at par with humans on others that are easy for humans. We quantify the differences in accuracy for humans and machines and find a ranking of difficulty for our transforms for human data. We also suggest how certain characteristics of human visual processing can be adapted to improve the performance of ANNs for our difficult-for-machines transforms.

Citing Articles

What can computer vision learn from visual neuroscience? Introduction to the special issue.

Chen K, Kashyap H, Krichmar J, Li X Biol Cybern. 2023; 117(4-5):297-298.

PMID: 37812267 DOI: 10.1007/s00422-023-00977-6.

References
1.
Tanaka K . Mechanisms of visual object recognition: monkey and human studies. Curr Opin Neurobiol. 1997; 7(4):523-9. DOI: 10.1016/s0959-4388(97)80032-3. View

2.
Tarr M, Bulthoff H . Image-based object recognition in man, monkey and machine. Cognition. 1998; 67(1-2):1-20. DOI: 10.1016/s0010-0277(98)00026-2. View

3.
Hubel D, Wiesel T . Receptive fields, binocular interaction and functional architecture in the cat's visual cortex. J Physiol. 1962; 160:106-54. PMC: 1359523. DOI: 10.1113/jphysiol.1962.sp006837. View

4.
Ekstrom A, Isham E . Human spatial navigation: Representations across dimensions and scales. Curr Opin Behav Sci. 2017; 17:84-89. PMC: 5678987. DOI: 10.1016/j.cobeha.2017.06.005. View

5.
Koenderink J . The structure of images. Biol Cybern. 1984; 50(5):363-70. DOI: 10.1007/BF00336961. View