» Articles » PMID: 33229557

AI, Visual Imagery, and a Case Study on the Challenges Posed by Human Intelligence Tests

Overview
Specialty Science
Date 2020 Nov 24
PMID 33229557
Citations 4
Authors
Affiliations
Soon will be listed here.
Abstract

Observations abound about the power of visual imagery in human intelligence, from how Nobel prize-winning physicists make their discoveries to how children understand bedtime stories. These observations raise an important question for cognitive science, which is, what are the computations taking place in someone's mind when they use visual imagery? Answering this question is not easy and will require much continued research across the multiple disciplines of cognitive science. Here, we focus on a related and more circumscribed question from the perspective of artificial intelligence (AI): If you have an intelligent agent that uses visual imagery-based knowledge representations and reasoning operations, then what kinds of problem solving might be possible, and how would such problem solving work? We highlight recent progress in AI toward answering these questions in the domain of visuospatial reasoning, looking at a case study of how imagery-based artificial agents can solve visuospatial intelligence tests. In particular, we first examine several variations of imagery-based knowledge representations and problem-solving strategies that are sufficient for solving problems from the Raven's Progressive Matrices intelligence test. We then look at how artificial agents, instead of being designed manually by AI researchers, might learn portions of their own knowledge and reasoning procedures from experience, including learning visuospatial domain knowledge, learning and generalizing problem-solving strategies, and learning the actual definition of the task in the first place.

Citing Articles

Let's do it: Response times in Mental Paper Folding and its execution.

Dahm S, Sachse P Q J Exp Psychol (Hove). 2024; :17470218241249727.

PMID: 38616184 PMC: 11905326. DOI: 10.1177/17470218241249727.


Modeling Sequential Dependencies in Progressive Matrices: An Auto-Regressive Item Response Theory (AR-IRT) Approach.

Myszkowski N, Storme M J Intell. 2024; 12(1).

PMID: 38248905 PMC: 10817306. DOI: 10.3390/jintelligence12010007.


Responses to Raven matrices: Governed by visual complexity and centrality.

de Winter J, Dodou D, Eisma Y Perception. 2023; 52(9):645-661.

PMID: 37264787 PMC: 10469510. DOI: 10.1177/03010066231178149.


Multimodal Art Pose Recognition and Interaction With Human Intelligence Enhancement.

Ma C, Liu Q, Dang Y Front Psychol. 2021; 12:769509.

PMID: 34819900 PMC: 8606411. DOI: 10.3389/fpsyg.2021.769509.


The brain produces mind by modeling.

Shiffrin R, Bassett D, Kriegeskorte N, Tenenbaum J Proc Natl Acad Sci U S A. 2020; 117(47):29299-29301.

PMID: 33229525 PMC: 7703556. DOI: 10.1073/pnas.1912340117.

References
1.
Dawson M, Soulieres I, Gernsbacher M, Mottron L . The level and nature of autistic intelligence. Psychol Sci. 2007; 18(8):657-62. PMC: 4287210. DOI: 10.1111/j.1467-9280.2007.01954.x. View

2.
Herzog M, Ernst U, Etzold A, Eurich C . Local interactions in neural networks explain global effects in Gestalt processing and masking. Neural Comput. 2003; 15(9):2091-113. DOI: 10.1162/089976603322297304. View

3.
Wagemans J, Elder J, Kubovy M, Palmer S, Peterson M, Singh M . A century of Gestalt psychology in visual perception: I. Perceptual grouping and figure-ground organization. Psychol Bull. 2012; 138(6):1172-217. PMC: 3482144. DOI: 10.1037/a0029333. View

4.
Shepard R . Ecological constraints on internal representation: resonant kinematics of perceiving, imagining, thinking, and dreaming. Psychol Rev. 1984; 91(4):417-47. View

5.
Memisevic R, Hinton G . Learning to represent spatial transformations with factored higher-order Boltzmann machines. Neural Comput. 2010; 22(6):1473-92. DOI: 10.1162/neco.2010.01-09-953. View