
A Deep-learning Framework for Human Perception of Abstract Art Composition

Overview
Journal: J Vis
Specialty: Ophthalmology
Date: 2021 May 11
PMID: 33974037
Citations: 2
Abstract

Artistic composition (the structural organization of pictorial elements) is often characterized by basic rules and heuristics, but art history does not offer quantitative tools for segmenting individual elements or for measuring their interactions and related operations. To discover whether a metric description of this kind is even possible, we exploit a deep-learning algorithm that attempts to capture the perceptual mechanism underlying composition in humans. We rely on a robust behavioral marker with known relevance to higher-level vision: orientation judgements, that is, telling whether a painting is hung "right-side up." Humans can perform this task even for abstract paintings. To account for this finding, existing models rely on "meaningful" content or specific image statistics, often in accordance with explicit rules from art theory. Our approach does not commit to any such assumptions or schemes, yet it outperforms previous models and does so on a larger database encompassing a wide range of painting styles. Moreover, our model correctly reproduces human performance across several measurements from a new web-based experiment designed to test whole paintings as well as painting fragments matched to the receptive-field size of different depths in the model. Using this approach, we show that our deep-learning model captures relevant characteristics of human orientation perception across styles and granularities. Interestingly, the more abstract the painting, the more our model relies on extended spatial integration of cues, a property supported by deeper layers.
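The two mechanical ingredients the abstract mentions, rotated copies of a painting for an orientation task and fragments matched to a layer's receptive-field size, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the network architecture, the set of rotations, or the fragment sizes. The function names, the four-way 90-degree rotation scheme, and the example layer stack below are all hypothetical, and the receptive-field formula assumes a plain sequential stack of convolution or pooling layers without dilation.

```python
import numpy as np


def make_orientation_examples(image):
    """Return (rotated_image, label) pairs for the four 90-degree rotations.

    Label k means the image was rotated k * 90 degrees counterclockwise,
    so label 0 is the assumed "right-side up" orientation.
    (Hypothetical labeling scheme, not taken from the paper.)
    """
    return [(np.rot90(image, k=k, axes=(0, 1)), k) for k in range(4)]


def receptive_field_sizes(layers):
    """Receptive-field size, in input pixels, after each layer.

    `layers` is a list of (kernel_size, stride) tuples for a plain
    sequential stack of conv/pool layers with no dilation.
    """
    rf, jump = 1, 1  # jump = spacing of adjacent units, in input pixels
    sizes = []
    for kernel, stride in layers:
        rf += (kernel - 1) * jump
        jump *= stride
        sizes.append(rf)
    return sizes


def crop_fragment(image, size, top, left):
    """Cut a square fragment of side `size` from an H x W x C image."""
    return image[top:top + size, left:left + size]


# Hypothetical layer stack: two 3x3 convs, a 2x2 pool, one more 3x3 conv.
example_layers = [(3, 1), (3, 1), (2, 2), (3, 1)]
rf_per_depth = receptive_field_sizes(example_layers)

# Stand-in for a painting: a random 64 x 48 RGB array.
painting = np.random.default_rng(0).random((64, 48, 3))
examples = make_orientation_examples(painting)

# One fragment per depth, each as large as that depth's receptive field.
fragments = [crop_fragment(painting, s, top=0, left=0) for s in rf_per_depth]
```

Under these assumptions, deeper layers correspond to larger fragments, which is the sense in which a fragment can be "matched" to a given depth of the model.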

Citing Articles

The perceptual primacy of feeling: Affectless visual machines explain a majority of variance in human visually evoked affect.

Conwell C, Graham D, Boccagno C, Vessel E. Proc Natl Acad Sci U S A. 2025; 122(4):e2306025121.

PMID: 39847334 PMC: 11789064. DOI: 10.1073/pnas.2306025121.


How deep is your art: An experimental study on the limits of artistic understanding in a single-task, single-modality neural network.

Agha Zahedi M, Gholamrezaei N, Doboli A. PLoS One. 2024; 19(11):e0305943.

PMID: 39504315 PMC: 11540182. DOI: 10.1371/journal.pone.0305943.


Universality and superiority in preference for chromatic composition of art paintings.

Nakauchi S, Kondo T, Kinzuka Y, Taniyama Y, Tamura H, Higashi H. Sci Rep. 2022; 12(1):4294.

PMID: 35277597 PMC: 8917196. DOI: 10.1038/s41598-022-08365-z.
