Redundancy in Perceptual and Linguistic Experience: Comparing Feature-based and Distributional Models of Semantic Representation
Since their inception, distributional models of semantics have been criticized as inadequate cognitive theories of human semantic learning and representation. A principal challenge is that the representations derived by distributional models are purely symbolic and are not grounded in perception and action; this challenge has led many to favor feature-based models of semantic representation. We argue that the amount of perceptual and other semantic information that can be learned from purely distributional statistics has been underappreciated. We compare the representations of three feature-based and nine distributional models using a semantic clustering task. Several distributional models demonstrated semantic clustering comparable with clustering based on feature-based representations. Furthermore, when trained on child-directed speech, the same distributional models performed as well as sensorimotor-based feature representations of children's lexical semantic knowledge. These results suggest that, to a large extent, information relevant for extracting semantic categories is redundantly coded in perceptual and linguistic experience. Detailed analyses of the semantic clusters of the feature-based and distributional models also reveal that the models make use of complementary cues to semantic organization from the two data streams. Rather than conceptualizing feature-based and distributional models as competing theories, we argue that future work should focus on understanding the cognitive mechanisms humans use to integrate the two sources.