Distributional Language Learning: Mechanisms and Models of Ategory Formation
Overview
Authors
Affiliations
In the past 15 years, a substantial body of evidence has confirmed that a powerful distributional learning mechanism is present in infants, children, adults and (at least to some degree) in nonhuman animals as well. The present article briefly reviews this literature and then examines some of the fundamental questions that must be addressed for any distributional learning mechanism to operate effectively within the linguistic domain. In particular, how does a naive learner determine the number of categories that are present in a corpus of linguistic input and what distributional cues enable the learner to assign individual lexical items to those categories? Contrary to the hypothesis that distributional learning and category (or rule) learning are separate mechanisms, the present article argues that these two seemingly different processes---acquiring specific structure from linguistic input and generalizing beyond that input to novel exemplars---actually represent a single mechanism. Evidence in support of this single-mechanism hypothesis comes from a series of artificial grammar-learning studies that not only demonstrate that adults can learn grammatical categories from distributional information alone, but that the specific patterning of distributional information among attested utterances in the learning corpus enables adults to generalize to novel utterances or to restrict generalization when unattested utterances are consistently absent from the learning corpus. Finally, a computational model of distributional learning that accounts for the presence or absence of generalization is reviewed and the implications of this model for linguistic-category learning are summarized.
Montgomery J, Gillam R, Plante E Am J Speech Lang Pathol. 2023; 33(2):580-597.
PMID: 37678208 PMC: 11001167. DOI: 10.1044/2023_AJSLP-23-00079.
Dynamics of Functional Networks for Syllable and Word-Level Processing.
Rimmele J, Sun Y, Michalareas G, Ghitza O, Poeppel D Neurobiol Lang (Camb). 2023; 4(1):120-144.
PMID: 37229144 PMC: 10205074. DOI: 10.1162/nol_a_00089.
Kindergarteners Use Cross-Situational Statistics to Infer the Meaning of Grammatical Elements.
Spit S, Andringa S, Rispens J, Aboh E J Psycholinguist Res. 2022; 51(6):1311-1333.
PMID: 35794402 PMC: 9646556. DOI: 10.1007/s10936-022-09898-0.
Schneider J, Weng Y, Hu A, Qi Z Neuropsychologia. 2022; 172:108284.
PMID: 35667495 PMC: 10286817. DOI: 10.1016/j.neuropsychologia.2022.108284.
One model for the learning of language.
Yang Y, Piantadosi S Proc Natl Acad Sci U S A. 2022; 119(5).
PMID: 35074868 PMC: 8812683. DOI: 10.1073/pnas.2021865119.