Distributional Language Learning: Mechanisms and Models of Ategory Formation

Overview

Journal Lang Learn

Date 2016 Feb 9

PMID 26855443

Citations 22

Authors

Richard N Aslin

Elissa L Newport

Affiliations

Soon will be listed here.

Abstract

In the past 15 years, a substantial body of evidence has confirmed that a powerful distributional learning mechanism is present in infants, children, adults and (at least to some degree) in nonhuman animals as well. The present article briefly reviews this literature and then examines some of the fundamental questions that must be addressed for any distributional learning mechanism to operate effectively within the linguistic domain. In particular, how does a naive learner determine the number of categories that are present in a corpus of linguistic input and what distributional cues enable the learner to assign individual lexical items to those categories? Contrary to the hypothesis that distributional learning and category (or rule) learning are separate mechanisms, the present article argues that these two seemingly different processes---acquiring specific structure from linguistic input and generalizing beyond that input to novel exemplars---actually represent a single mechanism. Evidence in support of this single-mechanism hypothesis comes from a series of artificial grammar-learning studies that not only demonstrate that adults can learn grammatical categories from distributional information alone, but that the specific patterning of distributional information among attested utterances in the learning corpus enables adults to generalize to novel utterances or to restrict generalization when unattested utterances are consistently absent from the learning corpus. Finally, a computational model of distributional learning that accounts for the presence or absence of generalization is reviewed and the implications of this model for linguistic-category learning are summarized.

Citing Articles

Enhancing Syntactic Knowledge in School-Age Children With Developmental Language Disorder: The Promise of Syntactic Priming.

Montgomery J, Gillam R, Plante E Am J Speech Lang Pathol. 2023; 33(2):580-597.

PMID: 37678208 PMC: 11001167. DOI: 10.1044/2023_AJSLP-23-00079.

Dynamics of Functional Networks for Syllable and Word-Level Processing.

Rimmele J, Sun Y, Michalareas G, Ghitza O, Poeppel D Neurobiol Lang (Camb). 2023; 4(1):120-144.

PMID: 37229144 PMC: 10205074. DOI: 10.1162/nol_a_00089.

Kindergarteners Use Cross-Situational Statistics to Infer the Meaning of Grammatical Elements.

Spit S, Andringa S, Rispens J, Aboh E J Psycholinguist Res. 2022; 51(6):1311-1333.

PMID: 35794402 PMC: 9646556. DOI: 10.1007/s10936-022-09898-0.

Linking the neural basis of distributional statistical learning with transitional statistical learning: The paradox of attention.

Schneider J, Weng Y, Hu A, Qi Z Neuropsychologia. 2022; 172:108284.

PMID: 35667495 PMC: 10286817. DOI: 10.1016/j.neuropsychologia.2022.108284.

One model for the learning of language.

Yang Y, Piantadosi S Proc Natl Acad Sci U S A. 2022; 119(5).

PMID: 35074868 PMC: 8812683. DOI: 10.1073/pnas.2021865119.

References

Wu R, Gopnik A, Richardson D, Kirkham N . Infants learn about objects from statistics and people. Dev Psychol. 2011; 47(5):1220-9. DOI: 10.1037/a0024023. View

Shukla M, White K, Aslin R . Prosody guides the rapid mapping of auditory word forms onto visual objects in 6-mo-old infants. Proc Natl Acad Sci U S A. 2011; 108(15):6038-43. PMC: 3076873. DOI: 10.1073/pnas.1017617108. View

Saffran J, Pollak S, Seibel R, Shkolnik A . Dog is a dog is a dog: infant rule learning is not specific to language. Cognition. 2006; 105(3):669-80. PMC: 2066190. DOI: 10.1016/j.cognition.2006.11.004. View

Johnson S, Fernandas K, Frank M, Kirkham N, Marcus G, Rabagliati H . Abstract Rule Learning for Visual Sequences in 8- and 11-Month-Olds. Infancy. 2009; 14(1):2-18. PMC: 2654175. DOI: 10.1080/15250000802569611. View

Shukla M, Nespor M, Mehler J . An interaction between prosody and statistics in the segmentation of fluent speech. Cogn Psychol. 2006; 54(1):1-32. DOI: 10.1016/j.cogpsych.2006.04.002. View

Mintz T . Category induction from distributional cues in an artificial language. Mem Cognit. 2002; 30(5):678-86. DOI: 10.3758/bf03196424. View

Saffran J, Johnson E, Aslin R, Newport E . Statistical learning of tone sequences by human infants and adults. Cognition. 1999; 70(1):27-52. DOI: 10.1016/s0010-0277(98)00075-4. View

de la Mora D, Toro J . Rule learning over consonants and vowels in a non-human animal. Cognition. 2012; 126(2):307-12. PMC: 4217073. DOI: 10.1016/j.cognition.2012.09.015. View

Gentner T, Fenn K, Margoliash D, Nusbaum H . Recursive syntactic pattern learning by songbirds. Nature. 2006; 440(7088):1204-7. PMC: 2653278. DOI: 10.1038/nature04675. View

10.

Endress A, Bonatti L . Rapid learning of syllable classes from a perceptually continuous speech stream. Cognition. 2006; 105(2):247-99. DOI: 10.1016/j.cognition.2006.09.010. View

11.

Toro J, Sinnett S, Soto-Faraco S . Speech segmentation by statistical learning depends on attention. Cognition. 2005; 97(2):B25-34. DOI: 10.1016/j.cognition.2005.01.006. View

12.

Swingley D . Statistical clustering and the contents of the infant vocabulary. Cogn Psychol. 2004; 50(1):86-132. DOI: 10.1016/j.cogpsych.2004.06.001. View

13.

Mintz T . Frequent frames as a cue for grammatical categories in child directed speech. Cognition. 2003; 90(1):91-117. DOI: 10.1016/s0010-0277(03)00140-9. View

14.

EIMAS P, SIQUELAND E, Jusczyk P, Vigorito J . Speech perception in infants. Science. 1971; 171(3968):303-6. DOI: 10.1126/science.171.3968.303. View

15.

Marcus G, Fernandes K, Johnson S . Infant rule learning facilitated by speech. Psychol Sci. 2007; 18(5):387-91. DOI: 10.1111/j.1467-9280.2007.01910.x. View

16.

Thiessen E, Saffran J . When cues collide: use of stress and statistical cues to word boundaries by 7- to 9-month-old infants. Dev Psychol. 2003; 39(4):706-16. DOI: 10.1037/0012-1649.39.4.706. View

17.

Gerken L . Decisions, decisions: infant language learning when multiple generalizations are possible. Cognition. 2005; 98(3):B67-74. DOI: 10.1016/j.cognition.2005.03.003. View

18.

Creel S, Newport E, Aslin R . Distant melodies: statistical learning of nonadjacent dependencies in tone sequences. J Exp Psychol Learn Mem Cogn. 2004; 30(5):1119-30. DOI: 10.1037/0278-7393.30.5.1119. View

19.

Reeder P, Newport E, Aslin R . From shared contexts to syntactic categories: the role of distributional information in learning linguistic form-classes. Cogn Psychol. 2012; 66(1):30-54. PMC: 3621024. DOI: 10.1016/j.cogpsych.2012.09.001. View

20.

Bonatti L, Pena M, Nespor M, Mehler J . Linguistic constraints on statistical computations: the role of consonants and vowels in continuous speech processing. Psychol Sci. 2005; 16(6):451-9. DOI: 10.1111/j.0956-7976.2005.01556.x. View