» Articles » PMID: 37662223

A Mathematical Theory of Relational Generalization in Transitive Inference

Overview
Journal bioRxiv
Date 2023 Sep 4
PMID 37662223
Authors
Affiliations
Soon will be listed here.
Abstract

Humans and animals routinely infer relations between different items or events and generalize these relations to novel combinations of items. This allows them to respond appropriately to radically novel circumstances and is fundamental to advanced cognition. However, how learning systems (including the brain) can implement the necessary inductive biases has been unclear. Here we investigated transitive inference (TI), a classic relational task paradigm in which subjects must learn a relation ( > and > ) and generalize it to new combinations of items ( > ). Through mathematical analysis, we found that a broad range of biologically relevant learning models (e.g. gradient flow or ridge regression) perform TI successfully and recapitulate signature behavioral patterns long observed in living subjects. First, we found that models with item-wise additive representations automatically encode transitive relations. Second, for more general representations, a single scalar "conjunctivity factor" determines model behavior on TI and, further, the principle of norm minimization (a standard statistical inductive bias) enables models with fixed, partly conjunctive representations to generalize transitively. Finally, neural networks in the "rich regime," which enables representation learning and has been found to improve generalization, unexpectedly show poor generalization and anomalous behavior. We find that such networks implement a form of norm minimization (over hidden weights) that yields a local encoding mechanism lacking transitivity. Our findings show how minimal statistical learning principles give rise to a classical relational inductive bias (transitivity), explain empirically observed behaviors, and establish a formal approach to understanding the neural basis of relational abstraction.

References
1.
Hinton E, Dymond S, von Hecker U, Evans C . Neural correlates of relational reasoning and the symbolic distance effect: involvement of parietal cortex. Neuroscience. 2010; 168(1):138-48. DOI: 10.1016/j.neuroscience.2010.03.052. View

2.
Komorowski R, Manns J, Eichenbaum H . Robust conjunctive item-place coding by hippocampal neurons parallels learning what happens where. J Neurosci. 2009; 29(31):9918-29. PMC: 2746931. DOI: 10.1523/JNEUROSCI.1378-09.2009. View

3.
Bryson J, Leong J . Primate errors in transitive 'inference': a two-tier learning model. Anim Cogn. 2006; 10(1):1-15. DOI: 10.1007/s10071-006-0024-9. View

4.
Vasconcelos M . Transitive inference in non-human animals: an empirical and theoretical analysis. Behav Processes. 2008; 78(3):313-34. DOI: 10.1016/j.beproc.2008.02.017. View

5.
Mahowald K, Ivanova A, Blank I, Kanwisher N, Tenenbaum J, Fedorenko E . Dissociating language and thought in large language models. Trends Cogn Sci. 2024; 28(6):517-540. PMC: 11416727. DOI: 10.1016/j.tics.2024.01.011. View