A Mathematical Theory of Relational Generalization in Transitive Inference

Overview

Journal bioRxiv

Date 2023 Sep 4

PMID 37662223

Authors

Samuel Lippl

Kenneth Kay

Greg Jensen

Vincent P Ferrera

L F Abbott

Affiliations

Soon will be listed here.

Abstract

Humans and animals routinely infer relations between different items or events and generalize these relations to novel combinations of items. This allows them to respond appropriately to radically novel circumstances and is fundamental to advanced cognition. However, how learning systems (including the brain) can implement the necessary inductive biases has been unclear. Here we investigated transitive inference (TI), a classic relational task paradigm in which subjects must learn a relation ( > and > ) and generalize it to new combinations of items ( > ). Through mathematical analysis, we found that a broad range of biologically relevant learning models (e.g. gradient flow or ridge regression) perform TI successfully and recapitulate signature behavioral patterns long observed in living subjects. First, we found that models with item-wise additive representations automatically encode transitive relations. Second, for more general representations, a single scalar "conjunctivity factor" determines model behavior on TI and, further, the principle of norm minimization (a standard statistical inductive bias) enables models with fixed, partly conjunctive representations to generalize transitively. Finally, neural networks in the "rich regime," which enables representation learning and has been found to improve generalization, unexpectedly show poor generalization and anomalous behavior. We find that such networks implement a form of norm minimization (over hidden weights) that yields a local encoding mechanism lacking transitivity. Our findings show how minimal statistical learning principles give rise to a classical relational inductive bias (transitivity), explain empirically observed behaviors, and establish a formal approach to understanding the neural basis of relational abstraction.

References

Hinton E, Dymond S, von Hecker U, Evans C . Neural correlates of relational reasoning and the symbolic distance effect: involvement of parietal cortex. Neuroscience. 2010; 168(1):138-48. DOI: 10.1016/j.neuroscience.2010.03.052. View

Komorowski R, Manns J, Eichenbaum H . Robust conjunctive item-place coding by hippocampal neurons parallels learning what happens where. J Neurosci. 2009; 29(31):9918-29. PMC: 2746931. DOI: 10.1523/JNEUROSCI.1378-09.2009. View

Bryson J, Leong J . Primate errors in transitive 'inference': a two-tier learning model. Anim Cogn. 2006; 10(1):1-15. DOI: 10.1007/s10071-006-0024-9. View

Vasconcelos M . Transitive inference in non-human animals: an empirical and theoretical analysis. Behav Processes. 2008; 78(3):313-34. DOI: 10.1016/j.beproc.2008.02.017. View

Mahowald K, Ivanova A, Blank I, Kanwisher N, Tenenbaum J, Fedorenko E . Dissociating language and thought in large language models. Trends Cogn Sci. 2024; 28(6):517-540. PMC: 11416727. DOI: 10.1016/j.tics.2024.01.011. View

Penn D, Povinelli D . Causal cognition in human and nonhuman animals: a comparative, critical review. Annu Rev Psychol. 2006; 58:97-118. DOI: 10.1146/annurev.psych.58.110405.085555. View

Nelli S, Braun L, Dumbalska T, Saxe A, Summerfield C . Neural knowledge assembly in humans and neural networks. Neuron. 2023; 111(9):1504-1516.e9. PMC: 10618408. DOI: 10.1016/j.neuron.2023.02.014. View

Kurth-Nelson Z, Behrens T, Wayne G, Miller K, Luettgau L, Dolan R . Replay and compositional computation. Neuron. 2023; 111(4):454-469. DOI: 10.1016/j.neuron.2022.12.028. View

Mckenzie S, Frank A, Kinsky N, Porter B, Riviere P, Eichenbaum H . Hippocampal representation of related and opposing memories develop within distinct, hierarchically organized neural schemas. Neuron. 2014; 83(1):202-15. PMC: 4082468. DOI: 10.1016/j.neuron.2014.05.019. View

10.

Kriegeskorte N, Mur M, Bandettini P . Representational similarity analysis - connecting the branches of systems neuroscience. Front Syst Neurosci. 2008; 2:4. PMC: 2605405. DOI: 10.3389/neuro.06.004.2008. View

11.

Munoz F, Jensen G, Kennedy B, Alkan Y, Terrace H, Ferrera V . Learned Representation of Implied Serial Order in Posterior Parietal Cortex. Sci Rep. 2020; 10(1):9386. PMC: 7287075. DOI: 10.1038/s41598-020-65838-9. View

12.

DE SOTO C, London M, Handel S . Social reasoning and spatial paralogic. J Pers Soc Psychol. 1965; 2(4):513-21. DOI: 10.1037/h0022492. View

13.

Merritt D, MacLean E, Jaffe S, Brannon E . A comparative analysis of serial ordering in ring-tailed lemurs (Lemur catta). J Comp Psychol. 2007; 121(4):363-71. PMC: 2953466. DOI: 10.1037/0735-7036.121.4.363. View

14.

Wilson R, Takahashi Y, Schoenbaum G, Niv Y . Orbitofrontal cortex as a cognitive map of task space. Neuron. 2014; 81(2):267-279. PMC: 4001869. DOI: 10.1016/j.neuron.2013.11.005. View

15.

Barnett S, Ceci S . When and where do we apply what we learn? A taxonomy for far transfer. Psychol Bull. 2002; 128(4):612-37. DOI: 10.1037/0033-2909.128.4.612. View

16.

Peake T, Terry A, McGregor P, Dabelsteen T . Do great tits assess rivals by combining direct experience with information gathered by eavesdropping?. Proc Biol Sci. 2002; 269(1503):1925-9. PMC: 1691105. DOI: 10.1098/rspb.2002.2112. View

17.

MacLean E, Merritt D, Brannon E . Social Complexity Predicts Transitive Reasoning in Prosimian Primates. Anim Behav. 2009; 76(2):479-486. PMC: 2598410. DOI: 10.1016/j.anbehav.2008.01.025. View

18.

Frankland S, Greene J . Two Ways to Build a Thought: Distinct Forms of Compositional Semantic Representation across Brain Regions. Cereb Cortex. 2020; 30(6):3838-3855. DOI: 10.1093/cercor/bhaa001. View

19.

Turrigiano G . The self-tuning neuron: synaptic scaling of excitatory synapses. Cell. 2008; 135(3):422-35. PMC: 2834419. DOI: 10.1016/j.cell.2008.10.008. View

20.

Acuna B, Sanes J, Donoghue J . Cognitive mechanisms of transitive inference. Exp Brain Res. 2002; 146(1):1-10. DOI: 10.1007/s00221-002-1092-y. View