On the Ability of Standard and Brain-constrained Deep Neural Networks to Support Cognitive Superposition: a Position Paper

Overview

Journal Cogn Neurodyn

Publisher Springer

Specialty Neurology

Date 2024 Dec 23

PMID 39712129

Authors

Max Garagnani

Abstract

The ability to coactivate (or "superpose") multiple conceptual representations is a fundamental function that we constantly rely upon; this is crucial in complex cognitive tasks requiring multi-item working memory, such as mental arithmetic, abstract reasoning, and language comprehension. As such, an artificial system aspiring to implement any of these aspects of general intelligence should be able to support this operation. I argue here that standard, feed-forward deep neural networks (DNNs) are unable to implement this function, whereas an alternative, fully brain-constrained class of neural architectures spontaneously exhibits it. On the basis of novel simulations, this proof-of-concept article shows that deep, brain-like networks trained with biologically realistic Hebbian learning mechanisms display the spontaneous emergence of internal circuits (cell assemblies) having features that make them natural candidates for supporting superposition. Building on previous computational modelling results, I also argue that, and offer an explanation as to why, in contrast, modern DNNs trained with gradient descent are generally unable to co-activate their internal representations. While deep brain-constrained neural architectures spontaneously develop the ability to support superposition as a result of (1) neurophysiologically accurate learning and (2) cortically realistic between-area connections, backpropagation-trained DNNs appear to be unsuited to implement this basic cognitive operation, arguably necessary for abstract thinking and general intelligence. The implications of this observation are briefly discussed in the larger context of existing and future artificial intelligence systems and neuro-realistic computational models.
