» Articles » PMID: 39712129

On the Ability of Standard and Brain-constrained Deep Neural Networks to Support Cognitive Superposition: a Position Paper

Overview
Journal Cogn Neurodyn
Publisher Springer
Specialty Neurology
Date 2024 Dec 23
PMID 39712129
Authors
Affiliations
Soon will be listed here.
Abstract

The ability to coactivate (or "superpose") multiple conceptual representations is a fundamental function that we constantly rely upon; this is crucial in complex cognitive tasks requiring multi-item working memory, such as mental arithmetic, abstract reasoning, and language comprehension. As such, an artificial system aspiring to implement any of these aspects of general intelligence should be able to support this operation. I argue here that standard, feed-forward deep neural networks (DNNs) are unable to implement this function, whereas an alternative, fully brain-constrained class of neural architectures spontaneously exhibits it. On the basis of novel simulations, this proof-of-concept article shows that deep, brain-like networks trained with biologically realistic Hebbian learning mechanisms display the spontaneous emergence of internal circuits (cell assemblies) having features that make them natural candidates for supporting superposition. Building on previous computational modelling results, I also argue that, and offer an explanation as to why, in contrast, modern DNNs trained with gradient descent are generally unable to co-activate their internal representations. While deep brain-constrained neural architectures spontaneously develop the ability to support superposition as a result of (1) neurophysiologically accurate learning and (2) cortically realistic between-area connections, backpropagation-trained DNNs appear to be unsuited to implement this basic cognitive operation, arguably necessary for abstract thinking and general intelligence. The implications of this observation are briefly discussed in the larger context of existing and future artificial intelligence systems and neuro-realistic computational models.

References
1.
Artola A, Brocher S, Singer W . Different voltage-dependent thresholds for inducing long-term depression and long-term potentiation in slices of rat visual cortex. Nature. 1990; 347(6288):69-72. DOI: 10.1038/347069a0. View

2.
Vaz A, Wittig Jr J, Inati S, Zaghloul K . Replay of cortical spiking sequences during human memory retrieval. Science. 2020; 367(6482):1131-1134. PMC: 7211396. DOI: 10.1126/science.aba0672. View

3.
Ursino M, Cesaretti N, Pirazzini G . A model of working memory for encoding multiple items and ordered sequences exploiting the theta-gamma code. Cogn Neurodyn. 2023; 17(2):489-521. PMC: 10050512. DOI: 10.1007/s11571-022-09836-9. View

4.
Liang F, Li H, Chou X, Zhou M, Zhang N, Xiao Z . Sparse Representation in Awake Auditory Cortex: Cell-type Dependence, Synaptic Mechanisms, Developmental Emergence, and Modulation. Cereb Cortex. 2018; 29(9):3796-3812. PMC: 6686756. DOI: 10.1093/cercor/bhy260. View

5.
Petrides M, Pandya D . Distinct parietal and temporal pathways to the homologues of Broca's area in the monkey. PLoS Biol. 2009; 7(8):e1000170. PMC: 2714989. DOI: 10.1371/journal.pbio.1000170. View