A Hypothesis for Basal Ganglia-dependent Reinforcement Learning in the Songbird
Most of our motor skills are not innately programmed, but are learned by a combination of motor exploration and performance evaluation, suggesting that this learning proceeds through a reinforcement learning (RL) mechanism. Songbirds have emerged as a model system to study how a complex behavioral sequence can be learned through an RL-like strategy. Interestingly, like motor sequence learning in mammals, song learning in birds requires a basal ganglia (BG)-thalamocortical loop, suggesting common neural mechanisms. Here, we outline a specific working hypothesis for how BG-forebrain circuits could use an internally computed reinforcement signal to direct song learning. Our model includes a number of general concepts borrowed from the mammalian BG literature, including a dopaminergic reward prediction error and dopamine-mediated plasticity at corticostriatal synapses. We also invoke a number of conceptual advances arising from recent observations in the songbird. Specifically, there is evidence for a specialized cortical circuit that adds trial-to-trial variability to stereotyped cortical motor programs, and for a role of the BG in "biasing" this variability to improve behavioral performance. This BG-dependent "premotor bias" may in turn guide plasticity in downstream cortical synapses to consolidate recently learned song changes. Given the similarity between mammalian and songbird BG-thalamocortical circuits, our model for the role of the BG in this process may have broader relevance to mammalian BG function.
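The ingredients named above (injected trial-to-trial variability, a reward prediction error, a BG-dependent premotor bias, and consolidation into the motor program) can be illustrated with a toy simulation. The following is a minimal sketch under strong simplifying assumptions, not the model proposed here: the song is reduced to a single scalar (for example, the pitch of one syllable), performance is the negative squared distance from a fixed target, and every name, learning rate, and update rule (`simulate_song_learning`, `eta_bias`, `eta_consolidate`, the running-average prediction) is hypothetical and chosen only for illustration.

```python
import random


def simulate_song_learning(target=1.0, trials=5000, sigma=0.2,
                           eta_bias=0.5, eta_consolidate=0.01, seed=0):
    """Toy scalar sketch of variability-driven RL (all parameters hypothetical)."""
    rng = random.Random(seed)
    motor = 0.0       # stereotyped cortical motor program
    bias = 0.0        # BG-dependent premotor bias
    expected = None   # running prediction of performance
    errors = []
    for _ in range(trials):
        # Cortical variability circuit: add trial-to-trial noise to the program.
        noise = rng.gauss(0.0, sigma)
        output = motor + bias + noise
        # Internally computed performance evaluation (higher is better).
        performance = -(output - target) ** 2
        if expected is None:
            expected = performance
        # Reward prediction error: was this rendition better than predicted?
        rpe = performance - expected
        # RPE-gated update: variations followed by positive RPE grow the bias.
        bias += eta_bias * rpe * noise
        # Slowly update the performance prediction (assumed running average).
        expected += 0.1 * (performance - expected)
        # Consolidation: the bias is gradually transferred to the motor program.
        transfer = eta_consolidate * bias
        motor += transfer
        bias -= transfer
        errors.append(abs(motor + bias - target))
    return motor, bias, errors


if __name__ == "__main__":
    motor, bias, errors = simulate_song_learning()
    print(f"initial error: {errors[0]:.3f}")
    print(f"mean error over last 500 trials: {sum(errors[-500:]) / 500:.3f}")
    print(f"consolidated motor program: {motor:.3f}, residual bias: {bias:.3f}")
```

In this sketch the bias term learns quickly from the correlation between the injected variability and the prediction error, while the motor program changes only through the slow transfer of that bias. This separation of timescales is one simple way to picture the bias-then-consolidate idea; the specific update rules are assumptions rather than claims about the underlying circuit.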