A Cholinergic Feedback Circuit to Regulate Striatal Population Uncertainty and Optimize Reinforcement Learning
Authors
Affiliations
Convergent evidence suggests that the basal ganglia support reinforcement learning by adjusting action values according to reward prediction errors. However, adaptive behavior in stochastic environments requires the consideration of uncertainty to dynamically adjust the learning rate. We consider how cholinergic tonically active interneurons (TANs) may endow the striatum with such a mechanism in computational models spanning three Marr's levels of analysis. In the neural model, TANs modulate the excitability of spiny neurons, their population response to reinforcement, and hence the effective learning rate. Long TAN pauses facilitated robustness to spurious outcomes by increasing divergence in synaptic weights between neurons coding for alternative action values, whereas short TAN pauses facilitated stochastic behavior but increased responsiveness to change-points in outcome contingencies. A feedback control system allowed TAN pauses to be dynamically modulated by uncertainty across the spiny neuron population, allowing the system to self-tune and optimize performance across stochastic environments.
A TAN-dopamine interaction mechanism based computational model of basal ganglia in action selection.
Zhu Q, Han F, Yuan Y, Shen L Cogn Neurodyn. 2024; 18(5):2127-2144.
PMID: 39555280 PMC: 11564715. DOI: 10.1007/s11571-023-10046-0.
A mismatch between striatal cholinergic pauses and dopaminergic reward prediction errors.
Duhne M, Mohebi A, Kim K, Pelattini L, Berke J Proc Natl Acad Sci U S A. 2024; 121(41):e2410828121.
PMID: 39365823 PMC: 11474027. DOI: 10.1073/pnas.2410828121.
Song Y, Zhao S, Rong M, Liu Y, Gao Y, Chen W Behav Sci (Basel). 2024; 14(8).
PMID: 39199026 PMC: 11351138. DOI: 10.3390/bs14080630.
Szalisznyo K, Silverstein D Cogn Neurodyn. 2024; 18(1):217-232.
PMID: 38406202 PMC: 10881457. DOI: 10.1007/s11571-022-09865-4.
Acetylcholine modulates the precision of prediction error in the auditory cortex.
Perez-Gonzalez D, Lao-Rodriguez A, Aedo-Sanchez C, Malmierca M Elife. 2024; 12.
PMID: 38241174 PMC: 10942646. DOI: 10.7554/eLife.91475.