A Silent Eligibility Trace Enables Dopamine-dependent Synaptic Plasticity for Reinforcement Learning in the Mouse Striatum
Overview
Affiliations
Dopamine-dependent synaptic plasticity is a candidate mechanism for reinforcement learning. A silent eligibility trace - initiated by synaptic activity and transformed into synaptic strengthening by later action of dopamine - has been hypothesized to explain the retroactive effect of dopamine in reinforcing past behaviour. We tested this hypothesis by measuring time-dependent modulation of synaptic plasticity by dopamine in adult mouse striatum, using whole-cell recordings. Presynaptic activity followed by postsynaptic action potentials (pre-post) caused spike-timing-dependent long-term depression in D1-expressing neurons, but not in D2 neurons, and not if postsynaptic activity followed presynaptic activity. Subsequent experiments focused on D1 neurons. Applying a dopamine D1 receptor agonist during induction of pre-post plasticity caused long-term potentiation. This long-term potentiation was hidden by long-term depression occurring concurrently and was unmasked when long-term depression blocked an L-type calcium channel antagonist. Long-term potentiation was blocked by a Ca -permeable AMPA receptor antagonist but not by an NMDA antagonist or an L-type calcium channel antagonist. Pre-post stimulation caused transient elevation of rectification - a marker for expression of Ca -permeable AMPA receptors - for 2-4-s after stimulation. To test for an eligibility trace, dopamine was uncaged at specific time points before and after pre- and postsynaptic conjunction of activity. Dopamine caused potentiation selectively at synapses that were active 2-s before dopamine release, but not at earlier or later times. Our results provide direct evidence for a silent eligibility trace in the synapses of striatal neurons. This dopamine-timing-dependent plasticity may play a central role in reinforcement learning.
Choi J, Amjad U, Murray R, Shrivastav R, Teichert T, Goodell B bioRxiv. 2025; .
PMID: 39990309 PMC: 11844372. DOI: 10.1101/2025.02.10.636943.
Lee H Sensors (Basel). 2025; 25(3).
PMID: 39943618 PMC: 11820235. DOI: 10.3390/s25030979.
A minimal model of cognition based on oscillatory and current-based reinforcement processes.
Gyllingberg L, Tian Y, Sumpter D J R Soc Interface. 2025; 22(222):rsif20240402.
PMID: 39837485 PMC: 11750385. DOI: 10.1098/rsif.2024.0402.
Dopamine builds and reveals reward-associated latent behavioral attractors.
Naude J, Sarazin M, Mondoloni S, Hannesse B, Vicq E, Amegandjin F Nat Commun. 2024; 15(1):9825.
PMID: 39537606 PMC: 11561151. DOI: 10.1038/s41467-024-53976-x.
Seo I, Lee H Sensors (Basel). 2024; 24(19).
PMID: 39409459 PMC: 11479366. DOI: 10.3390/s24196419.