» Articles » PMID: 37082448

What is Dopamine Doing in Model-based Reinforcement Learning?

Overview
Date 2023 Apr 21
PMID 37082448
Authors
Affiliations
Soon will be listed here.
Abstract

Experiments have implicated dopamine in model-based reinforcement learning (RL). These findings are unexpected as dopamine is thought to encode a reward prediction error (RPE), which is the key teaching signal in model-free RL. Here we examine two possible accounts for dopamine's involvement in model-based RL: the first that dopamine neurons carry a prediction error used to update a type of predictive state representation called a successor representation, the second that two well established aspects of dopaminergic activity, RPEs and surprise signals, can together explain dopamine's involvement in model-based RL.

Citing Articles

Devaluing memories of reward: a case for dopamine.

Fry B, Russell N, Fex V, Mo B, Pence N, Beatty J Commun Biol. 2025; 8(1):161.

PMID: 39900665 PMC: 11790953. DOI: 10.1038/s42003-024-07440-7.


The curious case of dopaminergic prediction errors and learning associative information beyond value.

Kahnt T, Schoenbaum G Nat Rev Neurosci. 2025; 26(3):169-178.

PMID: 39779974 DOI: 10.1038/s41583-024-00898-8.


Biomarker discovery using machine learning in the psychosis spectrum.

Yassin W, Loedige K, Wannan C, Holton K, Chevinsky J, Torous J Biomark Neuropsychiatry. 2024; 11.

PMID: 39687745 PMC: 11649307. DOI: 10.1016/j.bionps.2024.100107.


Dopamine Release in the Nucleus Accumbens Core Encodes the General Excitatory Components of Learning.

Taira M, Millard S, Verghese A, DiFazio L, Hoang I, Jia R J Neurosci. 2024; 44(35).

PMID: 38969504 PMC: 11358529. DOI: 10.1523/JNEUROSCI.0120-24.2024.


Dopamine Increases Accuracy and Lengthens Deliberation Time in Explicit Motor Skill Learning.

Leow L, Bernheine L, Carroll T, Dux P, Filmer H eNeuro. 2024; 11(1).

PMID: 38238069 PMC: 10849023. DOI: 10.1523/ENEURO.0360-23.2023.


References
1.
Howe M, Dombeck D . Rapid signalling in distinct dopaminergic axons during locomotion and reward. Nature. 2016; 535(7613):505-10. PMC: 4970879. DOI: 10.1038/nature18942. View

2.
Takahashi Y, Batchelor H, Liu B, Khanna A, Morales M, Schoenbaum G . Dopamine Neurons Respond to Errors in the Prediction of Sensory Features of Expected Rewards. Neuron. 2017; 95(6):1395-1405.e3. PMC: 5658021. DOI: 10.1016/j.neuron.2017.08.025. View

3.
Deserno L, Huys Q, Boehme R, Buchert R, Heinze H, Grace A . Ventral striatal dopamine reflects behavioral and neural signatures of model-based control during sequential decision making. Proc Natl Acad Sci U S A. 2015; 112(5):1595-600. PMC: 4321318. DOI: 10.1073/pnas.1417219112. View

4.
Starkweather C, Babayan B, Uchida N, Gershman S . Dopamine reward prediction errors reflect hidden-state inference across time. Nat Neurosci. 2017; 20(4):581-589. PMC: 5374025. DOI: 10.1038/nn.4520. View

5.
Lerner T, Shilyansky C, Davidson T, Evans K, Beier K, Zalocusky K . Intact-Brain Analyses Reveal Distinct Information Carried by SNc Dopamine Subcircuits. Cell. 2015; 162(3):635-47. PMC: 4790813. DOI: 10.1016/j.cell.2015.07.014. View