Dopamine, Reward Learning, and Active Inference

Overview

Journal Front Comput Neurosci

Specialty Biology

Date 2015 Nov 20

PMID 26581305

Citations 46

Authors

Thomas H B FitzGerald

Raymond J Dolan

Karl Friston

Abstract

Temporal difference learning models propose that phasic dopamine signaling encodes reward prediction errors that drive learning. This is supported by studies in which optogenetic stimulation of dopamine neurons can stand in lieu of actual reward. Nevertheless, a large body of data also shows that dopamine is not necessary for learning, and that dopamine depletion primarily affects task performance. We offer a resolution to this paradox based on a hypothesis that dopamine encodes the precision of beliefs about alternative actions, and thus controls the outcome-sensitivity of behavior. We extend an active inference scheme for solving Markov decision processes to include learning, and show that simulated dopamine dynamics strongly resemble those actually observed during instrumental conditioning. Furthermore, simulated dopamine depletion impairs performance but spares learning, while simulated excitation of dopamine neurons drives reward learning through aberrant inference about outcome states. Our formal approach provides a novel and parsimonious reconciliation of apparently divergent experimental findings.
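
The idea that precision controls the outcome-sensitivity of behavior can be illustrated with a minimal sketch. This is not the authors' scheme: it assumes a fixed, hypothetical value for each policy and a fixed precision acting as an inverse temperature on a softmax, whereas the paper infers precision within an active inference model. The function name `policy_probabilities` and all numbers are illustrative assumptions.

```python
import math


def policy_probabilities(values, precision):
    """Softmax over policy values, scaled by a precision (inverse temperature).

    Illustrative assumption only: `values` are fixed numbers standing in for
    how good each policy's expected outcomes are.
    """
    scaled = [precision * v for v in values]
    peak = max(scaled)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical values: policy 0 leads to a better outcome than policy 1.
values = [1.0, 0.0]

# High precision: choice tracks the difference in outcomes closely.
high = policy_probabilities(values, precision=4.0)

# Low precision (loosely analogous to simulated depletion): the same learned
# values are still there, but choice is much less sensitive to them.
low = policy_probabilities(values, precision=0.25)

print("high precision:", [round(p, 3) for p in high])
print("low precision: ", [round(p, 3) for p in low])
```

In this toy setting the learned values are identical in both calls; only the precision differs, so lowering it degrades choice without altering what has been learned, which is the qualitative pattern the abstract describes.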

Citing Articles

Policy Complexity Suppresses Dopamine Responses.

Gershman S, Lak A. J Neurosci. 2025; 45(9).

PMID: 39788740 PMC: 11866995. DOI: 10.1523/JNEUROSCI.1756-24.2024.

Generalized cue reactivity in rat dopamine neurons after opioids.

Lehmann C, Miller N, Nair V, Costa K, Schoenbaum G, Moussawi K. Nat Commun. 2025; 16(1):321.

PMID: 39747036 PMC: 11697388. DOI: 10.1038/s41467-024-55504-3.

Policy complexity suppresses dopamine responses.

Gershman S, Lak A. bioRxiv. 2024.

PMID: 39345642 PMC: 11429712. DOI: 10.1101/2024.09.15.613150.

Dopamine-mediated formation of a memory module in the nucleus accumbens for goal-directed navigation.

Jung K, Krussel S, Yoo S, An M, Burke B, Schappaugh N. Nat Neurosci. 2024; 27(11):2178-2192.

PMID: 39333785 PMC: 11537966. DOI: 10.1038/s41593-024-01770-9.

An Integrated theory of false insights and beliefs under psychedelics.

McGovern H, Grimmer H, Doss M, Hutchinson B, Timmermann C, Lyon A. Commun Psychol. 2024; 2(1):69.

PMID: 39242747 PMC: 11332244. DOI: 10.1038/s44271-024-00120-6.

References

1. Wunderlich K, Dayan P, Dolan R. Mapping value based planning and extensively trained choice in the human brain. Nat Neurosci. 2012; 15(5):786-91. PMC: 3378641. DOI: 10.1038/nn.3068.

2. Pouget A, Beck J, Ma W, Latham P. Probabilistic brains: knowns and unknowns. Nat Neurosci. 2013; 16(9):1170-8. PMC: 4487650. DOI: 10.1038/nn.3495.

3. Schultz W, Dayan P, Montague P. A neural substrate of prediction and reward. Science. 1997; 275(5306):1593-9. DOI: 10.1126/science.275.5306.1593.

4. Salamone J, Correa M, Farrar A, Mingote S. Effort-related functions of nucleus accumbens dopamine and associated forebrain circuits. Psychopharmacology (Berl). 2007; 191(3):461-82. DOI: 10.1007/s00213-006-0668-9.

5. Clark A. Whatever next? Predictive brains, situated agents, and the future of cognitive science. Behav Brain Sci. 2013; 36(3):181-204. DOI: 10.1017/S0140525X12000477.

6. Friston K, Samothrakis S, Montague R. Active inference and agency: optimal control without cost functions. Biol Cybern. 2012; 106(8-9):523-41. DOI: 10.1007/s00422-012-0512-8.

7. Adams R, Stephan K, Brown H, Frith C, Friston K. The computational anatomy of psychosis. Front Psychiatry. 2013; 4:47. PMC: 3667557. DOI: 10.3389/fpsyt.2013.00047.

8. Friston K, Schwartenbeck P, Fitzgerald T, Moutoussis M, Behrens T, Dolan R. The anatomy of choice: dopamine and decision-making. Philos Trans R Soc Lond B Biol Sci. 2014; 369(1655). PMC: 4186234. DOI: 10.1098/rstb.2013.0481.

9. Collins A, Frank M. Opponent actor learning (OpAL): modeling interactive effects of striatal dopamine on reinforcement learning and choice incentive. Psychol Rev. 2014; 121(3):337-66. DOI: 10.1037/a0037015.

10. Robbins T, Everitt B. A role for mesencephalic dopamine in activation: commentary on Berridge (2006). Psychopharmacology (Berl). 2006; 191(3):433-7. DOI: 10.1007/s00213-006-0528-7.

11. Darvas M, Palmiter R. Restricting dopaminergic signaling to either dorsolateral or medial striatum facilitates cognition. J Neurosci. 2010; 30(3):1158-65. PMC: 3771669. DOI: 10.1523/JNEUROSCI.4576-09.2010.

12. Rossi M, Sukharnikova T, Hayrapetyan V, Yang L, Yin H. Operant self-stimulation of dopamine neurons in the substantia nigra. PLoS One. 2013; 8(6):e65799. PMC: 3673941. DOI: 10.1371/journal.pone.0065799.

13. Rutledge R, Lazzaro S, Lau B, Myers C, Gluck M, Glimcher P. Dopaminergic drugs modulate learning rates and perseveration in Parkinson's patients in a dynamic foraging task. J Neurosci. 2009; 29(48):15104-14. PMC: 3376711. DOI: 10.1523/JNEUROSCI.3524-09.2009.

14. Schwartenbeck P, Fitzgerald T, Dolan R, Friston K. Exploration, novelty, surprise, and free energy minimization. Front Psychol. 2013; 4:710. PMC: 3791848. DOI: 10.3389/fpsyg.2013.00710.

15. Friston K, Rigoli F, Ognibene D, Mathys C, Fitzgerald T, Pezzulo G. Active inference and epistemic value. Cogn Neurosci. 2015; 6(4):187-214. DOI: 10.1080/17588928.2015.1020053.

16. Dolan R, Dayan P. Goals and habits in the brain. Neuron. 2013; 80(2):312-25. PMC: 3807793. DOI: 10.1016/j.neuron.2013.09.007.

17. Shiner T, Seymour B, Wunderlich K, Hill C, Bhatia K, Dayan P. Dopamine and performance in a reinforcement learning task: evidence from Parkinson's disease. Brain. 2012; 135(Pt 6):1871-83. PMC: 3359751. DOI: 10.1093/brain/aws083.

18. Friston K. The free-energy principle: a unified brain theory? Nat Rev Neurosci. 2010; 11(2):127-38. DOI: 10.1038/nrn2787.

19. Lee S, Shimojo S, O'Doherty J. Neural computations underlying arbitration between model-based and model-free learning. Neuron. 2014; 81(3):687-99. PMC: 3968946. DOI: 10.1016/j.neuron.2013.11.028.

20. Friston K, Shiner T, Fitzgerald T, Galea J, Adams R, Brown H. Dopamine, affordance and active inference. PLoS Comput Biol. 2012; 8(1):e1002327. PMC: 3252266. DOI: 10.1371/journal.pcbi.1002327.