Dopamine-mediated Learning and Switching in Cortico-striatal Circuit Explain Behavioral Changes in Reinforcement Learning

Overview

Journal Front Behav Neurosci

Specialty Psychology

Date 2011 Apr 8

PMID 21472026

Citations 32

Authors

Simon Hong

Okihide Hikosaka

Affiliations

Soon will be listed here.

Abstract

The basal ganglia are thought to play a crucial role in reinforcement learning. Central to the learning mechanism are dopamine (DA) D1 and D2 receptors located in the cortico-striatal synapses. However, it is still unclear how this DA-mediated synaptic plasticity is deployed and coordinated during reward-contingent behavioral changes. Here we propose a computational model of reinforcement learning that uses different thresholds of D1- and D2-mediated synaptic plasticity which are antagonized by DA-independent synaptic plasticity. A phasic increase in DA release caused by a larger-than-expected reward induces long-term potentiation (LTP) in the direct pathway, whereas a phasic decrease in DA release caused by a smaller-than-expected reward induces a cessation of long-term depression, leading to LTP in the indirect pathway. This learning mechanism can explain the robust behavioral adaptation observed in a location-reward-value-association task where the animal makes shorter latency saccades to reward locations. The changes in saccade latency become quicker as the monkey becomes more experienced. This behavior can be explained by a switching mechanism which activates the cortico-striatal circuit selectively. Our model also shows how D1- or D2-receptor blocking experiments affect selectively either reward or no-reward trials. The proposed mechanisms also explain the behavioral changes in Parkinson's disease.

Citing Articles

Implication of regional selectivity of dopamine deficits in impaired suppressing of involuntary movements in Parkinson's disease.

Lee H, Kim H, Hikosaka O Neurosci Biobehav Rev. 2024; 162:105719.

PMID: 38759470 PMC: 11167649. DOI: 10.1016/j.neubiorev.2024.105719.

Detecting Subtle Cognitive Impairment in Patients with Parkinson's Disease and Normal Cognition: A Novel Cognitive Control Challenge Task (C3T).

Resnik Robida K, Politakis V, Oblak A, Slana Ozimic A, Burger H, Pirtosek Z Brain Sci. 2023; 13(6).

PMID: 37371439 PMC: 10296602. DOI: 10.3390/brainsci13060961.

Lateral habenula neurons signal step-by-step changes of reward prediction.

Lee H, Hikosaka O iScience. 2022; 25(11):105440.

PMID: 36388993 PMC: 9641246. DOI: 10.1016/j.isci.2022.105440.

A role for adaptive developmental plasticity in learning and decision making.

Lin W, Delevich K, Wilbrecht L Curr Opin Behav Sci. 2022; 36:48-54.

PMID: 35891805 PMC: 9311400. DOI: 10.1016/j.cobeha.2020.07.010.

Response Systems, Antagonistic Responses, and the Behavioral Repertoire.

Ortu D, Bugg R Front Behav Neurosci. 2022; 15:778420.

PMID: 35095436 PMC: 8792759. DOI: 10.3389/fnbeh.2021.778420.

References

Richfield E, Penney J, Young A . Anatomical and affinity state comparisons between dopamine D1 and D2 receptors in the rat central nervous system. Neuroscience. 1989; 30(3):767-77. DOI: 10.1016/0306-4522(89)90168-1. View

Breitenstein C, Korsukewitz C, Floel A, Kretzschmar T, Diederich K, Knecht S . Tonic dopaminergic stimulation impairs associative learning in healthy subjects. Neuropsychopharmacology. 2006; 31(11):2552-64. DOI: 10.1038/sj.npp.1301167. View

Schultz W, Apicella P, Scarnati E, LJUNGBERG T . Neuronal activity in monkey ventral striatum related to the expectation of reward. J Neurosci. 1992; 12(12):4595-610. PMC: 6575755. View

Shen W, Flajolet M, Greengard P, Surmeier D . Dichotomous dopaminergic control of striatal synaptic plasticity. Science. 2008; 321(5890):848-51. PMC: 2833421. DOI: 10.1126/science.1160575. View

Jaber M, Robinson S, Missale C, Caron M . Dopamine receptors and brain function. Neuropharmacology. 1996; 35(11):1503-19. DOI: 10.1016/s0028-3908(96)00100-1. View

Vermersch A, Rivaud S, Vidailhet M, Bonnet A, Gaymard B, Agid Y . Sequences of memory-guided saccades in Parkinson's disease. Ann Neurol. 1994; 35(4):487-90. DOI: 10.1002/ana.410350419. View

Behrman A, Cauraugh J, Light K . Practice as an intervention to improve speeded motor performance and motor learning in Parkinson's disease. J Neurol Sci. 2000; 174(2):127-36. DOI: 10.1016/s0022-510x(00)00267-7. View

Takikawa Y, Kawagoe R, Hikosaka O . A possible role of midbrain dopamine neurons in short- and long-term adaptation of saccades to position-reward mapping. J Neurophysiol. 2004; 92(4):2520-9. DOI: 10.1152/jn.00238.2004. View

Morris G, Arkadir D, Nevet A, Vaadia E, Bergman H . Coincident but distinct messages of midbrain dopamine and striatal tonically active neurons. Neuron. 2004; 43(1):133-43. DOI: 10.1016/j.neuron.2004.06.012. View

10.

Kawagoe R, Takikawa Y, Hikosaka O . Expectation of reward modulates cognitive signals in the basal ganglia. Nat Neurosci. 1999; 1(5):411-6. DOI: 10.1038/1625. View

11.

Schall J, Hanes D, Thompson K, King D . Saccade target selection in frontal eye field of macaque. I. Visual and premovement activation. J Neurosci. 1995; 15(10):6905-18. PMC: 6577995. View

12.

Hikosaka O, Sakamoto M, Usui S . Functional properties of monkey caudate neurons. III. Activities related to expectation of target and reward. J Neurophysiol. 1989; 61(4):814-32. DOI: 10.1152/jn.1989.61.4.814. View

13.

Kravitz A, Freeze B, Parker P, Kay K, Thwin M, Deisseroth K . Regulation of parkinsonian motor behaviours by optogenetic control of basal ganglia circuitry. Nature. 2010; 466(7306):622-6. PMC: 3552484. DOI: 10.1038/nature09159. View

14.

Schultz W . Behavioral dopamine signals. Trends Neurosci. 2007; 30(5):203-10. DOI: 10.1016/j.tins.2007.03.007. View

15.

Frank M, Samanta J, Moustafa A, Sherman S . Hold your horses: impulsivity, deep brain stimulation, and medication in parkinsonism. Science. 2007; 318(5854):1309-12. DOI: 10.1126/science.1146157. View

16.

Darbaky Y, Baunez C, Arecchi P, Legallet E, Apicella P . Reward-related neuronal activity in the subthalamic nucleus of the monkey. Neuroreport. 2005; 16(11):1241-4. DOI: 10.1097/00001756-200508010-00022. View

17.

Yin H, Mulcare S, Hilario M, Clouse E, Holloway T, Davis M . Dynamic reorganization of striatal circuits during the acquisition and consolidation of a skill. Nat Neurosci. 2009; 12(3):333-41. PMC: 2774785. DOI: 10.1038/nn.2261. View

18.

Lauwereyns J, Watanabe K, Coe B, Hikosaka O . A neural correlate of response bias in monkey caudate nucleus. Nature. 2002; 418(6896):413-7. DOI: 10.1038/nature00892. View

19.

Pessiglione M, Seymour B, Flandin G, Dolan R, Frith C . Dopamine-dependent prediction errors underpin reward-seeking behaviour in humans. Nature. 2006; 442(7106):1042-5. PMC: 2636869. DOI: 10.1038/nature05051. View

20.

Cunnington R, Lalouschek W, Dirnberger G, Walla P, Lindinger G, Asenbaum S . A medial to lateral shift in pre-movement cortical activity in hemi-Parkinson's disease. Clin Neurophysiol. 2001; 112(4):608-18. DOI: 10.1016/s1388-2457(01)00467-9. View