
A Predictive Reinforcement Model of Dopamine Neurons for Learning Approach Behavior

Overview
Specialties: Biology, Neurology
Date: 1999 Jul 16
PMID: 10406133
Citations: 25
Abstract

A neural network model of how dopamine and prefrontal cortex activity guides short- and long-term information processing within the cortico-striatal circuits during reward-related learning of approach behavior is proposed. The model predicts two types of reward-related neuronal responses generated during learning: (1) cell activity signaling errors in the prediction of the expected time of reward delivery and (2) neural activations coding for errors in the prediction of the amount and type of reward or stimulus expectancies. The former type of signal is consistent with the responses of dopaminergic neurons, while the latter signal is consistent with reward expectancy responses reported in the prefrontal cortex. It is shown that a neural network architecture that satisfies the design principles of the adaptive resonance theory of Carpenter and Grossberg (1987) can account for the dopamine responses to novelty, generalization, and discrimination of appetitive and aversive stimuli. These hypotheses are scrutinized via simulations of the model in relation to the delivery of free food outside a task, the timed contingent delivery of appetitive and aversive stimuli, and an asymmetric, instructed delay response task.
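The first signal type described above, an error in predicting the time of reward delivery, can be illustrated with a small simulation. The sketch below is not the adaptive resonance architecture proposed in the paper; it substitutes a standard temporal-difference (TD) learner with a tapped-delay-line stimulus representation, which is a common way to produce this kind of timing error. The trial length, cue time, reward time, learning rate, and all function names are assumptions chosen for illustration.

```python
import numpy as np

# Hypothetical trial layout (not taken from the paper).
N_STEPS, CUE_T, REWARD_T = 20, 5, 15


def value(w, t):
    """Predicted future reward at step t; zero before the cue appears."""
    return w[t - CUE_T] if t >= CUE_T else 0.0


def run_trial(w, reward=1.0, alpha=0.2, gamma=1.0, learn=True):
    """Run one trial and return the TD error at every time step.

    w holds one weight per time step elapsed since cue onset
    (a tapped delay line), and is updated in place when learn=True.
    """
    delta = np.zeros(N_STEPS)
    for t in range(N_STEPS - 1):
        r = reward if t + 1 == REWARD_T else 0.0
        # TD error: received reward plus new prediction minus old prediction.
        delta[t + 1] = r + gamma * value(w, t + 1) - value(w, t)
        if learn and t >= CUE_T:
            w[t - CUE_T] += alpha * delta[t + 1]
    return delta


w = np.zeros(N_STEPS)
first = run_trial(w)                      # naive network, reward is unexpected
for _ in range(1000):
    run_trial(w)                          # repeated cue-reward pairings
trained = run_trial(w, learn=False)       # reward arrives on time
omitted = run_trial(w, reward=0.0, learn=False)  # reward withheld

print("error at reward time, first trial:  ", round(first[REWARD_T], 3))
print("error at reward time, after training:", round(trained[REWARD_T], 3))
print("error at cue onset, after training:  ", round(trained[CUE_T], 3))
print("error at reward time, reward omitted:", round(omitted[REWARD_T], 3))
```

Under these assumptions the error starts out large at the moment of reward, moves to cue onset once the reward is predicted, and turns negative at the expected reward time when the reward is withheld. That qualitative pattern matches the dopaminergic responses the abstract refers to, although the paper's own mechanism differs from this sketch.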

Citing Articles

Human Substantia Nigra Neurons Encode Reward Expectations.

Imtiaz Z, Kato A, Kopell B, Qasim S, Davis A, Nunez Martinez L. bioRxiv. 2024.

PMID: 38766086 PMC: 11100806. DOI: 10.1101/2024.05.10.593406.


Dynamics of striatal action selection and reinforcement learning.

Lindsey J, Markowitz J, Gillis W, Datta S, Litwin-Kumar A. bioRxiv. 2024.

PMID: 38464083 PMC: 10925202. DOI: 10.1101/2024.02.14.580408.


Blunted Expected Reward Value Signals in Binge Alcohol Drinkers.

Tolomeo S, Baldacchino A, Douglas Steele J. J Neurosci. 2023; 43(31):5685-5692.

PMID: 36717232 PMC: 10401632. DOI: 10.1523/JNEUROSCI.2157-21.2022.


Modulation of Dopamine for Adaptive Learning: A Neurocomputational Model.

Inglis J, Valentin V, Ashby F. Comput Brain Behav. 2021; 4(1):34-52.

PMID: 34151186 PMC: 8210637. DOI: 10.1007/s42113-020-00083-x.


A systems-neuroscience model of phasic dopamine.

Mollick J, Hazy T, Krueger K, Nair A, Mackie P, Herd S. Psychol Rev. 2020; 127(6):972-1021.

PMID: 32525345 PMC: 8453660. DOI: 10.1037/rev0000199.

