» Articles » PMID: 18032658

Reinforcement Learning Signals in the Human Striatum Distinguish Learners from Nonlearners During Reward-based Decision Making

Overview
Journal J Neurosci
Specialty Neurology
Date 2007 Nov 23
PMID 18032658
Citations 199
Authors
Affiliations
Soon will be listed here.
Abstract

The computational framework of reinforcement learning has been used to forward our understanding of the neural mechanisms underlying reward learning and decision-making behavior. It is known that humans vary widely in their performance in decision-making tasks. Here, we used a simple four-armed bandit task in which subjects are almost evenly split into two groups on the basis of their performance: those who do learn to favor choice of the optimal action and those who do not. Using models of reinforcement learning we sought to determine the neural basis of these intrinsic differences in performance by scanning both groups with functional magnetic resonance imaging. We scanned 29 subjects while they performed the reward-based decision-making task. Our results suggest that these two groups differ markedly in the degree to which reinforcement learning signals in the striatum are engaged during task performance. While the learners showed robust prediction error signals in both the ventral and dorsal striatum during learning, the nonlearner group showed a marked absence of such signals. Moreover, the magnitude of prediction error signals in a region of dorsal striatum correlated significantly with a measure of behavioral performance across all subjects. These findings support a crucial role of prediction error signals, likely originating from dopaminergic midbrain neurons, in enabling learning of action selection preferences on the basis of obtained rewards. Thus, spontaneously observed individual differences in decision making performance demonstrate the suggested dependence of this type of learning on the functional integrity of the dopaminergic striatal system in humans.

Citing Articles

Distinct neural computations scale the violation of expected reward and emotion in social transgressions.

Xu T, Zhang L, Zhou F, Fu K, Gan X, Chen Z Commun Biol. 2025; 8(1):106.

PMID: 39838081 PMC: 11751440. DOI: 10.1038/s42003-025-07561-7.


"Actor-critic" dichotomous hyperactivation and hypoconnectivity in obsessive-compulsive disorder.

Araujo A, Duarte I, Sousa T, Meneses S, Pereira A, Robbins T Neuroimage Clin. 2025; 45:103729.

PMID: 39787803 PMC: 11762915. DOI: 10.1016/j.nicl.2024.103729.


Here Comes Revenge: Peer Victimization Relates to Neural and Behavioral Responses to Social Exclusion.

Kellij S, Dobbelaar S, Lodder G, Veenstra R, Guroglu B Res Child Adolesc Psychopathol. 2024; 52(12):1913-1930.

PMID: 39287772 PMC: 11624251. DOI: 10.1007/s10802-024-01227-4.


Reinforcement learning processes as forecasters of depression remission.

Bansal V, McCurry K, Lisinski J, Kim D, Goyal S, Wang J J Affect Disord. 2024; 368:829-837.

PMID: 39271064 PMC: 11573115. DOI: 10.1016/j.jad.2024.09.066.


Enhanced "learning to learn" through a hierarchical dual-learning system: the case of action video game players.

Gao Y, Fang Z, Zhou Q, Zhang R BMC Psychol. 2024; 12(1):460.

PMID: 39215348 PMC: 11365284. DOI: 10.1186/s40359-024-01952-x.


References
1.
Beck A, Ward C, Mendelson M, Mock J, ERBAUGH J . An inventory for measuring depression. Arch Gen Psychiatry. 1961; 4:561-71. DOI: 10.1001/archpsyc.1961.01710120031004. View

2.
Cools R, Robbins T . Chemistry of the adaptive mind. Philos Trans A Math Phys Eng Sci. 2004; 362(1825):2871-88. DOI: 10.1098/rsta.2004.1468. View

3.
Daw N, Doya K . The computational neurobiology of learning and reward. Curr Opin Neurobiol. 2006; 16(2):199-204. DOI: 10.1016/j.conb.2006.03.006. View

4.
ODoherty J . Reward representations and reward-related learning in the human brain: insights from neuroimaging. Curr Opin Neurobiol. 2004; 14(6):769-76. DOI: 10.1016/j.conb.2004.10.016. View

5.
Pessiglione M, Seymour B, Flandin G, Dolan R, Frith C . Dopamine-dependent prediction errors underpin reward-seeking behaviour in humans. Nature. 2006; 442(7106):1042-5. PMC: 2636869. DOI: 10.1038/nature05051. View