
Neural Systems for Choice and Valuation with Counterfactual Learning Signals

Overview
Journal Neuroimage
Specialty Radiology
Date 2013 Dec 11
PMID 24321554
Citations 15
Abstract

The purpose of this experiment was to test a computational model of reinforcement learning with and without fictive prediction error (FPE) signals, in order to investigate how counterfactual consequences contribute to acquired representations of action-specific expected value and to determine the functional neuroanatomy and neuromodulator systems involved. Eighty male participants underwent dietary depletion of either tryptophan or tyrosine/phenylalanine to manipulate serotonin (5HT) and dopamine (DA), respectively. While being scanned, they completed 80 rounds (240 trials) of a strategic sequential investment task that required accepting interim losses in order to access a lucrative state and maximize long-term gains. We extended the standard Q-learning model by incorporating both counterfactual gains and losses into separate error signals. The FPE model explained the participants' data significantly better than a model that did not include counterfactual learning signals. Expected value from the FPE model was significantly correlated with BOLD signal change in the ventromedial prefrontal cortex (vmPFC) and posterior orbitofrontal cortex (OFC), whereas expected value from the standard model did not predict changes in neural activity. The depletion procedure revealed significantly different neural responses to expected value in the vmPFC, caudate, and dopaminergic midbrain in the vicinity of the substantia nigra (SN). These differences in neural activity were not evident with the standard Q-learning model. These findings demonstrate that FPE signals are an important component of valuation for decision making, and that the neural representation of expected value incorporates cortical and subcortical structures via interactions among serotonergic and dopaminergic modulator systems.
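The abstract does not give the model equations, so the following is only a minimal sketch of the general idea: a standard single-step Q-learning update next to a variant that also learns from the outcomes of unchosen actions, with separate rates for counterfactual gains and losses. The function names, the parameters `alpha`, `kappa_gain`, and `kappa_loss`, and the specific update rule are illustrative assumptions, not the authors' published model.

```python
from typing import List


def q_update(q: List[float], chosen: int, outcome: float, alpha: float) -> List[float]:
    """Standard update: move only the chosen action's value toward its outcome."""
    new_q = list(q)
    new_q[chosen] += alpha * (outcome - q[chosen])
    return new_q


def fpe_update(
    q: List[float],
    chosen: int,
    outcomes: List[float],  # outcome of every action on this trial (assumed observable)
    alpha: float,           # learning rate for the experienced outcome
    kappa_gain: float,      # hypothetical rate when the forgone outcome beat the obtained one
    kappa_loss: float,      # hypothetical rate when the forgone outcome was worse or equal
) -> List[float]:
    """Assumed FPE variant: unchosen actions are also updated from forgone outcomes."""
    obtained = outcomes[chosen]
    new_q = q_update(q, chosen, obtained, alpha)
    for action, forgone in enumerate(outcomes):
        if action == chosen:
            continue
        # Counterfactual gain: the unchosen action would have paid more than what was obtained.
        rate = kappa_gain if forgone > obtained else kappa_loss
        # Fictive prediction error for the unchosen action.
        fictive_error = forgone - q[action]
        new_q[action] += rate * fictive_error
    return new_q


if __name__ == "__main__":
    q0 = [0.0, 0.0]
    print(q_update(q0, chosen=0, outcome=1.0, alpha=0.5))
    print(fpe_update(q0, chosen=0, outcomes=[1.0, 3.0],
                     alpha=0.5, kappa_gain=0.2, kappa_loss=0.1))
```

In this sketch the standard model leaves the unchosen action's value untouched, while the FPE variant shifts it toward the forgone outcome. Any such difference in the resulting expected-value estimates is what a model comparison like the one described above would evaluate.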

Citing Articles

Semantic associative abilities and executive control functions predict novelty and appropriateness of idea generation.

Wang X, Chen Q, Zhuang K, Zhang J, Cortes R, Holzman D Commun Biol. 2024; 7(1):703.

PMID: 38849461 PMC: 11161622. DOI: 10.1038/s42003-024-06405-0.


Impact of prenatal marijuana exposure on adolescent brain structural and functional connectivity and behavioural outcomes.

Vishnubhotla R, Ahmad S, Zhao Y, Radhakrishnan R Brain Commun. 2024; 6(2):fcae001.

PMID: 38444906 PMC: 10914455. DOI: 10.1093/braincomms/fcae001.


Computational mechanisms underlying latent value updating of unchosen actions.

Ben-Artzi I, Kessler Y, Nicenboim B, Shahar N Sci Adv. 2023; 9(42):eadi2704.

PMID: 37862419 PMC: 10588947. DOI: 10.1126/sciadv.adi2704.


Neural responses in macaque prefrontal cortex are linked to strategic exploration.

Jahn C, Grohn J, Cuell S, Emberton A, Bouret S, Walton M PLoS Biol. 2023; 21(1):e3001985.

PMID: 36716348 PMC: 9910800. DOI: 10.1371/journal.pbio.3001985.


Reward and fictive prediction error signals in ventral striatum: asymmetry between factual and counterfactual processing.

Santo-Angles A, Fuentes-Claramonte P, Argila-Plaza I, Guardiola-Ripoll M, Almodovar-Paya C, Munuera J Brain Struct Funct. 2021; 226(5):1553-1569.

PMID: 33839955 DOI: 10.1007/s00429-021-02270-3.