» Articles » PMID: 38773995

The Reward-Complexity Trade-off in Schizophrenia

Overview
Specialty Biology
Date 2024 May 22
PMID 38773995
Authors
Affiliations
Soon will be listed here.
Abstract

Action selection requires a policy that maps states of the world to a distribution over actions. The amount of memory needed to specify the policy (the policy complexity) increases with the state-dependence of the policy. If there is a capacity limit for policy complexity, then there will also be a trade-off between reward and complexity, since some reward will need to be sacrificed in order to satisfy the capacity constraint. This paper empirically characterizes the trade-off between reward and complexity for both schizophrenia patients and healthy controls. Schizophrenia patients adopt lower complexity policies on average, and these policies are more strongly biased away from the optimal reward-complexity trade-off curve compared to healthy controls. However, healthy controls are also biased away from the optimal trade-off curve, and both groups appear to lie on the same empirical trade-off curve. We explain these findings using a cost-sensitive actor-critic model. Our empirical and theoretical results shed new light on cognitive effort abnormalities in schizophrenia.

Citing Articles

Resource-rational psychopathology.

Bari B, Gershman S Behav Neurosci. 2024; 138(4):221-234.

PMID: 38753400 PMC: 11423359. DOI: 10.1037/bne0000600.


Human decision making balances reward maximization and policy compression.

Lai L, Gershman S PLoS Comput Biol. 2024; 20(4):e1012057.

PMID: 38669280 PMC: 11078408. DOI: 10.1371/journal.pcbi.1012057.


Bayesian Reinforcement Learning With Limited Cognitive Load.

Arumugam D, Ho M, Goodman N, Van Roy B Open Mind (Camb). 2024; 8:395-438.

PMID: 38665544 PMC: 11045037. DOI: 10.1162/opmi_a_00132.


Undermatching Is a Consequence of Policy Compression.

Bari B, Gershman S J Neurosci. 2023; 43(3):447-457.

PMID: 36639891 PMC: 9864556. DOI: 10.1523/JNEUROSCI.1003-22.2022.


Atypical meta-memory evaluation strategy in schizophrenia patients.

Zheng Y, Wang L, Gerlofs D, Duan W, Wang X, Yin J Schizophr Res Cogn. 2021; 27:100220.

PMID: 34646754 PMC: 8501761. DOI: 10.1016/j.scog.2021.100220.


References
1.
Parush N, Tishby N, Bergman H . Dopaminergic Balance between Reward Maximization and Policy Complexity. Front Syst Neurosci. 2011; 5:22. PMC: 3093748. DOI: 10.3389/fnsys.2011.00022. View

2.
Gold J, Kool W, Botvinick M, Hubzin L, August S, Waltz J . Cognitive effort avoidance and detection in people with schizophrenia. Cogn Affect Behav Neurosci. 2014; 15(1):145-54. PMC: 4276545. DOI: 10.3758/s13415-014-0308-5. View

3.
Collins A, Frank M . How much of reinforcement learning is working memory, not reinforcement learning? A behavioral, computational, and neurogenetic analysis. Eur J Neurosci. 2012; 35(7):1024-35. PMC: 3390186. DOI: 10.1111/j.1460-9568.2011.07980.x. View

4.
Collins A, Albrecht M, Waltz J, Gold J, Frank M . Interactions Among Working Memory, Reinforcement Learning, and Effort in Value-Based Choice: A New Paradigm and Selective Deficits in Schizophrenia. Biol Psychiatry. 2017; 82(6):431-439. PMC: 5573149. DOI: 10.1016/j.biopsych.2017.05.017. View

5.
Dezfouli A, Balleine B . Habits, action sequences and reinforcement learning. Eur J Neurosci. 2012; 35(7):1036-51. PMC: 3325518. DOI: 10.1111/j.1460-9568.2012.08050.x. View