Two Sides of the Same Coin: Beneficial and Detrimental Consequences of Range Adaptation in Human Reinforcement Learning
Affiliations
Evidence suggests that economic values are rescaled as a function of the range of the available options. Although locally adaptive, range adaptation has been shown to lead to suboptimal choices, particularly notable in reinforcement learning (RL) situations when options are extrapolated from their original context to a new one. Range adaptation can be seen as the result of an adaptive coding process aiming at increasing the signal-to-noise ratio. However, this hypothesis leads to a counterintuitive prediction: Decreasing task difficulty should increase range adaptation and, consequently, extrapolation errors. Here, we tested the paradoxical relation between range adaptation and performance in a large sample of participants performing variants of an RL task, where we manipulated task difficulty. Results confirmed that range adaptation induces systematic extrapolation errors and is stronger when decreasing task difficulty. Last, we propose a range-adapting model and show that it is able to parsimoniously capture all the behavioral results.
Comparing experience- and description-based economic preferences across 11 countries.
Anllo H, Bavard S, Benmarrakchi F, Bonagura D, Cerrotti F, Cicue M Nat Hum Behav. 2024; 8(8):1554-1567.
PMID: 38877287 DOI: 10.1038/s41562-024-01894-9.
Mochizuki Y, Harasawa N, Aggarwal M, Chen C, Fukuda H PLoS Comput Biol. 2024; 20(5):e1012080.
PMID: 38739672 PMC: 11115364. DOI: 10.1371/journal.pcbi.1012080.
Recent Opioid Use Impedes Range Adaptation in Reinforcement Learning in Human Addiction.
Gueguen M, Anllo H, Bonagura D, Kong J, Hafezi S, Palminteri S Biol Psychiatry. 2023; 95(10):974-984.
PMID: 38101503 PMC: 11065633. DOI: 10.1016/j.biopsych.2023.12.005.
Intrinsic rewards explain context-sensitive valuation in reinforcement learning.
Molinaro G, Collins A PLoS Biol. 2023; 21(7):e3002201.
PMID: 37459394 PMC: 10374061. DOI: 10.1371/journal.pbio.3002201.
The functional form of value normalization in human reinforcement learning.
Bavard S, Palminteri S Elife. 2023; 12.
PMID: 37428155 PMC: 10393293. DOI: 10.7554/eLife.83891.