» Articles » PMID: 29209058

Learning the Value of Information and Reward over Time when Solving Exploration-exploitation Problems

Overview
Journal Sci Rep
Specialty Science
Date 2017 Dec 7
PMID 29209058
Citations 21
Authors
Affiliations
Soon will be listed here.
Abstract

To flexibly adapt to the demands of their environment, animals are constantly exposed to the conflict resulting from having to choose between predictably rewarding familiar options (exploitation) and risky novel options, the value of which essentially consists of obtaining new information about the space of possible rewards (exploration). Despite extensive research, the mechanisms that subtend the manner in which animals solve this exploitation-exploration dilemma are still poorly understood. Here, we investigate human decision-making in a gambling task in which the informational value of each trial and the reward potential were separately manipulated. To better characterize the mechanisms that underlined the observed behavioural choices, we introduce a computational model that augments the standard reward-based reinforcement learning formulation by associating a value to information. We find that both reward and information gained during learning influence the balance between exploitation and exploration, and that this influence was dependent on the reward context. Our results shed light on the mechanisms that underpin decision-making under uncertainty, and suggest new approaches for investigating the exploration-exploitation dilemma throughout the animal kingdom.

Citing Articles

Information seeking and the expected utility of information about COVID-19 can be associated with uncertainty and related attitudes.

Torunsky N, Kedrick K, Vilares I Sci Rep. 2025; 15(1):6096.

PMID: 39971991 PMC: 11840097. DOI: 10.1038/s41598-025-89781-9.


Signatures of Perseveration and Heuristic-Based Directed Exploration in Two-Step Sequential Decision Task Behaviour.

Brands A, Mathar D, Peters J Comput Psychiatr. 2025; 9(1):39-62.

PMID: 39959565 PMC: 11827566. DOI: 10.5334/cpsy.101.


Representations of the intrinsic value of information in mouse orbitofrontal cortex.

Bussell J, Badman R, Marton C, Bromberg-Martin E, Abbott L, Rajan K bioRxiv. 2024; .

PMID: 39416043 PMC: 11482914. DOI: 10.1101/2023.10.13.562291.


The roles of intrinsic motivation and capability-related factors in cognitive effort-based decision-making.

Randez A, Helie S Front Psychol. 2024; 15:1303262.

PMID: 38756501 PMC: 11098016. DOI: 10.3389/fpsyg.2024.1303262.


Multiple and subject-specific roles of uncertainty in reward-guided decision-making.

Paunov A, LHotellier M, Guo D, He Z, Yu A, Meyniel F bioRxiv. 2024; .

PMID: 38585958 PMC: 10996615. DOI: 10.1101/2024.03.27.587016.


References
1.
Cooper J, Blanco N, Maddox W . Framing matters: Effects of framing on older adults' exploratory decision-making. Psychol Aging. 2016; 32(1):60-68. PMC: 5300956. DOI: 10.1037/pag0000146. View

2.
Payzan-LeNestour E, Bossaerts P . Do not Bet on the Unknown Versus Try to Find Out More: Estimation Uncertainty and "Unexpected Uncertainty" Both Modulate Exploration. Front Neurosci. 2012; 6:150. PMC: 3472893. DOI: 10.3389/fnins.2012.00150. View

3.
Hertwig R, Erev I . The description-experience gap in risky choice. Trends Cogn Sci. 2009; 13(12):517-23. DOI: 10.1016/j.tics.2009.09.004. View

4.
Green L, Myerson J . A discounting framework for choice with delayed and probabilistic rewards. Psychol Bull. 2004; 130(5):769-92. PMC: 1382186. DOI: 10.1037/0033-2909.130.5.769. View

5.
Cohen J, McClure S, Yu A . Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration. Philos Trans R Soc Lond B Biol Sci. 2007; 362(1481):933-42. PMC: 2430007. DOI: 10.1098/rstb.2007.2098. View