Do Not Bet on the Unknown Versus Try to Find Out More: Estimation Uncertainty and "Unexpected Uncertainty" Both Modulate Exploration

Overview
Journal: Front Neurosci
Date: 2012 Oct 23
PMID: 23087606
Citations: 23
Abstract

Little is known about how humans solve the exploitation/exploration trade-off. In particular, the evidence for uncertainty-driven exploration is mixed. The current study proposes a novel hypothesis of exploration that helps reconcile prior findings that may seem contradictory at first. According to this hypothesis, uncertainty-driven exploration involves a dilemma between two motives: (i) to speed up learning about the unknown, which may beget novel reward opportunities; (ii) to avoid the unknown because it is potentially dangerous. We provide evidence for our hypothesis using both behavioral and simulated data, and briefly point to recent evidence that the brain differentiates between these two motives.

Citing Articles

Losses resulting from deliberate exploration trigger beta oscillations in frontal cortex.

Chernyshev B, Pultsina K, Tretyakova V, Miasnikova A, Prokofyev A, Kozunova G. Front Neurosci. 2023; 17:1152926.

PMID: 37250414 PMC: 10211346. DOI: 10.3389/fnins.2023.1152926.


Neurons in human pre-supplementary motor area encode key computations for value-based choice.

Aquino T, Cockburn J, Mamelak A, Rutishauser U, O'Doherty J. Nat Hum Behav. 2023; 7(6):970-985.

PMID: 36959327 PMC: 10330469. DOI: 10.1038/s41562-023-01548-2.


Anxiety as a disorder of uncertainty: implications for understanding maladaptive anxiety, anxious avoidance, and exposure therapy.

Brown V, Price R, Dombrovski A. Cogn Affect Behav Neurosci. 2023; 23(3):844-868.

PMID: 36869259 PMC: 10475148. DOI: 10.3758/s13415-023-01080-w.


The effects of time horizon and guided choices on explore-exploit decisions in rodents.

Wang S, Gerken B, Wieland J, Wilson R, Fellous J. Behav Neurosci. 2023; 137(2):127-142.

PMID: 36633987 PMC: 10787949. DOI: 10.1037/bne0000549.


Humans adaptively resolve the explore-exploit dilemma under cognitive constraints: Evidence from a multi-armed bandit task.

Brown V, Hallquist M, Frank M, Dombrovski A. Cognition. 2022; 229:105233.

PMID: 35917612 PMC: 9530017. DOI: 10.1016/j.cognition.2022.105233.

