» Articles » PMID: 37591981

Developmental Changes in Exploration Resemble Stochastic Optimization

Overview
Journal Nat Hum Behav
Date 2023 Aug 17
PMID 37591981
Authors
Affiliations
Soon will be listed here.
Abstract

Human development is often described as a 'cooling off' process, analogous to stochastic optimization algorithms that implement a gradual reduction in randomness over time. Yet there is ambiguity in how to interpret this analogy, due to a lack of concrete empirical comparisons. Using data from n = 281 participants ages 5 to 55, we show that cooling off does not only apply to the single dimension of randomness. Rather, human development resembles an optimization process of multiple learning parameters, for example, reward generalization, uncertainty-directed exploration and random temperature. Rapid changes in parameters occur during childhood, but these changes plateau and converge to efficient values in adulthood. We show that while the developmental trajectory of human parameters is strikingly similar to several stochastic optimization algorithms, there are important differences in convergence. None of the optimization algorithms tested were able to discover reliably better regions of the strategy space than adult participants on this task.

Citing Articles

Perceptual Novelty Drives Early Exploration in a Bottom-Up Manner.

Gao M, Sloutsky V Dev Sci. 2025; 28(3):e70002.

PMID: 40033792 PMC: 11876794. DOI: 10.1111/desc.70002.


The connecting brain in context: How adolescent plasticity supports learning and development.

Baker A, Galvan A, Fuligni A Dev Cogn Neurosci. 2024; 71():101486.

PMID: 39631105 PMC: 11653146. DOI: 10.1016/j.dcn.2024.101486.


Decrease in decision noise from adolescence into adulthood mediates an increase in more sophisticated choice behaviors and performance gain.

Scholz V, Waltmann M, Herzog N, Horstmann A, Deserno L PLoS Biol. 2024; 22(11):e3002877.

PMID: 39541313 PMC: 11563475. DOI: 10.1371/journal.pbio.3002877.


Humans flexibly integrate social information despite interindividual differences in reward.

Witt A, Toyokawa W, Lala K, Gaissmaier W, Wu C Proc Natl Acad Sci U S A. 2024; 121(39):e2404928121.

PMID: 39302964 PMC: 11441569. DOI: 10.1073/pnas.2404928121.


Sensitivity to the Instrumental Value of Choice Increases Across Development.

Nussenbaum K, Katzman P, Lu H, Zorowitz S, Hartley C Psychol Sci. 2024; 35(8):933-947.

PMID: 38900963 PMC: 11693699. DOI: 10.1177/09567976241256961.


References
2.
Moran R, Symmonds M, Dolan R, Friston K . The brain ages optimally to model its environment: evidence from sensory learning over the adult lifespan. PLoS Comput Biol. 2014; 10(1):e1003422. PMC: 3900375. DOI: 10.1371/journal.pcbi.1003422. View

3.
Nussenbaum K, Hartley C . Reinforcement learning across development: What insights can we draw from a decade of research?. Dev Cogn Neurosci. 2019; 40:100733. PMC: 6974916. DOI: 10.1016/j.dcn.2019.100733. View

4.
Gopnik A, OGrady S, Lucas C, Griffiths T, Wente A, Bridgers S . Changes in cognitive flexibility and hypothesis search across human life history from childhood to adolescence to adulthood. Proc Natl Acad Sci U S A. 2017; 114(30):7892-7899. PMC: 5544286. DOI: 10.1073/pnas.1700811114. View

5.
Walasek N, Frankenhuis W, Panchanathan K . Sensitive periods, but not critical periods, evolve in a fluctuating environment: a model of incremental development. Proc Biol Sci. 2022; 289(1969):20212623. PMC: 8848242. DOI: 10.1098/rspb.2021.2623. View

6.
Kirkpatrick S, Gelatt Jr C, Vecchi M . Optimization by simulated annealing. Science. 1983; 220(4598):671-80. DOI: 10.1126/science.220.4598.671. View