» Articles » PMID: 27322574

The Computational Development of Reinforcement Learning During Adolescence

Overview
Specialty Biology
Date 2016 Jun 21
PMID 27322574
Citations 53
Authors
Affiliations
Soon will be listed here.
Abstract

Adolescence is a period of life characterised by changes in learning and decision-making. Learning and decision-making do not rely on a unitary system, but instead require the coordination of different cognitive processes that can be mathematically formalised as dissociable computational modules. Here, we aimed to trace the developmental time-course of the computational modules responsible for learning from reward or punishment, and learning from counterfactual feedback. Adolescents and adults carried out a novel reinforcement learning paradigm in which participants learned the association between cues and probabilistic outcomes, where the outcomes differed in valence (reward versus punishment) and feedback was either partial or complete (either the outcome of the chosen option only, or the outcomes of both the chosen and unchosen option, were displayed). Computational strategies changed during development: whereas adolescents' behaviour was better explained by a basic reinforcement learning algorithm, adults' behaviour integrated increasingly complex computational features, namely a counterfactual learning module (enabling enhanced performance in the presence of complete feedback) and a value contextualisation module (enabling symmetrical reward and punishment learning). Unlike adults, adolescent performance did not benefit from counterfactual (complete) feedback. In addition, while adults learned symmetrically from both reward and punishment, adolescents learned from reward but were less likely to learn from punishment. This tendency to rely on rewards and not to consider alternative consequences of actions might contribute to our understanding of decision-making in adolescence.

Citing Articles

Electrical brain activations in preadolescents during a probabilistic reward-learning task reflect cognitive processes and behavior strategies.

Chung Y, van den Berg B, Roberts K, Bagdasarov A, Woldorff M, Gaffrey M Front Hum Neurosci. 2025; 19:1460584.

PMID: 39949988 PMC: 11821623. DOI: 10.3389/fnhum.2025.1460584.


Interpretation of individual differences in computational neuroscience using a latent input approach.

Schaaf J, Miletic S, van Duijvenvoorde A, Huizenga H Dev Cogn Neurosci. 2025; 72:101512.

PMID: 39854872 PMC: 11804603. DOI: 10.1016/j.dcn.2025.101512.


The preference for surprise in reinforcement learning underlies the differences in developmental changes in risk preference between autistic and neurotypical youth.

Sumiya M, Katahira K, Akechi H, Senju A Mol Autism. 2025; 16(1):3.

PMID: 39819491 PMC: 11740557. DOI: 10.1186/s13229-025-00637-5.


The connecting brain in context: How adolescent plasticity supports learning and development.

Baker A, Galvan A, Fuligni A Dev Cogn Neurosci. 2024; 71():101486.

PMID: 39631105 PMC: 11653146. DOI: 10.1016/j.dcn.2024.101486.


Decrease in decision noise from adolescence into adulthood mediates an increase in more sophisticated choice behaviors and performance gain.

Scholz V, Waltmann M, Herzog N, Horstmann A, Deserno L PLoS Biol. 2024; 22(11):e3002877.

PMID: 39541313 PMC: 11563475. DOI: 10.1371/journal.pbio.3002877.


References
1.
Giedd J, Blumenthal J, Jeffries N, Castellanos F, Liu H, Zijdenbos A . Brain development during childhood and adolescence: a longitudinal MRI study. Nat Neurosci. 1999; 2(10):861-3. DOI: 10.1038/13158. View

2.
Benes F, Taylor J, Cunningham M . Convergence and plasticity of monoaminergic systems in the medial prefrontal cortex during the postnatal period: implications for the development of psychopathology. Cereb Cortex. 2000; 10(10):1014-27. DOI: 10.1093/cercor/10.10.1014. View

3.
ODoherty J, Dayan P, Schultz J, Deichmann R, Friston K, Dolan R . Dissociable roles of ventral and dorsal striatum in instrumental conditioning. Science. 2004; 304(5669):452-4. DOI: 10.1126/science.1094285. View

4.
Gogtay N, Giedd J, Lusk L, Hayashi K, Greenstein D, Vaituzis A . Dynamic mapping of human cortical development during childhood through early adulthood. Proc Natl Acad Sci U S A. 2004; 101(21):8174-9. PMC: 419576. DOI: 10.1073/pnas.0402680101. View

5.
Frank M, Seeberger L, OReilly R . By carrot or by stick: cognitive reinforcement learning in parkinsonism. Science. 2004; 306(5703):1940-3. DOI: 10.1126/science.1102941. View