» Articles » PMID: 36271279

Active Inference and the Two-step Task

Overview
Journal Sci Rep
Specialty Science
Date 2022 Oct 22
PMID 36271279
Authors
Affiliations
Soon will be listed here.
Abstract

Sequential decision problems distill important challenges frequently faced by humans. Through repeated interactions with an uncertain world, unknown statistics need to be learned while balancing exploration and exploitation. Reinforcement learning is a prominent method for modeling such behaviour, with a prevalent application being the two-step task. However, recent studies indicate that the standard reinforcement learning model sometimes describes features of human task behaviour inaccurately and incompletely. We investigated whether active inference, a framework proposing a trade-off to the exploration-exploitation dilemma, could better describe human behaviour. Therefore, we re-analysed four publicly available datasets of the two-step task, performed Bayesian model selection, and compared behavioural model predictions. Two datasets, which revealed more model-based inference and behaviour indicative of directed exploration, were better described by active inference, while the models scored similarly for the remaining datasets. Learning using probability distributions appears to contribute to the improved model fits. Further, approximately half of all participants showed sensitivity to information gain as formulated under active inference, although behavioural exploration effects were not fully captured. These results contribute to the empirical validation of active inference as a model of human behaviour and the study of alternative models for the influential two-step task.

Citing Articles

Signatures of Perseveration and Heuristic-Based Directed Exploration in Two-Step Sequential Decision Task Behaviour.

Brands A, Mathar D, Peters J Comput Psychiatr. 2025; 9(1):39-62.

PMID: 39959565 PMC: 11827566. DOI: 10.5334/cpsy.101.

References
1.
Cogliati Dezza I, Yu A, Cleeremans A, Alexander W . Learning the value of information and reward over time when solving exploration-exploitation problems. Sci Rep. 2017; 7(1):16919. PMC: 5717252. DOI: 10.1038/s41598-017-17237-w. View

2.
Smith R, Schwartenbeck P, Stewart J, Kuplicki R, Ekhtiari H, Paulus M . Imprecise action selection in substance use disorder: Evidence for active learning impairments when solving the explore-exploit dilemma. Drug Alcohol Depend. 2020; 215:108208. PMC: 7502502. DOI: 10.1016/j.drugalcdep.2020.108208. View

3.
Karl F . A Free Energy Principle for Biological Systems. Entropy (Basel). 2012; 14(11):2100-2121. PMC: 3510653. DOI: 10.3390/e14112100. View

4.
Daw N, ODoherty J, Dayan P, Seymour B, Dolan R . Cortical substrates for exploratory decisions in humans. Nature. 2006; 441(7095):876-9. PMC: 2635947. DOI: 10.1038/nature04766. View

5.
Wilson R, Geana A, White J, Ludvig E, Cohen J . Humans use directed and random exploration to solve the explore-exploit dilemma. J Exp Psychol Gen. 2014; 143(6):2074-81. PMC: 5635655. DOI: 10.1037/a0038199. View