» Articles » PMID: 35333867

Minimal Cross-trial Generalization in Learning the Representation of an Odor-guided Choice Task

Overview
Specialty Biology
Date 2022 Mar 25
PMID 35333867
Authors
Affiliations
Soon will be listed here.
Abstract

There is no single way to represent a task. Indeed, despite experiencing the same task events and contingencies, different subjects may form distinct task representations. As experimenters, we often assume that subjects represent the task as we envision it. However, such a representation cannot be taken for granted, especially in animal experiments where we cannot deliver explicit instruction regarding the structure of the task. Here, we tested how rats represent an odor-guided choice task in which two odor cues indicated which of two responses would lead to reward, whereas a third odor indicated free choice among the two responses. A parsimonious task representation would allow animals to learn from the forced trials what is the better option to choose in the free-choice trials. However, animals may not necessarily generalize across odors in this way. We fit reinforcement-learning models that use different task representations to trial-by-trial choice behavior of individual rats performing this task, and quantified the degree to which each animal used the more parsimonious representation, generalizing across trial types. Model comparison revealed that most rats did not acquire this representation despite extensive experience. Our results demonstrate the importance of formally testing possible task representations that can afford the observed behavior, rather than assuming that animals' task representations abide by the generative task structure that governs the experimental design.

Citing Articles

Prior cocaine use diminishes encoding of latent information by orbitofrontal, but not medial, prefrontal ensembles.

Mueller L, Konya C, Sharpe M, Wikenheiser A, Schoenbaum G Curr Biol. 2024; 34(22):5223-5238.e3.

PMID: 39454572 PMC: 11576232. DOI: 10.1016/j.cub.2024.09.064.


Neuronal implementation of the temporal difference learning algorithm in the midbrain dopaminergic system.

Stetsenko A, Koos T Proc Natl Acad Sci U S A. 2023; 120(45):e2309015120.

PMID: 37903252 PMC: 10636325. DOI: 10.1073/pnas.2309015120.

References
1.
Zhou J, Jia C, Montesinos-Cartagena M, Gardner M, Zong W, Schoenbaum G . Evolving schema representations in orbitofrontal ensembles during learning. Nature. 2020; 590(7847):606-611. PMC: 7906913. DOI: 10.1038/s41586-020-03061-2. View

2.
Courville A, Daw N, Touretzky D . Bayesian theories of conditioning in a changing world. Trends Cogn Sci. 2006; 10(7):294-300. DOI: 10.1016/j.tics.2006.05.004. View

3.
Burton A, Bissonette G, Vazquez D, Blume E, Donnelly M, Heatley K . Previous cocaine self-administration disrupts reward expectancy encoding in ventral striatum. Neuropsychopharmacology. 2018; 43(12):2350-2360. PMC: 6180050. DOI: 10.1038/s41386-018-0058-0. View

4.
Sweis B, Abram S, Schmidt B, Seeland K, MacDonald 3rd A, Thomas M . Sensitivity to "sunk costs" in mice, rats, and humans. Science. 2018; 361(6398):178-181. PMC: 6377599. DOI: 10.1126/science.aar8644. View

5.
Roesch M, Taylor A, Schoenbaum G . Encoding of time-discounted rewards in orbitofrontal cortex is independent of value representation. Neuron. 2006; 51(4):509-20. PMC: 2561990. DOI: 10.1016/j.neuron.2006.06.027. View