A Neural Network Model for the Orbitofrontal Cortex and Task Space Acquisition During Reinforcement Learning
Overview
Reinforcement learning has been widely used to explain animal behavior. In reinforcement learning, an agent learns the values of the states in a task, which collectively constitute the task state space, and uses this knowledge to choose actions and obtain desired outcomes. It has been proposed that the orbitofrontal cortex (OFC) encodes the task state space during reinforcement learning. However, how the OFC acquires and stores task state information is not well understood. Here, we propose a neural network model based on reservoir computing. Reservoir networks exhibit heterogeneous and dynamic activity patterns that are well suited to encoding task states, and this information can be extracted by a linear readout trained with reinforcement learning. We demonstrate how the network acquires and stores task structures. The network exhibits reinforcement learning behavior, and aspects of it resemble experimental findings in the OFC. Our study provides a theoretical explanation of how the OFC may contribute to reinforcement learning and offers a new approach to understanding the neural mechanisms underlying reinforcement learning.
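To make the reservoir-plus-readout idea concrete, the following is a minimal sketch and not the model from the paper. It assumes a fixed random recurrent network with tanh units, a linear readout that estimates one value per action, and a simple reward-prediction-error (delta) rule as the reinforcement learning update. The two-armed bandit task, the function name `run_reservoir_bandit`, and every hyperparameter (network size, spectral radius, learning rate, exploration rate, reward probabilities) are illustrative assumptions.

```python
import numpy as np

def run_reservoir_bandit(n_units=100, n_trials=1000, seed=0):
    """Hypothetical sketch: fixed reservoir, learned linear readout, bandit task."""
    rng = np.random.default_rng(seed)
    n_actions = 2

    # Fixed random recurrent weights, rescaled to an assumed spectral radius of 0.9.
    w_rec = rng.normal(0.0, 1.0, (n_units, n_units))
    w_rec *= 0.9 / np.max(np.abs(np.linalg.eigvals(w_rec)))

    # Fixed input weights; the input is the previous action (one-hot) plus the previous reward.
    w_in = rng.normal(0.0, 1.0, (n_units, n_actions + 1))

    # Linear readout (one row per action, one extra column for a bias feature).
    # These are the only weights that are learned.
    w_out = np.zeros((n_actions, n_units + 1))

    reward_prob = np.array([0.2, 0.8])  # assumed task: arm 1 pays off more often
    alpha, epsilon = 0.005, 0.1         # assumed learning and exploration rates

    x = np.zeros(n_units)               # reservoir state
    u = np.zeros(n_actions + 1)         # input from the previous trial
    choices = np.zeros(n_trials, dtype=int)

    for t in range(n_trials):
        # The reservoir state mixes the current input with its own history.
        x = np.tanh(w_rec @ x + w_in @ u)
        features = np.append(x, 1.0)

        # Action values are a linear function of the reservoir state.
        q = w_out @ features

        # Epsilon-greedy action selection.
        if rng.random() < epsilon:
            a = int(rng.integers(n_actions))
        else:
            a = int(np.argmax(q))

        # Stochastic binary reward from the chosen arm.
        r = float(rng.random() < reward_prob[a])

        # Delta rule: move the chosen action's value toward the received reward.
        delta = r - q[a]
        w_out[a] += alpha * delta * features

        # Feed the choice and outcome back into the reservoir on the next trial.
        u = np.zeros(n_actions + 1)
        u[a] = 1.0
        u[-1] = r
        choices[t] = a

    return choices, w_out
```

Calling `choices, w_out = run_reservoir_bandit()` returns the sequence of chosen arms and the learned readout weights. In this sketch the recurrent and input weights never change; all task knowledge is stored in `w_out`, which mirrors the division of labor described above between a dynamic reservoir and a trained linear readout.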