» Articles » PMID: 37422476

Reinforcement Learning Establishes a Minimal Metacognitive Process to Monitor and Control Motor Learning Performance

Overview
Journal Nat Commun
Specialty Biology
Date 2023 Jul 8
PMID 37422476
Authors
Affiliations
Soon will be listed here.
Abstract

Humans and animals develop learning-to-learn strategies throughout their lives to accelerate learning. One theory suggests that this is achieved by a metacognitive process of controlling and monitoring learning. Although such learning-to-learn is also observed in motor learning, the metacognitive aspect of learning regulation has not been considered in classical theories of motor learning. Here, we formulated a minimal mechanism of this process as reinforcement learning of motor learning properties, which regulates a policy for memory update in response to sensory prediction error while monitoring its performance. This theory was confirmed in human motor learning experiments, in which the subjective sense of learning-outcome association determined the direction of up- and down-regulation of both learning speed and memory retention. Thus, it provides a simple, unifying account for variations in learning speeds, where the reinforcement learning mechanism monitors and controls the motor learning process.

Citing Articles

Motor synergy and energy efficiency emerge in whole-body locomotion learning.

Li G, Hayashibe M Sci Rep. 2025; 15(1):712.

PMID: 39753645 PMC: 11698959. DOI: 10.1038/s41598-024-82472-x.


Meta-learning of human motor adaptation via the dorsal premotor cortex.

Sugiyama T, Uehara S, Izawa J Proc Natl Acad Sci U S A. 2024; 121(44):e2417543121.

PMID: 39441634 PMC: 11536165. DOI: 10.1073/pnas.2417543121.


Exploring motor skill acquisition in bimanual coordination: insights from navigating a novel maze task.

Cienfuegos M, Maycock J, Naceri A, Dusterhus T, Koiva R, Schack T Sci Rep. 2024; 14(1):18887.

PMID: 39143119 PMC: 11324764. DOI: 10.1038/s41598-024-69200-1.


Learning-to-learn as a metacognitive correlate of functional outcomes after stroke: a cohort study.

Sugiyama T, Uehara S, Yuasa A, Ushizawa K, Izawa J, Otaka Y Eur J Phys Rehabil Med. 2024; 60(5):750-760.

PMID: 39073359 PMC: 11559250. DOI: 10.23736/S1973-9087.24.08446-6.


The effects of reward and punishment on the performance of ping-pong ball bouncing.

Yin C, Wang Y, Li B, Gao T Front Behav Neurosci. 2024; 18:1433649.

PMID: 38993267 PMC: 11236609. DOI: 10.3389/fnbeh.2024.1433649.

References
1.
Kostadinov D, Hausser M . Reward signals in the cerebellum: Origins, targets, and functional implications. Neuron. 2022; 110(8):1290-1303. DOI: 10.1016/j.neuron.2022.02.015. View

2.
Boehm U, Marsman M, Matzke D, Wagenmakers E . On the importance of avoiding shortcuts in applying cognitive models to hierarchical data. Behav Res Methods. 2018; 50(4):1614-1631. PMC: 6096647. DOI: 10.3758/s13428-018-1054-3. View

3.
Wagner M, Kim T, Savall J, Schnitzer M, Luo L . Cerebellar granule cells encode the expectation of reward. Nature. 2017; 544(7648):96-100. PMC: 5532014. DOI: 10.1038/nature21726. View

4.
Chabrol F, Blot A, Mrsic-Flogel T . Cerebellar Contribution to Preparatory Activity in Motor Neocortex. Neuron. 2019; 103(3):506-519.e4. PMC: 6693889. DOI: 10.1016/j.neuron.2019.05.022. View

5.
Wei K, Kording K . Uncertainty of feedback and state estimation determines the speed of motor adaptation. Front Comput Neurosci. 2010; 4:11. PMC: 2871692. DOI: 10.3389/fncom.2010.00011. View