» Articles » PMID: 38438263

Reinforcement Learning During Locomotion

Overview
Journal eNeuro
Specialty Neurology
Date 2024 Mar 4
PMID 38438263
Authors
Affiliations
Soon will be listed here.
Abstract

When learning a new motor skill, people often must use trial and error to discover which movement is best. In the reinforcement learning framework, this concept is known as exploration and has been linked to increased movement variability in motor tasks. For locomotor tasks, however, increased variability decreases upright stability. As such, exploration during gait may jeopardize balance and safety, making reinforcement learning less effective. Therefore, we set out to determine if humans could acquire and retain a novel locomotor pattern using reinforcement learning alone. Young healthy male and female participants walked on a treadmill and were provided with binary reward feedback (indicated by a green checkmark on the screen) that was tied to a fixed monetary bonus, to learn a novel stepping pattern. We also recruited a comparison group who walked with the same novel stepping pattern but did so by correcting for target error, induced by providing real-time veridical visual feedback of steps and a target. In two experiments, we compared learning, motor variability, and two forms of motor memories between the groups. We found that individuals in the binary reward group did, in fact, acquire the new walking pattern by exploring (increasing motor variability). Additionally, while reinforcement learning did not increase implicit motor memories, it resulted in more accurate explicit motor memories compared with the target error group. Overall, these results demonstrate that humans can acquire new walking patterns with reinforcement learning and retain much of the learning over 24 h.

Citing Articles

The dual timescales of gait adaptation: initial stability adjustments followed by subsequent energetic cost adjustments.

Brinkerhoff S, Sanchez N, Culver M, Murrah W, Robinson A, McCullough J J Exp Biol. 2024; 227(23).

PMID: 39422307 PMC: 11883409. DOI: 10.1242/jeb.249217.


Roles and interplay of reinforcement-based and error-based processes during reaching and gait in neurotypical adults and individuals with Parkinson's disease.

Roth A, Buggeln J, Hoh J, Wood J, Sullivan S, Ngo T PLoS Comput Biol. 2024; 20(10):e1012474.

PMID: 39401183 PMC: 11472932. DOI: 10.1371/journal.pcbi.1012474.

References
1.
Madelain L, Paeye C, Wallman J . Modification of saccadic gain by reinforcement. J Neurophysiol. 2011; 106(1):219-32. PMC: 3129734. DOI: 10.1152/jn.01094.2009. View

2.
Bakkum A, Marigold D . Learning from the Physical Consequences of Our Actions Improves Motor Memory. eNeuro. 2022; 9(3). PMC: 9172287. DOI: 10.1523/ENEURO.0459-21.2022. View

3.
Tsay J, Kim H, Saxena A, Parvin D, Verstynen T, Ivry R . Dissociable use-dependent processes for volitional goal-directed reaching. Proc Biol Sci. 2022; 289(1973):20220415. PMC: 9043705. DOI: 10.1098/rspb.2022.0415. View

4.
Daw N, ODoherty J, Dayan P, Seymour B, Dolan R . Cortical substrates for exploratory decisions in humans. Nature. 2006; 441(7095):876-9. PMC: 2635947. DOI: 10.1038/nature04766. View

5.
Raviv O, Ahissar M, Loewenstein Y . How recent history affects perception: the normative approach and its heuristic approximation. PLoS Comput Biol. 2012; 8(10):e1002731. PMC: 3486920. DOI: 10.1371/journal.pcbi.1002731. View