» Articles » PMID: 36236477

Computational Optimization of Image-Based Reinforcement Learning for Robotics

Overview
Journal Sensors (Basel)
Publisher MDPI
Specialty Biotechnology
Date 2022 Oct 14
PMID 36236477
Authors
Affiliations
Soon will be listed here.
Abstract

The robotics field has been deeply influenced by the advent of deep learning. In recent years, this trend has been characterized by the adoption of large, pretrained models for robotic use cases, which are not compatible with the computational hardware available in robotic systems. Moreover, such large, computationally intensive models impede the low-latency execution which is required for many closed-loop control systems. In this work, we propose different strategies for improving the computational efficiency of the deep-learning models adopted in reinforcement-learning (RL) scenarios. As a use-case project, we consider an image-based RL method on the synergy between push-and-grasp actions. As a first optimization step, we reduce the model architecture in complexity, by decreasing the number of layers and by altering the architecture structure. Second, we consider downscaling the input resolution to reduce the computational load. Finally, we perform weight quantization, where we compare post-training quantization and quantized-aware training. We benchmark the improvements introduced in each optimization by running a standard testing routine. We show that the optimization strategies introduced can improve the computational efficiency by around 300 times, while also slightly improving the functional performance of the system. In addition, we demonstrate closed-loop control behaviour on a real-world robot, while processing everything on a Jetson Xavier NX edge device.

Citing Articles

An Overview of Computational Coronary Physiology Technologies Based on Medical Imaging and Artificial Intelligence.

Li B, Chen H, Wang H, Hong L, Yang L Rev Cardiovasc Med. 2024; 25(6):211.

PMID: 39076307 PMC: 11270081. DOI: 10.31083/j.rcm2506211.

References
1.
Garcia C, Delakis M . Convolutional face finder: a neural architecture for fast and robust face detection. IEEE Trans Pattern Anal Mach Intell. 2004; 26(11):1408-23. DOI: 10.1109/tpami.2004.97. View

2.
Mnih V, Kavukcuoglu K, Silver D, Rusu A, Veness J, Bellemare M . Human-level control through deep reinforcement learning. Nature. 2015; 518(7540):529-33. DOI: 10.1038/nature14236. View

3.
LeCun Y, Bengio Y, Hinton G . Deep learning. Nature. 2015; 521(7553):436-44. DOI: 10.1038/nature14539. View

4.
Van de Maele T, Verbelen T, Catal O, De Boom C, Dhoedt B . Active Vision for Robot Manipulators Using the Free Energy Principle. Front Neurorobot. 2021; 15:642780. PMC: 7973267. DOI: 10.3389/fnbot.2021.642780. View