Computational Optimization of Image-Based Reinforcement Learning for Robotics

Overview

Journal: Sensors (Basel)

Publisher: MDPI

Specialty: Biotechnology

Date: 2022 Oct 14

PMID: 36236477

Authors

Stefano Ferraro

Toon Van de Maele

Pietro Mazzaglia

Tim Verbelen

Bart Dhoedt

Abstract

The robotics field has been deeply influenced by the advent of deep learning. In recent years, this trend has been characterized by the adoption of large, pretrained models that are not compatible with the computational hardware available in robotic systems. Moreover, such large, computationally intensive models impede the low-latency execution required by many closed-loop control systems. In this work, we propose different strategies for improving the computational efficiency of the deep-learning models adopted in reinforcement-learning (RL) scenarios. As a use case, we consider an image-based RL method that exploits the synergy between pushing and grasping actions. As a first optimization step, we reduce the complexity of the model architecture by decreasing the number of layers and altering its structure. Second, we downscale the input resolution to reduce the computational load. Finally, we perform weight quantization, comparing post-training quantization with quantization-aware training. We benchmark the improvement introduced by each optimization by running a standard testing routine. We show that the optimization strategies introduced can improve the computational efficiency by around 300 times, while also slightly improving the functional performance of the system. In addition, we demonstrate closed-loop control behaviour on a real-world robot, with all processing performed on a Jetson Xavier NX edge device.
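To make the weight-quantization step mentioned in the abstract concrete, the following is a minimal sketch of symmetric, per-tensor post-training quantization to 8-bit integers. It is not the authors' implementation: the function names `quantize_int8` and `dequantize` and the example weights are hypothetical, and a real deployment would normally rely on the quantization tooling of the deep-learning framework in use.

```python
from typing import List, Tuple


def quantize_int8(weights: List[float]) -> Tuple[List[int], float]:
    """Map float weights to signed 8-bit integers with one shared scale.

    Hypothetical helper for illustration only (symmetric, per-tensor scheme).
    """
    max_abs = max(abs(w) for w in weights)
    # Guard against an all-zero tensor, where any scale would do.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # Round to the nearest integer step and clamp to the int8 range used here.
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale


def dequantize(q: List[int], scale: float) -> List[float]:
    """Recover approximate float weights from their integer representation."""
    return [v * scale for v in q]


# Made-up example weights, not values from the paper.
weights = [0.5, -1.27, 0.0, 0.31, 1.0]
q, scale = quantize_int8(weights)
restored = dequantize(q, scale)

# The rounding error per weight is bounded by half a quantization step.
max_error = max(abs(a - b) for a, b in zip(weights, restored))
```

Post-training quantization applies this kind of rounding to an already trained model, whereas quantization-aware training simulates the rounding during training so that the weights can adapt to it.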

Citing Articles

An Overview of Computational Coronary Physiology Technologies Based on Medical Imaging and Artificial Intelligence.

Li B, Chen H, Wang H, Hong L, Yang L. Rev Cardiovasc Med. 2024; 25(6):211.

PMID: 39076307. PMC: 11270081. DOI: 10.31083/j.rcm2506211.
