» Articles » PMID: 33916995

Coverage Path Planning Using Reinforcement Learning-Based TSP for HTetran-A Polyabolo-Inspired Self-Reconfigurable Tiling Robot

Overview
Journal Sensors (Basel)
Publisher MDPI
Specialty Biotechnology
Date 2021 Apr 30
PMID 33916995
Citations 2
Authors
Affiliations
Soon will be listed here.
Abstract

One of the critical challenges in deploying the cleaning robots is the completion of covering the entire area. Current tiling robots for area coverage have fixed forms and are limited to cleaning only certain areas. The reconfigurable system is the creative answer to such an optimal coverage problem. The tiling robot's goal enables the complete coverage of the entire area by reconfiguring to different shapes according to the area's needs. In the particular sequencing of navigation, it is essential to have a structure that allows the robot to extend the coverage range while saving energy usage during navigation. This implies that the robot is able to cover larger areas entirely with the least required actions. This paper presents a complete path planning (CPP) for hTetran, a polyabolo tiled robot, based on a TSP-based reinforcement learning optimization. This structure simultaneously produces robot shapes and sequential trajectories whilst maximizing the reward of the trained reinforcement learning (RL) model within the predefined polyabolo-based tileset. To this end, a reinforcement learning-based travel sales problem (TSP) with proximal policy optimization (PPO) algorithm was trained using the complementary learning computation of the TSP sequencing. The reconstructive results of the proposed RL-TSP-based CPP for hTetran were compared in terms of energy and time spent with the conventional tiled hypothetical models that incorporate TSP solved through an evolutionary based ant colony optimization (ACO) approach. The CPP demonstrates an ability to generate an ideal Pareto optima trajectory that enhances the robot's navigation inside the real environment with the least energy and time spent in the company of conventional techniques.

Citing Articles

Collaborative Complete Coverage and Path Planning for Multi-Robot Exploration.

Lin H, Huang Y Sensors (Basel). 2021; 21(11).

PMID: 34073565 PMC: 8198857. DOI: 10.3390/s21113709.


Adaptive Floor Cleaning Strategy by Human Density Surveillance Mapping with a Reconfigurable Multi-Purpose Service Robot.

Sivanantham V, Le A, Shi Y, Elara M, Sheu B Sensors (Basel). 2021; 21(9).

PMID: 33922638 PMC: 8122887. DOI: 10.3390/s21092965.

References
1.
Le A, Prabakaran V, Sivanantham V, Mohan R . Modified A-Star Algorithm for Efficient Coverage Path Planning in Tetris Inspired Self-Reconfigurable Robot with Integrated Laser Sensor. Sensors (Basel). 2018; 18(8). PMC: 6111563. DOI: 10.3390/s18082585. View

2.
Le A, Parween R, Mohan R, Nhan N, Enjikalayil R . Optimization Complete Area Coverage by Reconfigurable hTrihex Tiling Robot. Sensors (Basel). 2020; 20(11). PMC: 7308827. DOI: 10.3390/s20113170. View

3.
Le A, Nhan N, Mohan R . Evolutionary Algorithm-Based Complete Coverage Path Planning for Tetriamond Tiling Robots. Sensors (Basel). 2020; 20(2). PMC: 7013451. DOI: 10.3390/s20020445. View

4.
Ghaddar A, Merei A, Natalizio E . PPS: Energy-Aware Grid-Based Coverage Path Planning for UAVs Using Area Partitioning in the Presence of NFZs. Sensors (Basel). 2020; 20(13). PMC: 7374364. DOI: 10.3390/s20133742. View

5.
Yang S, Luo C . A neural network approach to complete coverage path planning. IEEE Trans Syst Man Cybern B Cybern. 2004; 34(1):718-25. DOI: 10.1109/tsmcb.2003.811769. View