
Deep Reinforcement Learning for UAV Trajectory Design Considering Mobile Ground Users

Overview
Journal Sensors (Basel)
Publisher MDPI
Specialty Biotechnology
Date 2021 Dec 28
PMID 34960332
Citations 5
Abstract

A network of unmanned aerial vehicles (UAVs) serving as base stations (a UAV-BS network) is emerging as a promising component of next-generation communication systems. In a UAV-BS network, positioning each UAV-BS optimally is essential for establishing line-of-sight (LoS) links to ground users. This work proposes a novel deep Q-network (DQN)-based learning model that enables the optimal deployment of a UAV-BS. Moreover, the proposed model yields the optimal UAV-BS trajectory while ground users move, without retraining and without acquiring the users' path information. Specifically, it optimizes the trajectory of a UAV-BS by maximizing the mean opinion score (MOS) of ground users who move along various paths. The model is also highly practical because it takes an average channel power gain as its input parameter instead of the locations of individual mobile users. The accuracy of the proposed model is validated by comparing its results with those of a mathematical optimization solver.
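The idea of learning where to place a UAV-BS from a channel-quality reward can be illustrated with a toy example. The sketch below is not the authors' model: it swaps the DQN for tabular Q-learning, keeps the ground users static, and uses the average channel power gain in dB as a stand-in reward rather than the paper's MOS. The free-space LoS gain formula, the grid size, the altitude, the reference gain `BETA0`, and all function names are assumptions made for illustration only.

```python
import math
import random

GRID = 5             # UAV-BS hovers on a GRID x GRID lattice (assumed)
CELL_M = 100.0       # lattice spacing in metres (assumed)
ALTITUDE_M = 100.0   # fixed flight altitude in metres (assumed)
BETA0 = 1e-4         # channel power gain at 1 m reference distance (assumed)
ACTIONS = [(0, 0), (1, 0), (-1, 0), (0, 1), (0, -1)]  # hover, +x, -x, +y, -y


def avg_channel_gain(cell, users):
    """Mean free-space LoS power gain from a UAV cell to all ground users."""
    ux, uy = cell[0] * CELL_M, cell[1] * CELL_M
    gains = [BETA0 / (ALTITUDE_M ** 2 + (ux - x) ** 2 + (uy - y) ** 2)
             for x, y in users]
    return sum(gains) / len(gains)


def reward(cell, users):
    """Placeholder reward: average gain in dB (not the paper's MOS model)."""
    return 10.0 * math.log10(avg_channel_gain(cell, users))


def move(cell, action):
    """Apply an action and clip the result to the lattice boundary."""
    x = min(max(cell[0] + action[0], 0), GRID - 1)
    y = min(max(cell[1] + action[1], 0), GRID - 1)
    return (x, y)


def train(users, episodes=3000, horizon=20, alpha=0.5, gamma=0.5,
          eps=0.2, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration; returns the Q-table."""
    rng = random.Random(seed)
    q = {}  # maps (cell, action_index) -> value, missing entries count as 0.0
    n = len(ACTIONS)
    for _ in range(episodes):
        s = (rng.randrange(GRID), rng.randrange(GRID))
        for _ in range(horizon):
            if rng.random() < eps:
                a = rng.randrange(n)
            else:
                a = max(range(n), key=lambda i: q.get((s, i), 0.0))
            s2 = move(s, ACTIONS[a])
            # One-step temporal-difference target, as in Q-learning.
            target = reward(s2, users) + gamma * max(
                q.get((s2, i), 0.0) for i in range(n))
            old = q.get((s, a), 0.0)
            q[(s, a)] = old + alpha * (target - old)
            s = s2
    return q


def greedy_path(q, start, steps=10):
    """Follow the greedy policy from `start` and return the visited cells."""
    path = [start]
    for _ in range(steps):
        s = path[-1]
        a = max(range(len(ACTIONS)), key=lambda i: q.get((s, i), 0.0))
        path.append(move(s, ACTIONS[a]))
    return path


if __name__ == "__main__":
    users = [(300.0, 300.0), (320.0, 280.0), (280.0, 310.0)]  # metres
    print(greedy_path(train(users), start=(0, 0)))
```

In this toy setting the greedy rollout should drift from the corner cell toward the cell nearest the user cluster and then hover there. Handling moving users without retraining, as the abstract describes, would require a state that reflects the current channel conditions (for example the measured average channel power gain) and a function approximator such as a DQN in place of the table.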

Citing Articles

Research on the Total Channel Capacities Pertaining to Two Coverage Layouts for Three-Dimensional, UAV-Assisted Ad Hoc Networks.

Yan X, Zhu S, Wang Q, Wu H Sensors (Basel). 2023; 23(7).

PMID: 37050564 PMC: 10099046. DOI: 10.3390/s23073504.


Handover Management for Drones in Future Mobile Networks-A Survey.

Shayea I, Dushi P, Banafaa M, Rashid R, Ali S, Sarijari M Sensors (Basel). 2022; 22(17).

PMID: 36080883 PMC: 9460841. DOI: 10.3390/s22176424.


Mobility Management of Unmanned Aerial Vehicles in Ultra-Dense Heterogeneous Networks.

Alshaibani W, Shayea I, Caglar R, Din J, Daradkeh Y Sensors (Basel). 2022; 22(16).

PMID: 36015781 PMC: 9416608. DOI: 10.3390/s22166013.


Parallel Cooperative Coevolutionary Grey Wolf Optimizer for Path Planning Problem of Unmanned Aerial Vehicles.

Jarray R, Al-Dhaifallah M, Rezk H, Bouallegue S Sensors (Basel). 2022; 22(5).

PMID: 35270978 PMC: 8914685. DOI: 10.3390/s22051826.


Deep Q-Learning-Based Transmission Power Control of a High Altitude Platform Station with Spectrum Sharing.

Jo S, Yang W, Choi H, Noh E, Jo H, Park J Sensors (Basel). 2022; 22(4).

PMID: 35214535 PMC: 8878605. DOI: 10.3390/s22041630.
