Trajectory Optimization and Power Allocation Scheme Based on DRL in Energy Efficient UAV-Aided Communication Networks

WANG Chaowei; CUI Yuling; DENG Danhao; WANG Weidong; JIANG Fan

doi:10.1049/cje.2021.00.314

WANG Chaowei, CUI Yuling, DENG Danhao, WANG Weidong, JIANG Fan. Trajectory Optimization and Power Allocation Scheme Based on DRL in Energy Efficient UAV-Aided Communication Networks[J]. Chinese Journal of Electronics, 2022, 31(3): 397-407. DOI: 10.1049/cje.2021.00.314

Citation:

Trajectory Optimization and Power Allocation Scheme Based on DRL in Energy Efficient UAV-Aided Communication Networks

Graphical Abstract

Graphical Abstract

Abstract

Abstract

With flexibility, convenience and mobility, unmanned aerial vehicles (UAVS) can provide wireless communication networks with lower costs, easier deployment, higher network scalability and larger coverage. This paper proposes the deep deterministic policy gradient algorithm to jointly optimize the power allocation and flight trajectory of UAV with constrained effective energy to maximize the downlink throughput to ground users. To validate the proposed algorithm, we compare with the random algorithm, Q-learning algorithm and deep Q network algorithm. The simulation results show that the proposed algorithm can effectively improve the communication quality and significantly extend the service time of UAV. In addition, the downlink throughput increases with the number of ground users.