November 5, 2023
Significant Progress in Artificial Intelligence Communication Made by Professor Huang Mengxing's Team from the School of Information and Communication Engineering


Abstract:

Unmanned aerial vehicle (UAV)-assisted communication is a significant technology for 6G communication. To address the dynamic trajectory optimization problem of the air-ground network, the interaction between entities is first modeled as a Markov game. Model-free multi-agent reinforcement learning (MARL) is then adopted to optimize individual decision-making, which enables each agent to learn the mobility patterns of the others and optimize its own mobility strategy. However, the benchmark MARL algorithms commonly suffer from issues such as biased estimation and convergence to local optima. To solve these problems, an enhanced multi-agent proximal policy optimization algorithm is proposed, with policy clipping and average evaluation to guarantee fast convergence and accurate estimation. Simulations demonstrate that this method converges better than the benchmark algorithms. It allows the UAV base station, the ground users, and the aerial jammer to adopt optimal mobility strategies that achieve their respective maximum cumulative rewards. In addition, the stable strategies of the agents constitute an approximate Nash equilibrium of the UAV-assisted communication Markov game.
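
The abstract names policy clipping and average evaluation but does not spell out either. As a rough illustration only, the sketch below shows the standard proximal policy optimization clipped surrogate objective, together with one possible reading of "average evaluation" as the mean of several critics' value estimates. The function names, the clipping range eps=0.2, and the multi-critic interpretation are assumptions made here and are not taken from the paper.

```python
import math


def clipped_surrogate(new_logp, old_logp, advantages, eps=0.2):
    """Mean PPO clipped surrogate objective (a quantity to maximize).

    new_logp / old_logp: log-probabilities of the sampled actions under
    the updated and the data-collecting policy; advantages: advantage
    estimates for the same samples. eps is an assumed clipping range.
    """
    total = 0.0
    for nl, ol, adv in zip(new_logp, old_logp, advantages):
        ratio = math.exp(nl - ol)                        # probability ratio
        clipped = max(1.0 - eps, min(1.0 + eps, ratio))  # keep ratio in [1-eps, 1+eps]
        total += min(ratio * adv, clipped * adv)         # pessimistic bound
    return total / len(advantages)


def averaged_value(critic_estimates):
    """Hypothetical 'average evaluation': mean of several critics'
    value estimates for one state (an assumed interpretation)."""
    return sum(critic_estimates) / len(critic_estimates)
```

With a positive advantage, a ratio above 1 + eps earns no extra objective value, which removes the incentive for overly large policy updates. Averaging several value estimates is one common way to reduce the estimation bias of a single critic.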

Full text: https://ieeexplore.ieee.org/stampPDF/getPDF.jsp?tp=&arnumber=10197291&ref=
