Systems Engineering and Electronics ›› 2025, Vol. 47 ›› Issue (6): 1867-1879.doi: 10.12305/j.issn.1001-506X.2025.06.15

• Systems Engineering • Previous Articles     Next Articles

Confront strategy of multi-unmanned aerial vehicle based on ASDDPG algorithm

Xiaowei FU, Xinyi WANG, Zhe QIAO   

  1. School of Electronics and Information, Northwestern Polytechnical University, Xi'an 710129, China
  • Received:2024-03-05 Online:2025-06-25 Published:2025-07-09
  • Contact: Xiaowei FU

Abstract:

In a multi-unmanned aerial vehicle (UAV) confrontation, the number of friendly UAVs within the range of the UAVs communication is indeterminate, resulting in changes in the amount of information it obtains. In deep reinforcement learning, the input dimension of the neural network is fixed, and many algorithms only consider the interaction information of a fixed number of friendly UAVs at a relatively close distance, resulting in information loss and inconsistent with the actual battlefield environment. In this regard, based on the multi-agent deep deterministic policy gradient (MADDPG) algorithm and attention mechanism, the attention state-deep deterministic policy gradient (ASDDPG) algorithm is proposed to transform changing information into fixed-length feature vectors, which solves the problem of mismatch between amount of information and input dimension, and extracts state features through coder and decoder structure to enhance the decision-making ability of UAVs. Simulation experiments are designed to compare and analyze the performance of the proposed algorithm, and verify the performance advantage of the proposed algorithm with a better winning probability. The algorithm's advantages in improving UAVs adversarial decision-making and generalization have been verified in this study.

Key words: multi-unmanned aerial vehicle (UAV), reinforcement learning, policy gradient, maneuver decision-making, attention mechanism

CLC Number: 

[an error occurred while processing this directive]