Systems Engineering and Electronics ›› 2025, Vol. 47 ›› Issue (10): 3288-3299.doi: 10.12305/j.issn.1001-506X.2025.10.16

• Systems Engineering • Previous Articles    

UAV many-to-one pursuit-evasion game based on ME-DDPG algorithm

Yaozhong ZHANG1,*, Zhuoran WU1, Jiandong ZHANG1, Qiming YANG1, Guoqing SHI1, Zixiang XU2   

  1. 1. School of Electronics and Information,Northwestern Polytechnical University,Xi’an 710072,China
    2. Tianjin Institute of Industrial Biotechnology,Chinese Academy of Sciences,Tianjin 300308,China
  • Received:2023-12-11 Online:2025-10-25 Published:2025-10-23
  • Contact: Yaozhong ZHANG

Abstract:

Aiming at the problem of many-to-one pursuit-evasion game of unmanned aerial vehicle (UAV), based on deep deterministic policy gradient (DDPG) of reinforcement learning, and numerical solution results of differential game confrontation combined with pursuit-evasion problem, a mixed experienced DDPG (ME-DDPG) algorithm is proposed. By incorporating game adversarial numerical solutions into the strategy set of exploratory learning, directional strategies are calculated to enhance the training efficiency of UAV pursuit strategies and improve the slow convergence speed and easy local convergence caused by long turn tasks, sparse reward rewards, and insufficient exploration of reinforcement learning algorithms in UAV pursuit-evasion game problems. This improves the learning efficiency of reinforcement learning algorithm. The simulation experiment results show that using the ME-DDPG algorithm to solve the pursuit-evasion task of UAV in a many-to-one game can quickly converge, and the success rate of the task reaches 83%. Comparative experiments verify the advantages of the proposed algorithm over the DDPG algorithm in terms of convergence, stability, and task success rate.

Key words: game theory, deep reinforcement learning, pursuit-evasion game, unmanned aerial vehicle (UAV), multi-agent

CLC Number: 

[an error occurred while processing this directive]