Systems Engineering and Electronics ›› 2023, Vol. 45 ›› Issue (6): 1702-1711.doi: 10.12305/j.issn.1001-506X.2023.06.14

• Systems Engineering • Previous Articles    

UAV intelligent avoidance decisions based on deep reinforcement learning algorithm

Fengguo WU1, Wei TAO2, Hui LI1,3,*, Jianwei ZHANG1,3, Chengchen ZHENG3   

  1. 1. National Key Laboratory of Fundamental Science on Synthetic Vision, Sichuan University, Chengdu 610065, China
    2. China Ship Development and Design Center, Wuhan 430064, China
    3. School of Computer Science, Sichuan University, Chengdu 610065, China
  • Received:2022-04-02 Online:2023-05-25 Published:2023-06-01
  • Contact: Hui LI

Abstract:

In order to improve the survival rate of unmanned aerial vehicles (UAVs) in complex air combat scenarios, based on the open UAVs air intelligence game simulation platform, a reinforcement learning method is used to generate maneuver strategies. Based on the deep double Q network (DDQN) and deep deterministic policy gradient (DDPG) algorithms, an unit state sequence (USS) is proposed in this paper, and the gated recurrent unit (GRU) is used to fuse the situation features in USS, with the propose to increase the ability of state features recognition and algorithm convergence in complex air combat scenarios. The experimental results show that when faced with missile attacks using standard proportional guidance algorithm, the agent achieves a survival rate of 98% for missiles evading, and in complex scenarios where multiple missiles attack simultaneously, it can also achieve a survival rate of 88%. Compared with the traditional simple maneuvering modes, the survival rate of UAVs is significantly improved.

Key words: deep reinforcement learning (DRL), unmanned aerial vehicles (UAVs), unit state sequence (USS), gated recurrent unit (GRU)

CLC Number: 

[an error occurred while processing this directive]