Systems Engineering and Electronics ›› 2025, Vol. 47 ›› Issue (4): 1285-1299.doi: 10.12305/j.issn.1001-506X.2025.04.25

• Guidance, Navigation and Control • Previous Articles     Next Articles

Research on intelligent decision-making methods for coordinated attack by manned aerial vehicles and unmanned aerial vehicles

Wei XIONG1,2, Dong ZHANG1,2,*, Zhi REN1,2, Shuheng YANG1,2   

  1. 1. School of Astronautics, Northwestern Polytechnical University, Xi'an 710072, China
    2. Shaanxi Key Laboratory of Space Vehicle Design, Xi'an 710072, China
  • Received:2024-04-26 Online:2025-04-25 Published:2025-05-28
  • Contact: Dong ZHANG

Abstract:

The trend in unmanned aerial vehicle air combat is the coordination between manned aerial vehicles and unmanned aerial vehicles, with intelligent decision-making being crucial for achieving coordinated attack between manned aerial vehicles and unmanned aerial vehicles. High dynamic battlefield environment, asymmetric combat tasks and heterogeneous multi-source coordination system lead to poor autonomous capability and real-time performance of unmanned aerial vhicles, and difficult strategic training, which is the difficulty of manned aerial vehicles and unmanned aerial vehicles cooperative attack research. A typical style of manned aerial vehicles and unmanned aerial vehicles cooperative attack pattern is designed based on the loyal wingman scheme of cooperative maneuvering of manned aerial vehicles and unmanned aerial vehicles. A reinforcement learning method based on improved multi-agent twin delayed deep deterministic (MATD3) policy gradient algorithm is proposed. Firstly, the cooperative maneuvering decision-making training framework based on MATD3 policy gradient algorithm, curriculum learning (CL) and the pre-train (PT) strategy based on transfer learning are designed to solve the difficult problem of the cooperative attack strategy training of manned aerial vehicles and unmanned aerial vehicles. Secondly, the reward function and state space for unmanned aerial vehicles cooperative maneuvers are established to facilitate multi-aerial coordinated operations. Finally, a digital simulation and deduction platform is built based on the six-degree-freedom simulation model to verify that the trained attack strategy has efficient attack and survivability and can guide the practical application of manned aerial vehicles and unmanned aerial vehicles coordinated arrack operations.

Key words: manned aerial vehicles and unmanned aerial vehicles coordination, air combat maneuver decision, deep reinforcement learning, loyal wingman

CLC Number: 

[an error occurred while processing this directive]