Systems Engineering and Electronics ›› 2024, Vol. 46 ›› Issue (11): 3764-3773.doi: 10.12305/j.issn.1001-506X.2024.11.18

• Systems Engineering • Previous Articles     Next Articles

Cluster multi-target fire planning method based on PPO algorithm

Hucheng QIN, Yanyan HUANG, Tiande CHEN, Han ZHANG   

  1. School of Automation, Nanjing University of Science and Technology, Nanjing 210094, China
  • Received:2023-07-17 Online:2024-10-28 Published:2024-11-30
  • Contact: Yanyan HUANG

Abstract:

To solve the problem of multi-target firepower planning in defensive combat scenarios under high dynamic battlefield situation, a firepower planning method based on the proximal strategy optimization algorithm is proposed. With the goal of maximizing combat effectiveness, the reinforcement learning reward function is designed from four aspects: ammunition consumption, combat effect, combat cost and combat time. Considering the influence of historical decision sequence on the current planning, the neural network is designed based on the Actor-Critic framework with the long short-term memory network (LSTM) as the core. The network is trained by the proximal strategy optimization algorithm, and the trained reinforcement learning agent is used for sequential decision-making. A series of coherent fire planning schemes are generated in real time according to the situation of multiple decision-making stages. Simulation results show that the agent can realize multi-target firepower planning under high dynamic situation, and its computational efficiency has more obvious advantages than other algorithms.

Key words: multi-target firepower planning, proximal strategy optimization algorithm, long short-term memory network (LSTM), sequential decision-making

CLC Number: 

[an error occurred while processing this directive]