Systems Engineering and Electronics ›› 2020, Vol. 42 ›› Issue (2): 414-419.doi: 10.3969/j.issn.1001-506X.2020.02.21

Previous Articles     Next Articles

Reinforcement learning guidance law of Q-learning

Qinhao ZHANG1(), Baiqiang AO1(), Qinxue ZHANG2()   

  1. 1. Beijing Institute of Electronic Engineering, Beijing 100854, China
    2. College of Computer Science, North China Institute of Aerospace Engineering, Langfang 065000, China
  • Received:2019-07-26 Online:2020-02-01 Published:2020-01-23
  • Supported by:
    中国博士后科学基金资助课题(2017M620863)

Abstract:

As the intelligent missile being a major development trend, it is foreseeable that it will become a precise and effective strike weapon in the future battlefields. On the basis of the traditional proportional guidance law, this paper proposes a guidance algorithm based on reinforcement learning with variable proportional coefficient. Taking the line-of-sight rate as the state, this algorithm designs a discretized action space, as well as a reward function based on the miss distance, to determine the correct guidance command for the missile. The simulation results prove the algorithm possesses better guidance accuracy than the traditional proportional guidance law and endows the missile with the ability of autonomous decision-making.

Key words: proportional guidance, guidance law, miss distance, maneuvering target, reinforcement learning, Q-learning, timing difference algorithm

CLC Number: 

[an error occurred while processing this directive]