Systems Engineering and Electronics ›› 2022, Vol. 44 ›› Issue (5): 1652-1661.doi: 10.12305/j.issn.1001-506X.2022.05.27

• Guidance, Navigation and Control • Previous Articles     Next Articles

Optimization method for orbit transfer of all-electric propulsion satellite based on reinforcement learning

Mingren HAN1,2, Yufeng WANG1,2,*   

  1. 1. Beijing Institute of Control Engineering, Beijing 100094, China
    2. Science and Technology on Space Intelligent Control Laboratory, Beijing 100094, China
  • Received:2021-07-09 Online:2022-05-01 Published:2022-05-16
  • Contact: Yufeng WANG

Abstract:

Using electric thrusters for autonomous orbit transfer is one of the critical technologies in the field of all-electric propulsion satellites. In order to solve the orbit raising problem of all-electric propulsion geostationary orbit (GEO) satellites, a reinforcement learning-based optimization method for the time-optimal low-thrust orbit transfer strategy is formulated by combining generalized advantage estimator (GAE) and proximal policy optimization (PPO) methods, taking into account the influence of multiple orbital perturbations and the constraints of the earth's shadow. Aiming at the key problem of training difficulty caused by too large state space and sparse reward, training acceleration methods such as action output mapping and hierarchical reward are proposed, which effectively improve the training efficiency and accelerate the convergence speed. Through numerical simulation and comparison of the results with the direct method, the indirect method and the feedback control method, it shows that the optimization method based on reinforcement learning is more simple, flexible, efficient, and time-optimal in orbit transfer.

Key words: all-electric propulsion satellite, low-thrust orbit transfer optimization, reinforcement learning, proximal policy optimization (PPO), training acceleration method

CLC Number: 

[an error occurred while processing this directive]