Systems Engineering and Electronics ›› 2025, Vol. 47 ›› Issue (7): 2216-2223.doi: 10.12305/j.issn.1001-506X.2025.07.15

• Systems Engineering • Previous Articles    

Reinforcement learning-based resilience optimization method of equipment system-of-systems

Jiahao LIU1, Renjie XU1,2,*, Maotong SUN2, Jiuyao JIANG1, Jichao LI1, Kewei YANG1   

  1. 1. College of Systems Engineering, National University of Defense Technology, Changsha 410073, China
    2. School of Management, Technical University of Munich, Heilbronn 74076, Germany
  • Received:2023-05-29 Online:2025-07-16 Published:2025-07-22
  • Contact: Renjie XU

Abstract:

The equipment system-of-systems (ESoS) inevitably is affected by disturbance events such as external attacks and internal failures in actual operation, causing multiple equipment node failures. How to scientifically and rationally formulate recovery strategies to quickly restore system capabilities and enhance the resilience of the ESoS has important military value and significance. Based on this, this paper proposes an ESoS resilience optimization method based on reinforcement learning. Firstly, the ESoS resilience measurement index is established by integrating network topology and network performance parameters. Secondly, a reinforcement learning algorithm based on Q-Learning node recovery sequence is proposed, and different disturbance scenarios are used to test the change of resilience. Finally, combined with typical cases to verify the feasibility and effectiveness of the proposed algorithm. Through comparative experiments with empirical recovery strategies and genetic algorithm, the results show that with deliberate attacks, the toughness value obtained based on reinforcement learning is 37.46% higher than that based on node ability importance priority recovery strategy, degree priority recovery strategy and random recovery strategy 52.28% and 85.65%; compared with the genetic algorithm, the resilience value obtained after optimization increased by 28.72%. The above analysis effectively shows the superiority of the proposed method and model.

Key words: equipment system-of-systems (ESoS), reinforcement learning, resilience optimization, recovery strategy

CLC Number: 

[an error occurred while processing this directive]