系统工程与电子技术 ›› 2022, Vol. 44 ›› Issue (10): 3174-3181.doi: 10.12305/j.issn.1001-506X.2022.10.21

• 系统工程 • 上一篇    下一篇

针对无人潜航器的反潜策略研究

曾斌1,*, 张鸿强1, 李厚朴2   

  1. 1. 海军工程大学管理工程与装备经济系, 湖北 武汉 430033
    2. 海军工程大学导航工程系, 湖北 武汉 430033
  • 收稿日期:2021-12-22 出版日期:2022-09-20 发布日期:2022-10-24
  • 通讯作者: 曾斌
  • 作者简介:曾斌(1970—), 男, 教授, 博士, 主要研究方向为信息管理|张鸿强(1996—), 男, 硕士研究生, 主要研究方向为信息管理|李厚朴(1985—), 男, 教授, 博士, 主要研究方向为计算机代数分析
  • 基金资助:
    国家优秀青年科学基金(42122025);国家自然科学基金(41974005);湖北省杰出青年科学基金(2019CFA086)

Research on anti-submarine strategy for unmanned undersea vehicles

Bin ZENG1,*, Hongqiang ZHANG1, Houpu LI2   

  1. 1. Department of Management and Economics, Naval University of Engineering, Wuhan 430033, China
    2. Department of Navigation Engineering, Naval University of Engineering, Wuhan 430033, China
  • Received:2021-12-22 Online:2022-09-20 Published:2022-10-24
  • Contact: Bin ZENG

摘要:

近年来无人潜航器对国家海洋国土安全带来的威胁逐渐增大, 其低噪声特性和隐蔽入侵方式也给反潜行动带来极大困难。为此,提出了一种两阶段规划算法, 用以学习优化反潜策略, 在部署阶段, 建立了基于不确定性马尔可夫决策过程的反潜资源分配模型, 并设计了鲁棒性部署策略强化学习算法, 用以求解不确定条件下分配模型的纳什均衡解。在搜索阶段, 建立了基于部分可观察马尔可夫决策过程的搜潜模型, 并设计了基于多智能体强化学习的搜潜策略学习算法。最后,通过仿真实验验证了本算法与比对算法相比具有更高的性能。

关键词: 反潜, 无人潜航器, 多智能体强化学习, 博弈论

Abstract:

In recent years, the threat of unmanned underwater vehicles (UUV) to national sea security has gradually increased. At the same, it is difficult to detect UUVs for its low noise and team intrusion. A two stage anti-submarine planning method is proposed to learn the optimal anti-submarine strategy. During the deployment stage, a resource allocation model based on uncertain Markov decision process (MDP) is proposed, whose Nash equilibrium point is solved by the elaborately designed robust reinforcement learning algorithm of the deployment strategy. In the search stage, a search model based on partially observable Markov decision process (POMDP) is proposed which is solved by the search strategy learning algorithm based on multi-agent reinforcement learning (MARL). Simulation results show that the proposed algorithm outperforms other algorithms.

Key words: anti-submarine, unmanned underwater vehicle (UUV), multi-agent reinforcement learning, game theory

中图分类号: