系统工程与电子技术 ›› 2026, Vol. 48 ›› Issue (2): 515-523.doi: 10.12305/j.issn.1001-506X.2026.02.13

• 系统工程 • 上一篇    

基于强化学习的海上移动目标搜索路径规划

杨鹏程1(), 杨清清1, 高盈盈1,*, 杨志伟1, 杨克巍1, 艾波2   

  1. 1. 国防科技大学系统工程学院,湖南 长沙 410073
    2. 山东科技大学测绘与空间信息学院,山东 青岛 266590
  • 收稿日期:2023-09-28 修回日期:2024-03-06 出版日期:2024-06-14 发布日期:2024-06-14
  • 通讯作者: 高盈盈 E-mail:15206889290@163.com
  • 作者简介:杨鹏程(2001—),男,硕士研究生,主要研究方向为应急管理与智能决策
    杨清清(1982—),女,教授,博士,主要研究方向为应急管理智能决策、资源优化与任务规划方法
    杨志伟(1988—),男,副教授,博士,主要研究方向为装备体系分析评估与优化、智能优化算法
    杨克巍(1977—),男,教授,博士,主要研究方向为大数据与体系工程、复杂系统机理
    艾 波(1979—),男,教授,博士,主要研究方向为海洋地理信息系统
  • 基金资助:
    国家自然科学基金(72001209,72231011,72374209);湖南省自然科学基金(2023JJ30641,2022JJ20047)资助课题

Path planning for maritime moving targets search based on reinforcement learning

Pengcheng YANG1(), Qingqing YANG1, Yingying GAO1,*, Zhiwei YANG1, Kewei YANG1, Bo AI2   

  1. 1. College of Systems Engineering,National University of Defense Technology,Changsha 410073,China
    2. College of Geodesy and Geomatics,Shandong University of Science and Technology,Qingdao 266590,China
  • Received:2023-09-28 Revised:2024-03-06 Online:2024-06-14 Published:2024-06-14
  • Contact: Yingying GAO E-mail:15206889290@163.com

摘要:

海上移动目标搜索是海上搜救行动的重要环节,搜索效率直接影响搜救行动的成功率。针对海上移动目标搜索路径规划问题进行了以下三部分工作。首先,提出基于最小覆盖矩形的搜索区域确定算法,以划分最佳搜索区域;其次,构建基于强化学习的搜索路径规划算法,综合考虑多重约束条件,实现了子区域的最佳搜索路径规划;最后,研发决策支持系统,验证了算法的有效性和鲁棒性,并扩展到多搜索单元合作的场景。研究结果表明,所提算法在不同搜索场景下均有良好的求解效果,对提高海上搜救行动的成功率具有重要的理论意义和应用价值。

关键词: 海上移动目标搜索, 强化学习, 路径规划, 决策支持

Abstract:

Maritime moving target search plays a crucial role in maritime search and rescue operations, as the search efficiency directly impacts the success rate of the operations. Aiming at the maritime moving target search path planning problem, the following three parts of work are done. Firstly, an algorithm to determine the search area based on the minimum coverage rectangle is proposed to divide the optimal search area. Next, a search path planning algorithm with reinforcement learning is constructed, comprehensively considering multiple constraints, and achieving optimal search path planning of sub-areas. Finally, a decision support system is developed to verify the effectiveness and robustness of the algorithms and it is extended to scenarios involving multi-search unit cooperation. The results demonstrate that the proposed algorithms have achieved positive outcomes across a spectrum of search scenarios. This showcases not only considerable theoretical significance but also practical potential in enhancing the success rate of search and rescue operations.

Key words: maritime moving targets search, reinforcement learning, path planning, decision support

中图分类号: