系统工程与电子技术 ›› 2026, Vol. 48 ›› Issue (2): 556-568.doi: 10.12305/j.issn.1001-506X.2026.02.17

• 系统工程 • 上一篇    

基于改进MCTS的多无人机多任务联合决策

魏建林1(), 林彦超2, 唐慧龙1, 张旺1, 王伟1,*   

  1. 1. 哈尔滨工程大学智能科学与工程学院,黑龙江 哈尔滨 150001
    2. 北京航天自动控制研究所,北京 100854
  • 收稿日期:2024-10-24 修回日期:2025-02-21 出版日期:2025-05-20 发布日期:2025-05-20
  • 通讯作者: 王伟 E-mail:18879161337@163.com
  • 作者简介:魏建林(1999—),男,博士研究生,主要研究方向为雷达电子对抗
    林彦超(1989—),男,高级工程师,硕士,主要研究方向为雷达系统决策
    唐慧龙(2000—),男,博士研究生,主要研究方向为目标探测
    张 旺(1999—),男,硕士研究生,主要研究方向为干扰决策
  • 基金资助:
    国家自然科学基金(62271163)资助课题

Multi-UAV multi-mission joint decision making based on improved MCTS

Jianlin WEI1(), Yanchao LIN2, Huilong TANG1, Wang ZHANG1, Wei WANG1,*   

  1. 1. College of Intelligent Systems Science and Engineering,Harbin Engineering University,Harbin 150001,China
    2. Beijing Aerospace Automatic Control Institute,Beijing 100854,China
  • Received:2024-10-24 Revised:2025-02-21 Online:2025-05-20 Published:2025-05-20
  • Contact: Wei WANG E-mail:18879161337@163.com

摘要:

在多无人机协同突防过程中,针对无人机需完成目标分配与探干侦动作选择的任务决策存在模型构建难,对模型求解方法复杂度高的问题,提出一种蒙特卡罗树搜索(Monte Carlo tree search, MCTS)的改进方法实现多无人机多任务联合决策。首先,考虑无人机与雷达对抗中的角度、距离等态势要素,以及当前无人机动作执行成功概率和雷达状态有效概率,构建多无人机目标分配与探干侦动作统一决策数学模型。其次,提出搜索次数自适应调整的改进MCTS算法对模型求解,实现大规模解空间在线快速寻优。仿真结果表明,改进算法使多雷达系统对无人机的威胁程度下降约16.8%,相比多臂赌博机算法效果提升约5.08%,决策时间约0.23 s,比传统MCTS缩短约45.7%,有助于提升无人机战场生存率。

关键词: 多无人机协同, 任务联合决策, 目标分配, 蒙特卡罗树搜索算法

Abstract:

In the process of multiple unmanned aerial vehicles (UAV) cooperative penetration, to address the challenges of model construction and the high complexity of solving target allocation and detection jamming and reconnaissance action selection tasks, an improved method of Monte Carlo tree search (MCTS) is proposed to realize joint multi-mission decision making for multiple UAVs. First, a unified mathematical decision model for multi-UAV target allocation and reconnaissance actions is constructed, considering the situational factors such as angle and distance in the confrontation between UAV and radars, as well as the current probability of successful execution of UAV actions and the effective probability of radar states. An improved MCTS algorithm with adaptive adjustment of search frequency is proposed for fast online optimization in large solution spaces. Simulation results show that the improved algorithm decreases the threat level of multi-radar system to UAVs by about 16.8%, improves the effect of the algorithm compared to the multi-armed gambling machine by about 5.08%, and the decision time is about 0.23 s, which is about 45.7% shorter than that of the traditional MCTS, thereby improving UAV battlefield survivability.

Key words: multi-UAV collaboration, task joint decision, target assignment, Monte Carlo tree search (MCTS)

中图分类号: