系统工程与电子技术 ›› 2023, Vol. 45 ›› Issue (6): 1624-1632.doi: 10.12305/j.issn.1001-506X.2023.06.05

• 电子技术 • 上一篇    

基于改进Actor-Critic算法的多传感器交叉提示技术

韦道知, 张曌宇, 谢家豪, 李宁   

  1. 空军工程大学防空反导学院, 陕西 西安 710051
  • 收稿日期:2022-01-19 出版日期:2023-05-25 发布日期:2023-06-01
  • 通讯作者: 张曌宇
  • 作者简介:韦道知(1977—), 男, 副教授, 博士, 主要研究方向为空天目标协同跟踪与拦截引导
    张曌宇(1996—), 女, 硕士研究生, 主要研究方向为空天目标协同跟踪与拦截引导
    谢家豪(1996—), 男, 博士研究生, 主要研究方向为空天目标协同跟踪与拦截引导
    李宁(1996—), 男, 硕士研究生, 主要研究方向为空天目标协同跟踪与拦截引导

Multi-sensor cross cueing technique based on improved Actor-Critic algorithm

Daozhi WEI, Zhaoyu ZHANG, Jiahao XIE, Ning LI   

  1. Air and Missile Defense College, Air Force Engineering University, Xi'an 710051, China
  • Received:2022-01-19 Online:2023-05-25 Published:2023-06-01
  • Contact: Zhaoyu ZHANG

摘要:

针对在减少战场资源浪费、平衡战场效费比的同时提高目标探测概率, 保证目标的可持续跟踪, 提出利用改进Actor-Critic算法的多传感器交叉提示技术进行目标探测。首先, 综合传感器探测、能耗、时效等因素搭建基于“交叉提示”传感器的动态管理评估模型; 其次, 重点分析利用Actor-Critic交叉提示算法的传感器管理决策规则, 并且提出了Actor-Critic算法,以根据任务自身需求组建中央评价网络, 加大传感器与外部环境的交互。仿真结果表明,改进的算法可以加速网络收益, 实现对目标的持续性探测, 加强传感器之间的交叉提示功能, 提升调度的智能化水平, 具有较大的应用价值。

关键词: 多传感器交叉提示, Actor-Critic算法, 强化学习, 目标探测, 传感器资源调度

Abstract:

To reduce the waste of battlefield resources, balance the cost-effectiveness ratio of the battlefield, increase the probability of target detection, and to ensure the sustainable tracking of the target, a multi-sensor cross-cueing technique based on improved Actor-Critic algorithm for target detection is proposed in this paper. Firstly, a sensor dynamic management evaluation model based on "cross-cueing" is built by integrating factors such as sensor detection, energy consumption, and timing. Secondly, the paper focuses on analyzing the decision rules of sensor management under the Actor-Critic cross-cueing algorithm. The improvement of the Actor-Critic algorithm is proposed to form a central evaluation network according to the needs of the task itself, and enlarge the interaction between the sensor and the external environment. Simulation result shows that the improved algorithm can accelerate the profit of the alliance, realize the continuous detection of the target, strengthen the cross-cueing function between sensors, and improve the intelligent level of scheduling, which provides great application value.

Key words: multi-sensor cross-cueing, Actor-Critic algorithm, reinforcement learning, object detection, sensor resource scheduling

中图分类号: