系统工程与电子技术 ›› 2023, Vol. 45 ›› Issue (5): 1305-1314.doi: 10.12305/j.issn.1001-506X.2023.05.06

• 电子技术 • 上一篇    

基于低秩双线性池化注意力网络的舰船目标识别

关欣1, 国佳恩1,2,*, 衣晓1   

  1. 1. 海军航空大学, 山东 烟台 264001
    2. 中国人民解放军91422部队, 山东 烟台 265200
  • 收稿日期:2022-01-11 出版日期:2023-04-21 发布日期:2023-04-28
  • 通讯作者: 国佳恩
  • 作者简介:关欣(1978—), 女, 教授, 博士, 主要研究方向为信息融合、电子对抗、智能计算
    国佳恩(1998—), 男, 硕士研究生, 主要研究方向为航迹关联、多模态信息融合
    衣晓(1976—), 男, 教授, 博士, 主要研究方向为无线传感器网络、多源信息融合

Ship target recognition based on low rank bilinear pooling attention network

Xin GUAN1, Jiaen GUO1,2,*, Xiao YI1   

  1. 1. Naval Aviation University, Yantai 264001, China
    2. Unit 91422 of the PLA, Yantai 265200, China
  • Received:2022-01-11 Online:2023-04-21 Published:2023-04-28
  • Contact: Jiaen GUO

摘要:

针对多模态舰船图像融合识别质量不高等问题,构建了一种端到端的低秩双线性池化注意力网络。首先对各模态原始特征向量基于跨模类别中心进行注意力加权重构, 使不同模态特征更好地关注公共类别信息; 然后采用双线性池化捕获不同模态图像的交互信息, 并引入权重低秩分解降低网络参数规模; 最后依靠特征级联实现模态信息的交互与互补, 并设计联合损失提升网络跨模态融合识别效果。实验结果表明, 相比现有融合方法, 所提方法可有效提升多模态遥感舰船图像的融合识别效果, 在公开的遥感舰船数据集上取得了较高的识别准确率。

关键词: 舰船识别, 双线性池化, 跨模类别中心, 注意力加权, 跨模联合损失

Abstract:

In order to solve the problem of low quality of multimodal ship image fusion recognition, an end-to-end low rank bilinear pooling attention network is constructed. Firstly, the original feature vector of each modal are reconstructed based on the cross-modal category center, so that different modal features can pay more attention to the common category information. Then, bilinear pooling is used to capture the interactive information of different modal images, and the weight low rank decomposition is introduced to reduce the scale of network parameters. Finally, the interaction and complementarity of modal information are realized by feature cascaded, and the joint loss is designed to improve the effect of network cross-modal fusion recognition. The experimental results show that compared with the existing fusion methods, the proposed method can effectively improve the fusion recognition effect of multimodal remote sensing ship images, and achieve high recognition accuracy on the public remote sensing ship datasets.

Key words: ship recognition, bilinear pooling, cross-modal category center, attention weighting, cross-modal joint loss

中图分类号: