系统工程与电子技术 ›› 2026, Vol. 48 ›› Issue (6): 2072-2080.doi: 10.12305/j.issn.1001-506X.2026.06.27

• 制导、导航与控制 • 上一篇    下一篇

基于多特征正交约束的无人机跨视角地理定位方法

刘瑞航1, 刘海颖1,2,*, 刘宇辰1, 陈晨1, 李铁香2,3   

  1. 1. 南京航空航天大学航天学院,江苏 南京 211106
    2. 南京应用数学中心,江苏 南京 210018
    3. 东南大学数学学院,江苏 南京 210096
  • 收稿日期:2025-03-03 修回日期:2025-04-02 出版日期:2026-06-25 发布日期:2025-06-10
  • 通讯作者: 刘海颖
  • 作者简介:刘瑞航(2001—),男,硕士研究生,主要研究方向为深度学习、视觉导航、遥感与地理信息系统
    刘宇辰(2000—),男,硕士研究生,主要研究方向为导航制导与控制
    陈 晨(2001—),男,硕士研究生,主要研究方向为多域联合作战规划
    李铁香(1979—),女,教授,博士,主要研究方向为大规模矩阵计算
  • 基金资助:
    国家自然科学基金(12371377)资助课题

Diverse features orthogonal constraint-based cross-view geo-localization method for UAVs

Ruihang LIU1, Haiying LIU1,2,*, Yuchen LIU1, Chen CHEN1, Tiexiang LI2,3   

  1. 1. College of Astronautics,Nanjing University of Aeronautics and Astronautics,Nanjing 211106,China
    2. Nanjing Center for Applied Mathematics,Nanjing 210018,China
    3. School of Mathematics,Southeast University,Nanjing 210096,China
  • Received:2025-03-03 Revised:2025-04-02 Online:2026-06-25 Published:2025-06-10
  • Contact: Haiying LIU

摘要:

在有限的训练数据下,通过深度学习来学习更具判别性的特征仍是一个挑战。本文提出一种基于Transformer的自适应正交约束神经网络,通过将原始空间划分为多个子空间来增加嵌入空间的密度,学习更深层次的特征。基于正交约束的思想,提出了一种重叠的子空间分离的方法,从而减少同质信息。同时,动态调整子空间之间的权重,优化因相似性而难以训练的样本。分别在University-1652数据集、实测数据和不同天气条件模拟数据上进行了实验,结果表明,与基线算法相比,本文提出的算法在定位和导航任务中的性能分别提升了29.58%/26.52%(R@1/AP)和21.51%/27.98%(R@1/AP)。尽管训练数据有限,本文提出的模型仍能学习到更鲁棒和更具判别性的特征表征,从而促进相似类别的识别,帮助无人机实现准确的定位和导航。

关键词: 无人机, 跨视角地理定位, 视觉Transformer, 正交约束, 图像检索

Abstract:

Under limited training data, learning more discriminative features through deep learning remains a challenging task. In this paper, we propose an adaptive orthogonal constraint neural network based on Transformer. This approach aims to enhance the density of the embedding space by dividing the original space into multiple subspaces, thereby facilitating the learning of deeper features. Based on the concept of orthogonal constraints, a method for overlapping subspace separation is proposed, thereby reducing homogeneous information. Furthermore, the weights between subspaces are dynamically adjusted to optimize the samples that are difficult to train due to similarity. Experiments are conducted on the University-1652 dataset, real word measured data and simulated data under different weather conditions, and the results show that the algorithm proposed in this paper improves the performance in localization and navigation tasks by 29.58%/26.52% (R@1/AP) and 21.51%/27.98% (R@1/AP), respectively, compared with the baseline algorithm. Notwithstanding the constrained training data, the model proposed in this paper learns more robust and discriminative feature representations, facilitating the identification of similar classes and aiding unmanned aerial vehicle in achieving precise localization and navigation.

Key words: unmanned aerial vehicles (UAVs), cross-view geo-localization, visual Transformer, orthogonal constraints, image retrieval

中图分类号: