Systems Engineering and Electronics ›› 2025, Vol. 47 ›› Issue (5): 1421-1431.doi: 10.12305/j.issn.1001-506X.2025.05.05

• Electronic Technology • Previous Articles    

Remote sensing small target detection algorithm based on cross-scale feature fusion

Kai SHAO1,2,3,*, Haogang LI1, Yan LIANG1, Jing NING1, Wu CHEN1   

  1. 1. School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
    2. Chongqing Key Laboratory of Mobile Communications Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
    3. Engineering Research Center of Mobile Communications of the Ministry of Education, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
  • Received:2024-05-09 Online:2025-06-11 Published:2025-06-18
  • Contact: Kai SHAO

Abstract:

For three problems of shallow thinning features, deep semantic representation and multi-scale information extraction for the detection of small targets in remote sensing images, a cross-scale YOLOv7(CSYOLOv7) network by comprehensively applying multiple technologies is proposed. Firstly, a cross-stage feature extraction module (CFEM) and a receptive field feature enhancement module (RFFEM) are designed. CFEM is to improve the model's ability of refining feature extraction and suppress the loss of features during shallow down-sampling. RFFEM is to increase the network's ability of extracting deep semantic features and improve the model's ability of acquiring target context information. Secondly, a cross-gradient space pyramid pool module (CSPPM) is designed to effectively fuse global multi-scale and local features of small targets. Finally, shape intersection over union (Shape IoU) is used to replace the complete intersection over union (CIoU) to improve the accuracy of the model in the bounding box positioning task. Experimental results show that the CSYOLOv7 network achieves detection accuracy of 74% and 89.6% on the Dataset for Image Object Recognition (DIOR) data set and Northwestern Polytechnical University Very High Resolution-10 (NWPU VHR-10) data set respectively, which effectively improves the detection effect of small targets in remote sensing images.

Key words: remote sensing image, small target, feature extraction, context information

CLC Number: 

[an error occurred while processing this directive]