The abundant contextual information in synthetic aperture radar (SAR) images remains underutilized in deep learning-based ship detection. Hence, this study proposed a novel method for detecting ships from SAR images based on high-dimensional contextual attention and dual receptive field enhancement. The dual receptive field enhancement was employed to extract multi-dimensional feature information from SAR images, thereby guiding the dynamic attention matrix to learn rich contextual information during the coarse-to-fine extraction of high-dimensional features. Based on YOLOv7, a YOLO-HD network was constructed by incorporating a lightweight convolutional module, a lightweight asymmetric multi-level compression detection head, and a new loss function,XIoU. A comparative experiment was conducted on the E-HRSID and SSDD datasets. The proposed method achieved average detection accuracy of 91.36 % and 97.64 %, respectively, representing improvements by 4.56 and 9.83 percentage points compared to the original model, and outperforming other classical models.
郭伟, 李煜, 金海波. 高维上下文注意和双感受野增强的SAR船舶检测[J]. 自然资源遥感, 2025, 37(3): 104-112.
GUO Wei, LI Yu, JIN Haibo. Detecting ships from SAR images based on high-dimensional contextual attention and dual receptive field enhancement. Remote Sensing for Natural Resources, 2025, 37(3): 104-112.
Zeng T, Wen Y H, Wang Y, et al. Research progress on synthetic aperture Radar parametric imaging methods[J]. Journal of Radars, 2021, 10(3):327-341.
[2]
Gong M, Li Y, Jiao L, et al. SAR change detection based on intensity and texture changes[J]. ISPRS Journal of Photogrammetry and Remote Sensing, 2014,93:123-135.
[3]
El-Darymli K, McGuire P, Power D, et al. Target detection in synthetic aperture Radar imagery:A state-of-the-art survey[J]. Journal of Applied Remote Sensing, 2013, 7(1):071598.
Zhang F, Lu S T, Xiang D L, et al. An improved superpixel-based CFAR method for high-resolution SAR image ship target detection[J]. Journal of Radars, 2023, 12(1):120-139.
[5]
Sugimoto M. SAR image analysis target detection utilizing polarimetricinformation[D]. Yokosukashi: National Defense Academy Graduate School of Science and Engineering, 2013.
[6]
Wang C, Wang Y, Liao M. Removal of azimuth ambiguities and detection of a ship:Usingpolarimetric airborne C-band SAR images[J]. International Journal of Remote Sensing, 2012, 33(10):3197-3210.
[7]
Liu T, Yang Z, Marino A, et al. PolSAR ship detection based on neighborhood polarimetric covariance matrix[J]. IEEE Transactions on Geoscience and Remote Sensing, 2021, 59(6):4874-4887.
[8]
Zhang P, Zhang J F, Liu T. Constant false alarm rate detection of slow targets in polarimetric along-track interferometric synthetic aperture radar imagery[J]. IET Radar,Sonar& Navigation, 2019, 13(1):31-44.
Zhang P, Zhang J F, Liu T. Slow and small target CFAR detection of polarimetric along-track interferometric SAR using coherence optimization[J]. Journal of Beijing University of Aeronautics and Astronautics, 2019, 45(3):575-587.
[10]
Girshick R. Fast R-CNN[C]// 2015 IEEE International Conference on Computer Vision (ICCV).December 7-13,2015,Santiago,Chile.IEEE, 2015:1440-1448.
[11]
Liu W, Anguelov D, Erhan D, et al. SSD:single shot MultiBox detector[M]// Lecture Notes in Computer Science. Cham:SpringerInternational Publishing, 2016:21-37.
[12]
Wang C Y, Bochkovskiy A, Liao H Y M. YOLOv7:Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors[C]// 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).June 17-24,2023,Vancouver,BC,Canada.IEEE, 2023:7464-7475.
[13]
Li Y, Yao T, Pan Y, et al. Contextual transformer networks for visual recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 45(2):1489-1500.
[14]
Rao Y, Zhao W, Tang Y, et al. Hornet:Efficient high-order spatial interactions with recursive gated convolutions[J]. Advances in Neural Information Processing Systems, 2022,35:10353-10366.
[15]
Vaswani A. Attention is all you need[J]. Advances in Neural Information Processing Systems, 2017.
[16]
Li L, Li B, Zhou H. Lightweight multi-scale network for small object detection[J]. PeerJ Computer Science, 2022,8:e1145.
[17]
Yu Z, Huang H, Chen W, et al. YOLO-FaceV2:A scale and occlusion aware face detector[J]. Pattern Recognition, 2024,155:110714.
[18]
Chen J, Kao S H, He H, et al. Run,don’twalk:Chasing higher FLOPS for faster neural networks[C]// 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).June 17-24,2023,Vancouver,BC,Canada.IEEE, 2023:12021-12031.
[19]
Chollet F. Xception:Deep learning with depthwise separable convolutions[C]// 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).July 21-26,2017,Honolulu,HI,USA.IEEE, 2017:1800-1807.
[20]
Ioannou Y, Robertson D, Cipolla R, et al. Deeproots:Improving CNN efficiency with hierarchical filter groups[C]// 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).July 21-26,2017,Honolulu,HI,USA.IEEE, 2017:5977-5986.
[21]
Wei S, Zeng X, Qu Q, et al. HRSID:A high-resolution SAR images dataset for ship detection and instance segmentation[J]. IEEE Access, 2020,8:120234-120254.
[22]
Zhang T, Zhang X, Li J, et al. SAR ship detection dataset (SSDD):Official release and comprehensive data analysis[J]. Remote Sensing, 2021, 13(18):3690.
[23]
Chicco D, Jurman G. The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation[J]. BMC Genomics, 2020, 21(1):6.
doi: 10.1186/s12864-019-6413-7
pmid: 31898477
[24]
Hou X, Wang D, Krähenbühl P. Objects as points[J/OL]. arXiv, 2019. https://arxiv.org/pdf/1904.07850.
[25]
Tan M, Pang R, Le Q V. EfficientDet:Scalable and efficient object detection[C]// 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).June 13-19,2020,Seattle,WA,USA.IEEE, 2020:10778-10787.
[26]
Ren S, He K, Girshick R, et al. FasterR-CNN:Towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6):1137-1149.
[27]
Lin T Y, Goyal P, Girshick R, et al. Focal loss for dense object detection[C]// 2017 IEEE International Conference on Computer Vision (ICCV).October 22-29,2017,Venice,Italy.IEEE, 2017:2999-3007.
[28]
Redmon J, Divvala S, Girshick R, et al. You only look once:Unified,real-time object detection[C]// 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).June 27-30,2016,Las Vegas,NV,USA.IEEE, 2016:779-788.