|
Abstract High-resolution remote sensing images contain rich data and information, which reduce the difference between the target and the background, resulting in substandard detection accuracy and reduced target detection performance. Based on the deep learning algorithm You Only Look Once (YOLO), this study designed a lightweight network model GC-YOLOv5 by combining end-to-end coordinate attention (CA) and the lightweight network module GhostConv. The CA was employed to encode channels along the horizontal and vertical directions, enabling the attention mechanism module to simultaneously capture remote spatial interactions with precise location information and helping the network locate targets of interest more accurately. The original ordinary convolutional module convolutional-batchnormal-SiLu (CBS) was replaced by the GhostConv module, reducing the number of parameters in the feature channel fusion process and the size of the optimal model. Experiments were conducted on the GC-YOLOv5 using the publicly available NWPU-VHR-10 dataset, with the robustness of the model verified on the RSOD dataset. The results show that GC-YOLOv5 yielded a detection accuracy of 96.5% on the NWPU-VHR-10 dataset, with a recall rate of 96.4% and mAP of 97.7%. Moreover, GC-YOLOv5 achieved satisfactory results on the RSOD dataset.
|
Keywords
deep learning
remote sensing image
target detection
YOLOv5
|
|
Issue Date: 14 June 2024
|
|
|
[1] |
Fang X L, Hu F, Yang M, et al. Small object detection in remote sensing images based on super-resolution[J]. Pattern Recognition Letters, 2022, 153:107-112.
|
[2] |
Girshick R, Donahue J, Darrell T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation[C]// 2014 IEEE Conference on Computer Vision and Pattern Recognition.IEEE, 2014:580-587.
|
[3] |
Girshick R. Fast R-CNN[C]// 2015 IEEE International Conference on Computer Vision (ICCV).IEEE, 2015:1440-1448.
|
[4] |
Ren S Q, He K M, Girshick R, et al. Faster R-CNN:Towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6):1137-1149.
|
[5] |
Redmon J, Divvala S, Girshick R, et al. You only look once:Unified,real-time object detection[C]// 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).IEEE, 2016:779-788.
|
[6] |
Redmon J, Farhadi A. YOLO9000:Better,faster,stronger[C]// 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).IEEE, 2017:6517-6525.
|
[7] |
Redmon J, Farhadi A. YOLOv3:An incremental improvement[J/OL]. ArXiv, 2018(2018-4-8). https://arxiv.org/abs/1804.02767.
url: https://arxiv.org/abs/1804.02767
|
[8] |
Wang C Y, Bochkovskiy A, Liao H Y M. YOLOv7:Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors[J/OL]. arXiv, 2022(2022-7-6). http://arxiv.org/abs/2207.02696.
url: http://arxiv.org/abs/2207.02696
|
[9] |
Jiang P Y, Ergu D J, Liu F Y, et al. A review of YOLO algorithm developments[J]. Procedia Computer Science, 2022, 199:1066-1073.
|
[10] |
Liu W, Anguelov D, Erhan D, et al. SSD:single shot MultiBox detector[C]// Computer Vision-ECCV 2016. Springer International Publishing, 2016:21-37.
|
[11] |
Lamane M, Tabaa M, Klilou A. Classification of targets detected by mmWave radar using YOLOv5[J]. Procedia Computer Science, 2022, 203:426-431.
|
[12] |
Puliti S, Astrup R. Automatic detection of snow breakage at single tree level using YOLOv5 applied to UAV imagery[J]. International Journal of Applied Earth Observation and Geoinformation, 2022, 112:102946.
|
[13] |
Li S W, Gu X Y, Xu X R, et al. Detection of concealed cracks from ground penetrating radar images based on deep learning algorithm[J]. Construction and Building Materials, 2021, 273:121949.
|
[14] |
Wang Y, Bashir S M A, Khan M, et al. Remote sensing image super-resolution and object detection:Benchmark and state of the art[J]. Expert Systems with Applications, 2022, 197:116793.
|
[15] |
Qu Z F, Zhu F Z, Qi C X. Remote sensing image target detection:Improvement of the YOLOv3 model with auxiliary networks[J]. Remote Sensing, 2021, 13(19):3908.
|
[16] |
Qi J T, Liu X N, Liu K, et al. An improved YOLOv5 model based on visual attention mechanism:Application to recognition of tomato virus disease[J]. Computers and Electronics in Agriculture, 2022, 194:106780.
|
[17] |
Bao W X, Du X, Wang N, et al. A defect detection method based on BC-YOLO for transmission line components in UAV remote sensing images[J]. Remote Sensing, 2022, 14(20):5176.
|
[18] |
Chen W, Gao L, Li X Y, et al. Lightweight convolutional neural network with knowledge distillation for cervical cells classification[J]. Biomedical Signal Processing and Control, 2022, 71:103177.
|
[19] |
Yao J, Qi J M, Zhang J, et al. A real-time detection algorithm for kiwifruit defects based on YOLOv5[J]. Electronics, 2021, 10(14):1711.
|
[20] |
Han K, Wang Y H, Tian Q, et al. GhostNet:More features from cheap operations[C]// 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).IEEE, 2020:1577-1586.
|
[21] |
Hou Q, Zhou D, Feng J. Coordinate attention for efficient mobile network design[C]// 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).IEEE, 2021:13708-13717.
|
[22] |
Cheng G, Han J W, Zhou P C, et al. Multi-class geospatial object detection and geographic image classification based on collection of part detectors[J]. ISPRS Journal of Photogrammetry and Remote Sensing, 2014, 98:119-132.
|
[23] |
Cheng G, Han J W. A survey on object detection in optical remote sensing images[J]. ISPRS Journal of Photogrammetry and Remote Sensing, 2016, 117:11-28.
|
[24] |
Long Y, Gong Y P, Xiao Z F, et al. Accurate object localization in remote sensing images based on convolutional neural networks[J]. IEEE Transactions on Geoscience and Remote Sensing, 2017, 55(5):2486-2498.
|
[25] |
Jocher G, Chaurasia A, Stoken A, et al. Ultralytics/YOLOv5:V6.2-YOLOv5 classification models,apple M1,reproducibility,clearML and deci.ai integrations[Z]. Zenodo, 2022.
|
[26] |
Hu J, Shen L, Sun G. Squeeze-and-excitation networks[C]// 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.IEEE, 2018:7132-7141.
|
[27] |
Woo S H Y, Park J C, Lee J Y, et al. CBAM:Convolutional block attention module[C]// European Conference on Computer Vision.Springer, 2018:3-19.
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|