基于深度特征的多方向目标检测研究

doi:10.6046/zrzyyg.2023139

摘要
图/表
参考文献
相关文章
Metrics

全文: PDF(1845 KB)

HTML
输出: BibTeX | EndNote (RIS)

摘要

近年来目标检测成为计算机视觉技术的重要分支,广泛应用于医学、军事、城轨等领域,随着卫星和遥感技术的进步,其获取的图像蕴含着丰富的信息,因此对这些图像中目标自动检测和理解变得至关重要。但是遥感影像中目标方向随意、密集等,传统目标检测方法容易导致漏检错检,针对此问题,该文提出多卷积核特征组合自适应区域生成网络(multi-convolution kernel feature combination adaptive region proposal network,MFCARPN)算法进行多方向检测,该算法引入多个不同卷积核提取特征,可以根据目标的差异性自适应地学习每个卷积核特征的权重参数,得到和目标更加匹配的特征图,同时通过结合目标原始特征使分类回归模型参数可以依据目标之间的差异性动态变化,提高区域生成网络(region proposal network,RPN)自适应能力。实验表明其在DOTA标准数据集的平均精度均值(mean average precision,mAP)达到75.52%,相较于GV算法提高0.5个百分点,由此证明了该算法的有效性。

	服务

	把本文推荐给朋友
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	于淼
	荆虹波
	王翔
	李兴久

关键词 ：遥感影像, 自适应, MFCARPN, 多方向检测

Abstract：

In recent years, target detection, as an important branch of computer vision technology, has been widely applied in fields such as medicine, military affairs, and urban rail transit. As satellite and remote sensing technologies advance, images obtained using these technologies contain abundant information. This makes it crucial to conduct automatic target detection and understanding of these images. However, due to the random directions and dense distribution of targets in remote sensing images, conventional methods are prone to lead to missing or incorrect detection. In response, this study proposes a multi-convolution kernel feature combination-based adaptive region proposal network (MFCARPN) algorithm for multi-directional detection. This algorithm introduces multiple convolution kernel features for target extraction. The weight parameters of these convolution kernel features can be determined through adaptive learning according to the differences between the targets, yielding the characteristic patterns that match better with targets. Meanwhile, in combination with the original features of the targets, the parameters of the classification and regression model vary dynamically according to the difference between targets. Thus, the RPN’s adaptive ability can be improved. The experimental results indicate that the mAP of the standard dataset DOTA reached up to 75.52%, which is 0.5 percentages higher than that of the baseline algorithm GV. Therefore, the MFCARPN algorithm proposed in this study proves effective.

Key words： remote sensing image adaptive ability MFCARPN multi-directional detection

收稿日期: 2023-05-17 出版日期: 2024-09-03

ZTFLH:

TP391.4

作者简介: 于淼(1984-),女,博士,高级工程师,主要围绕城市轨道交通行业精细化建设和运营管理开展信息化数字化产品研发工作。Email: yumiao4503210@126.com。

引用本文:

于淼, 荆虹波, 王翔, 李兴久. 基于深度特征的多方向目标检测研究[J]. 自然资源遥感, 2024, 36(3): 267-271.
YU Miao, JING Hongbo, WANG Xiang, LI Xingjiu. Multi-directional target detection based on depth features. Remote Sensing for Natural Resources, 2024, 36(3): 267-271.

链接本文:

https://www.gtzyyg.com/CN/10.6046/zrzyyg.2023139 或 https://www.gtzyyg.com/CN/Y2024/V36/I3/267

Fig.1 风险源识别研究路线

Fig.2 MFCARPN网络结构

Fig.3 池化自适应网络

Tab.1 DOTA数据集目标影像示例

Tab.2 定量实验结果

[1]	Tang T, Zhou S, Deng Z, et al. Arbitrary-oriented vehicle detection in aerial imagery with single convolutional neural networks[J]. Remote Sensing, 2017, 9(11):1170.
[2]	郭俊, 宁志勇, 王亚涛, 等. 基于视频大数据的施工安全监测研究及应用[J]. 科技与创新, 2023(6):153-159.
	Guo J, Ning Z Y, Wang Y T, et al. Research and application of construction safety monitoring based on video big data[J]. Science and Technology & Innovation, 2023(6):153-159.
[3]	江思阳. 基于计算机视觉的城市轨道交通弓网磨耗病害检测技术研究[D]. 北京: 北京交通大学, 2020.
	Jiang S Y. Research on abrasion defect detection technology of urban rail transit pantograph and catenary based on computer vision[D]. Beijing: Beijing Jiaotong University, 2020.
[4]	Krizhevsky A, Sutskever I, Hinton G E. ImageNet classification with deep convolutional neural networks[J]. Communication of the ACM, 2017, 60(6):84-90.
[5]	Redmon J, Farhadi A. YOLOv3:An incremental improvement[J/OL]. arXiv, 2018(2018-04-08). https://arxiv.org/abs/1804.02767.
[6]	Tian Z, Shen C, Chen H, et al. FCOS:Fully convolutional one-stage object detection[C]// 2019 IEEE/CVF International Conference on Computer Vision (ICCV).IEEE, 2020:9627-9636.
[7]	Jiang Y, Zhu X, Wang X, et al. R²CNN:Rotational region CNN for orientation robust scene text detection[J/OL]. arXiv, 2017(2017-06-29). https://arxiv.org/abs/1706.09579.
[8]	Xia G S, Bai X, Ding J, et al. DOTA:A large-scale dataset for object detection in aerial images[C]// 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.IEEE, 2018:3974-3983.
[9]	王振华, 谭智联, 李静, 等. Re-YOLOX:利用Resizer改进的YOLOX近岸海域监测目标识别模型[J]. 自然资源遥感, 2023, 35(3):10-16.doi:10.6046/zrzyyg.2022425.
	Wang Z H, Tan Z L, Li J, et al. Re-YOLOX:A YOLOX model for identifying nearshore monitoring targets improved based on the Resizer model[J]. Remote Sensing for Natural Resources, 2023, 35(3):10-16.doi:10.6046/zrzyyg.2022425.
[10]	Xu Y, Fu M, Wang Q, et al. Gliding vertex on the horizontal bounding box for multi-oriented object detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, 43(4):1452-1459.
[11]	Pan X, Ren Y, Sheng K, et al. Dynamic refinement network for oriented and densely packed object detection[C]// 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).IEEE, 2020:11204-11213.
[12]	Ren S, He K, Girshick R, et al. Faster R-CNN:Towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(6):1137-1149.

[1]	邰佳怡, 慎利, 乔文凡, 周吾珍. 不同上下文比例对损毁建筑遥感场景图片样本集构建的影响[J]. 自然资源遥感, 2024, 36(3): 154-162.
[2]	易孜芳, 周磊磊, 骆检兰, 曹里. 高光谱反演耕地土壤质量评价元素含量方法研究[J]. 自然资源遥感, 2024, 36(3): 225-232.
[3]	赵彬如, 牛思文, 王力彦, 杨晓彤, 焦红波, 王子珂. 面向海岛海岸带区域的高分遥感影像智能化色彩增强方法[J]. 自然资源遥感, 2024, 36(2): 70-79.
[4]	陈笛, 彭秋志, 黄培依, 刘雅璇. 采用注意力机制与改进YOLOv5的光伏用地检测[J]. 自然资源遥感, 2023, 35(4): 90-95.
[5]	刁明光, 刘勇, 郭宁博, 李文吉, 江继康, 王云霄. 基于Mask R-CNN的遥感影像疏林地智能识别方法[J]. 自然资源遥感, 2023, 35(2): 97-104.
[6]	赵凌虎, 袁希平, 甘淑, 胡琳, 丘鸣语. 改进Deeplabv3+的高分辨率遥感影像道路提取模型[J]. 自然资源遥感, 2023, 35(1): 107-114.
[7]	沈骏翱, 马梦婷, 宋致远, 柳汀洲, 张微. 基于深度学习语义分割模型的高分辨率遥感图像水体提取[J]. 自然资源遥感, 2022, 34(4): 129-135.
[8]	吕雅楠, 朱红, 孟健, 崔成玲, 宋其淇. 面向高分辨率遥感影像车辆检测的深度学习模型综述及适应性研究[J]. 自然资源遥感, 2022, 34(4): 22-32.
[9]	唐文魁, 俞露, 周伟奇, 岳隽, 周正. 基于长时间序列遥感数据的深圳景观连通性动态变化研究[J]. 自然资源遥感, 2022, 34(3): 97-105.
[10]	程滔. 一种与遥感影像同步纠正的矢量地理信息采集方法[J]. 自然资源遥感, 2022, 34(3): 59-64.
[11]	苏赋, 于海鹏, 朱威西. 标签聚类损失在遥感影像分类中的应用[J]. 自然资源遥感, 2022, 34(2): 144-151.
[12]	薛白, 王懿哲, 刘书含, 岳明宇, 王艺颖, 赵世湖. 基于孪生注意力网络的高分辨率遥感影像变化检测[J]. 自然资源遥感, 2022, 34(1): 61-66.
[13]	宋仁波, 朱瑜馨, 郭仁杰, 赵鹏飞, 赵珂馨, 朱洁, 陈颖. 基于多源数据集成的城市建筑物三维建模方法[J]. 自然资源遥感, 2022, 34(1): 93-105.
[14]	王译著, 黄亮, 陈朋弟, 李文国, 余晓娜. 联合显著性和多方法差异影像融合的遥感影像变化检测[J]. 自然资源遥感, 2021, 33(3): 89-96.
[15]	桑潇, 张成业, 李军, 朱守杰, 邢江河, 王金阳, 王兴娟, 李佳瑶, 杨颖. 煤炭开采背景下的伊金霍洛旗土地利用变化强度分析[J]. 自然资源遥感, 2021, 33(3): 148-155.

Viewed

Full text

Abstract

Cited

Shared

Discussed