基于混合注意力机制和Deeplabv3+的遥感影像建筑物提取方法

doi:10.6046/zrzyyg.2023295

摘要
图/表
参考文献
相关文章
Metrics

全文: PDF(2344 KB)

HTML
输出: BibTeX | EndNote (RIS)

摘要

在大量且复杂的遥感影像中提取建筑物信息是遥感智能应用的重要研究内容之一。针对复杂环境下的遥感影像建筑物提取不精准及小型建筑物易被忽略等问题,文章提出了一种基于混合注意力机制和Deeplabv3+的遥感影像语义分割算法——SC-deep网络。该网络采用编码-解码结构,利用主干残差注意力网络提取深层特征和浅层特征,通过空洞空间金字塔池化模块和通道空间注意力模块聚合遥感影像的空间和通道信息权重,有效利用了遥感影像建筑物的多尺度信息,从而减少影像细节在训练中的损失。实验结果表明,所提方法在Aerial imagery dataset数据集上的分割结果均优于其他主流分割网络,能够有效识别并提取复杂建筑物边缘和小型建筑物,表现出更优异的建筑物提取性能。

	服务

	把本文推荐给朋友
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	刘晨晨
	葛小三
	武永斌
	余海坤
	张蓓蓓

关键词 ：多尺度信息, 建筑物提取, 语义分割, 注意力机制, 空洞卷积

Abstract：

Extracting information about buildings from a large and complex set of remote sensing images has always been a hot research topic in the intelligent applications of remote sensing. To address issues such as inaccurate information extraction of buildings and the tendency to ignore small buildings within a complex environment in remote sensing images, this study proposed the SC-deep network-a semantic segmentation algorithm for remote sensing images based on a hybrid attention mechanism and Deeplabv3+. Utilizing an encoder-decoder structure, this network employs a backbone residual attention network to extract deep- and shallow-layer features. Meanwhile, this network aggregates the spatial and channel information weights in remote sensing images using a dilated space pyramid pool module and a channel-space attention module. These allow for effectively utilizing the multi-scale information of building structures in remote sensing images, thereby reducing the loss of image details during training. The experimental results indicate that the proposed method outperforms other mainstream segmentation networks on the Aerial imagery dataset. Overall, this method can effectively identify and extract the edges of complex buildings and small structures, exhibiting superior building extraction performance.

Key words： multi-scale information building extraction semantic segmentation attention mechanisms dilated convolution

收稿日期: 2023-09-22 出版日期: 2025-02-17

ZTFLH:

TP79

基金资助:国家自然科学基金项目“面向矿区地理协同设计的空间信息语义服务模式研究”(41572341);河南省自然科学基金项目“深度学习支持下的灾损建筑物提取与检测研究”(222300420450);河南省高等教育教学改革研究与实践项目(学位与研究生教育)“面向学科前沿的研究生创新能力提升路径研究与实践”(2021SJGLX100Y)

通讯作者: 葛小三(1971-),男,博士,教授,主要从事时空数据智能处理与分析和地理信息服务方面的研究。Email: gexiaosan@163.com。

作者简介: 刘晨晨(1998-),男,硕士研究生,主要从事遥感影像处理与应用方面的研究。Email: 18203233036@163.com。

引用本文:

刘晨晨, 葛小三, 武永斌, 余海坤, 张蓓蓓. 基于混合注意力机制和Deeplabv3+的遥感影像建筑物提取方法[J]. 自然资源遥感, 2025, 37(1): 31-37.
LIU Chenchen, GE Xiaosan, WU Yongbin, YU Haikun, ZHANG Beibei. A method for information extraction of buildings from remote sensing images based on hybrid attention mechanism and Deeplabv3+. Remote Sensing for Natural Resources, 2025, 37(1): 31-37.

链接本文:

https://www.gtzyyg.com/CN/10.6046/zrzyyg.2023295 或 https://www.gtzyyg.com/CN/Y2025/V37/I1/31

Fig.1 注意力残差模块

Tab.1 CAM-Resnet50网络结构

Fig.2 CAM模块

Fig.3 SAM模块

Fig.4 SC-deep网络

Fig.5 图像预处理结果

Tab.2 实验环境配置

Tab.3 主干网络消融实验结果

Tab.4 注意力模块消融实验结果

Tab.5 对比实验分割可视化结果

Tab.6 对比实验结果

[1]	胡明洪, 李佳田, 姚彦吉, 等. 结合多路径的高分辨率遥感影像建筑物提取SER-UNet算法[J]. 测绘学报, 2023, 52(5):808-817. doi: 10.11947/j.AGCS.2023.20210691
	Hu M H, Li J T, Yao Y J, et al. SER-UNet algorithm for building extraction from high-resolution remote sensing image combined with multipath[J]. Acta Geodaetica et Cartographica Sinica, 2023, 52(5):808-817. doi: 10.11947/j.AGCS.2023.20210691
[2]	吴炜, 骆剑承, 沈占锋, 等. 光谱和形状特征相结合的高分辨率遥感图像的建筑物提取方法[J]. 武汉大学学报(信息科学版), 2012, 37(7):800-805.
	Wu W, Luo J C, Shen Z F, et al. Building extraction from high resolution remote sensing imagery based on spatial-spectral method[J]. Geomatics and Information Science of Wuhan University, 2012, 37(7):800-805.
[3]	贾士军, 王昆. 融合颜色和纹理特征的彩色图像分割[J]. 测绘科学, 2014, 39(12):138-142,147.
	Jia S J, Wang K. Color image segmentation by integrating color and texture features[J]. Science of Surveying and Mapping, 2014, 39(12):138-142,147.
[4]	Lagunas E, Amin M G, Ahmad F, et al. Pattern matching for building feature extraction[J]. IEEE Geoscience and Remote Sensing Letters, 2014, 11(12):2193-2197.
[5]	Gong J, Ji S. Photogrammetry and deep learning[J]. Journal of Geodesy and Geoinformation Science, 2018(1):1-15. doi: 10.11947/j.JGGS.2018.0101
[6]	Shelhamer E, Long J, Darrell T. Fully convolutional networks for semantic segmentation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(4):640-651. doi: 10.1109/TPAMI.2016.2572683 pmid: 27244717
[7]	Ronneberger O, Fischer P, Brox T. U-net:Convolutional networks for biomedical image segmentation[C]// International Conference on Medical Image Computing and Computer-Assisted Intervention. Cham:Springer,2015:234-241.
[8]	Zhuo Z W, Tajbakhsh N, Liang J M, et al. Unet++:A nested U-Net architecture for medical image segmentation[EB/OL].(2018-09-20).[2022-05-20].https://arxiv.org/abs/1807.10165.
[9]	Badrinarayanan V, Kendall A, Cipolla R. SegNet:A deep convolutional encoder-decoder architecture for image segmentation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(12):2481-2495. doi: 10.1109/TPAMI.2016.2644615 pmid: 28060704
[10]	Zhao H, Shi J, Qi X, et al. Pyramid scene parsing network[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Honolulu,HI, USA.IEEE,2017:6230-6239.
[11]	Chen L C, Papandreou G, Kokkinos I, et al. DeepLab:Semantic image segmentation with deep convolutional nets,atrous convolution,and fully connected CRFs[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018, 40(4):834-848.
[12]	季顺平, 魏世清. 遥感影像建筑物提取的卷积神经元网络与开源数据集方法[J]. 测绘学报, 2019, 48(4):448-459. doi: 10.11947/j.AGCS.2019.20180206
	Ji S P, Wei S Q. Building extraction via convolutional neural networks from an open remote sensing building dataset[J]. Acta Geodaetica et Cartographica Sinica, 2019, 48(4):448-459. doi: 10.11947/j.AGCS.2019.20180206
[13]	Yang H, Wu P, Yao X, et al. Building extraction in very high resolution imagery by dense-attention networks[J]. Remote Sensing, 2018, 10(11):1768.
[14]	赵凌虎, 袁希平, 甘淑, 等. 改进Deeplabv3+的高分辨率遥感影像道路提取模型[J]. 自然资源遥感, 2023, 35(1):107-114.doi:10.6046/zrzyyg.2021460.
	Zhao L H, Yuan X P, Gan S, et al. An information extraction model of roads from high-resolution remote sensing images based on improved Deeplabv3+[J]. Remote Sensing for Natural Resources, 2023, 35(1):107-114.doi:10.6046/zrzyyg.2021460.
[15]	Xia L, Mi S, Zhang J, et al. Dual-stream feature extraction network based on CNN and transformer for building extraction[J]. Remote Sensing, 2023, 15(10):2689.
[16]	郭文, 张荞. 基于注意力增强全卷积神经网络的高分卫星影像建筑物提取[J]. 国土资源遥感, 2021, 33(2):100-107.doi:10.6046/gtzyyg.2020230.
	Guo W, Zhang Q. Building extraction using high-resolution satellite imagery based on an attention enhanced full convolution neural network[J]. Remote Sensing for Land and Resources, 2021, 33(2):100-107.doi:10.6046/gtzyyg.2020230.
[17]	吕少云, 李佳田, 阿晓荟, 等. Res_ASPP_UNet++:结合分离卷积与空洞金字塔的遥感影像建筑物提取网络[J]. 遥感学报, 2023, 27(2):502-519.
	Lyu S Y, Li J T, A X H, et al. Res_ASPP_UNet++:Building an extraction network from remote sensing imagery combining depthwise separable convolution with atrous spatial pyramid pooling[J]. National Remote Sensing Bulletin, 2023, 27(2):502-519.
[18]	Chollet F. Xception:deep learning with depthwise separable convolutions[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Honolulu,HI,USA.IEEE,2017:1800-1807.
[19]	Woo S, Park J, Lee J Y, et al. Cbam:Convolutional block attention module[C]// Proceedings of the European conference on computer vision (ECCV).2018:3-19.

[1]	曲海成, 王莹, 刘腊梅, 郝明. 融合CNN与Transformer的遥感影像道路信息提取[J]. 自然资源遥感, 2025, 37(1): 38-45.
[2]	陈佳雪, 肖东升, 陈虹宇. 一种边界引导与跨尺度信息交互网络用于遥感影像水体提取[J]. 自然资源遥感, 2025, 37(1): 15-23.
[3]	郑宗生, 王政翰, 王振华, 卢鹏, 高萌, 霍志俊. 改进3D-Octave卷积的高光谱图像分类方法[J]. 自然资源遥感, 2024, 36(4): 82-91.
[4]	曲海成, 梁旭. 融合混合注意力机制与多尺度特征增强的高分影像建筑物提取[J]. 自然资源遥感, 2024, 36(4): 107-116.
[5]	潘俊杰, 慎利, 鄢薪, 聂欣, 董宽林. 一种基于对抗学习的高分辨率遥感影像语义分割无监督域自适应方法[J]. 自然资源遥感, 2024, 36(4): 149-157.
[6]	李世琦, 姚国清. 基于CNN与SETR的特征融合滑坡体检测[J]. 自然资源遥感, 2024, 36(4): 158-164.
[7]	苏腾飞. 深度卷积语义分割网络在农田遥感影像分类中的对比研究——以河套灌区为例[J]. 自然资源遥感, 2024, 36(4): 210-217.
[8]	罗维, 李修华, 覃火娟, 张木清, 王泽平, 蒋柱辉. 基于多源卫星遥感影像的广西中南部地区甘蔗识别及产量预测[J]. 自然资源遥感, 2024, 36(3): 248-258.
[9]	白石, 唐攀攀, 苗朝, 金彩凤, 赵博, 万昊明. 基于高分辨率遥感影像和改进U-Net模型的滑坡提取——以汶川地区为例[J]. 自然资源遥感, 2024, 36(3): 96-107.
[10]	邓丁柱. 基于深度学习的多源卫星遥感影像云检测方法[J]. 自然资源遥感, 2023, 35(4): 9-16.
[11]	陈笛, 彭秋志, 黄培依, 刘雅璇. 采用注意力机制与改进YOLOv5的光伏用地检测[J]. 自然资源遥感, 2023, 35(4): 90-95.
[12]	刘立, 董先敏, 刘娟. 顾及地学特征的遥感影像语义分割模型性能评价方法[J]. 自然资源遥感, 2023, 35(3): 80-87.
[13]	牛祥华, 黄微, 黄睿, 蒋斯立. 基于注意力特征融合的高保真遥感图像薄云去除[J]. 自然资源遥感, 2023, 35(3): 116-123.
[14]	林佳惠, 刘广, 范景辉, 赵红丽, 白世彪, 潘宏宇. 联合改进U-Net模型和D-InSAR技术采矿沉陷提取方法[J]. 自然资源遥感, 2023, 35(3): 145-152.
[15]	郑宗生, 刘海霞, 王振华, 卢鹏, 沈绪坤, 唐鹏飞. 改进3D-CNN的高光谱图像地物分类方法[J]. 自然资源遥感, 2023, 35(2): 105-111.

Viewed

Full text

Abstract

Cited

Shared

Discussed