A method for information extraction of buildings from remote sensing images based on hybrid attention mechanism and Deeplabv3+
LIU Chenchen1(), GE Xiaosan1(), WU Yongbin1,2, YU Haikun3, ZHANG Beibei3
1. School of Surveying and Land Information Engineering, Henan Polytechnic University, Jiaozuo 454003, China 2. Henan Surveying and Mapping Geographic Information Technology Center, Zhengzhou 450003, China 3. Henan Remote Sensing Institute, Zhengzhou 450003, China
Extracting information about buildings from a large and complex set of remote sensing images has always been a hot research topic in the intelligent applications of remote sensing. To address issues such as inaccurate information extraction of buildings and the tendency to ignore small buildings within a complex environment in remote sensing images, this study proposed the SC-deep network-a semantic segmentation algorithm for remote sensing images based on a hybrid attention mechanism and Deeplabv3+. Utilizing an encoder-decoder structure, this network employs a backbone residual attention network to extract deep- and shallow-layer features. Meanwhile, this network aggregates the spatial and channel information weights in remote sensing images using a dilated space pyramid pool module and a channel-space attention module. These allow for effectively utilizing the multi-scale information of building structures in remote sensing images, thereby reducing the loss of image details during training. The experimental results indicate that the proposed method outperforms other mainstream segmentation networks on the Aerial imagery dataset. Overall, this method can effectively identify and extract the edges of complex buildings and small structures, exhibiting superior building extraction performance.
刘晨晨, 葛小三, 武永斌, 余海坤, 张蓓蓓. 基于混合注意力机制和Deeplabv3+的遥感影像建筑物提取方法[J]. 自然资源遥感, 2025, 37(1): 31-37.
LIU Chenchen, GE Xiaosan, WU Yongbin, YU Haikun, ZHANG Beibei. A method for information extraction of buildings from remote sensing images based on hybrid attention mechanism and Deeplabv3+. Remote Sensing for Natural Resources, 2025, 37(1): 31-37.
Hu M H, Li J T, Yao Y J, et al. SER-UNet algorithm for building extraction from high-resolution remote sensing image combined with multipath[J]. Acta Geodaetica et Cartographica Sinica, 2023, 52(5):808-817.
doi: 10.11947/j.AGCS.2023.20210691
Wu W, Luo J C, Shen Z F, et al. Building extraction from high resolution remote sensing imagery based on spatial-spectral method[J]. Geomatics and Information Science of Wuhan University, 2012, 37(7):800-805.
Jia S J, Wang K. Color image segmentation by integrating color and texture features[J]. Science of Surveying and Mapping, 2014, 39(12):138-142,147.
[4]
Lagunas E, Amin M G, Ahmad F, et al. Pattern matching for building feature extraction[J]. IEEE Geoscience and Remote Sensing Letters, 2014, 11(12):2193-2197.
[5]
Gong J, Ji S. Photogrammetry and deep learning[J]. Journal of Geodesy and Geoinformation Science, 2018(1):1-15.
doi: 10.11947/j.JGGS.2018.0101
[6]
Shelhamer E, Long J, Darrell T. Fully convolutional networks for semantic segmentation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(4):640-651.
doi: 10.1109/TPAMI.2016.2572683
pmid: 27244717
[7]
Ronneberger O, Fischer P, Brox T. U-net:Convolutional networks for biomedical image segmentation[C]// International Conference on Medical Image Computing and Computer-Assisted Intervention. Cham:Springer,2015:234-241.
[8]
Zhuo Z W, Tajbakhsh N, Liang J M, et al. Unet++:A nested U-Net architecture for medical image segmentation[EB/OL].(2018-09-20).[2022-05-20].https://arxiv.org/abs/1807.10165.
[9]
Badrinarayanan V, Kendall A, Cipolla R. SegNet:A deep convolutional encoder-decoder architecture for image segmentation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(12):2481-2495.
doi: 10.1109/TPAMI.2016.2644615
pmid: 28060704
[10]
Zhao H, Shi J, Qi X, et al. Pyramid scene parsing network[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Honolulu,HI, USA.IEEE,2017:6230-6239.
[11]
Chen L C, Papandreou G, Kokkinos I, et al. DeepLab:Semantic image segmentation with deep convolutional nets,atrous convolution,and fully connected CRFs[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018, 40(4):834-848.
Ji S P, Wei S Q. Building extraction via convolutional neural networks from an open remote sensing building dataset[J]. Acta Geodaetica et Cartographica Sinica, 2019, 48(4):448-459.
doi: 10.11947/j.AGCS.2019.20180206
[13]
Yang H, Wu P, Yao X, et al. Building extraction in very high resolution imagery by dense-attention networks[J]. Remote Sensing, 2018, 10(11):1768.
Zhao L H, Yuan X P, Gan S, et al. An information extraction model of roads from high-resolution remote sensing images based on improved Deeplabv3+[J]. Remote Sensing for Natural Resources, 2023, 35(1):107-114.doi:10.6046/zrzyyg.2021460.
[15]
Xia L, Mi S, Zhang J, et al. Dual-stream feature extraction network based on CNN and transformer for building extraction[J]. Remote Sensing, 2023, 15(10):2689.
Guo W, Zhang Q. Building extraction using high-resolution satellite imagery based on an attention enhanced full convolution neural network[J]. Remote Sensing for Land and Resources, 2021, 33(2):100-107.doi:10.6046/gtzyyg.2020230.
Lyu S Y, Li J T, A X H, et al. Res_ASPP_UNet++:Building an extraction network from remote sensing imagery combining depthwise separable convolution with atrous spatial pyramid pooling[J]. National Remote Sensing Bulletin, 2023, 27(2):502-519.
[18]
Chollet F. Xception:deep learning with depthwise separable convolutions[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Honolulu,HI,USA.IEEE,2017:1800-1807.
[19]
Woo S, Park J, Lee J Y, et al. Cbam:Convolutional block attention module[C]// Proceedings of the European conference on computer vision (ECCV).2018:3-19.