基于动态增强型的DE-DeepLabV3+多光谱水体分割研究

李琦; 郭中华; 李小军; 班博; 陈佳丽

doi:10.6046/zrzyyg.2025171

基于动态增强型的DE-DeepLabV3+多光谱水体分割研究

Multispectral image segmentation for water bodies based on dynamically enhanced DeepLabV3+

摘要

摘要: 针对水体分割任务中存在的多尺度适应性差、边缘细节丢失及复杂背景干扰等问题，该文提出了一种以MobileNetV2为骨干网络的动态增强型DeepLabV3+(dynamically enhanced DeepLabV3+，DE-DeepLabV3+)模型。首先，针对空洞空间金字塔池化(atrous spatial pyramid pooling，ASPP)模块固定膨胀率的局限性，设计了动态多粒度上下文模块(dynamic multi-granularity context，DMGC)，通过自适应特征融合机制实现全局、区域与局部特征的多尺度动态融合，提升了模型对不同尺度水体目标的适应能力; 其次，在解码阶段提出了动态感受野上采样模块(dynamic receptive field upsampling，DRFU)，该模块结合多分支空洞卷积和高效的亚像素卷积上采样技术，解决了传统模型在边缘细节丢失方面的问题; 最后，该文设计了一种多尺度边缘感知加权损失函数(multi-scale edge-aware weighted loss，MEWL)，旨在全面提升模型对水体目标的识别精度与边界分割效果，同时有效缓解因类别不均衡带来的训练偏差。基于哨兵2号遥感影像构建的宁夏黄河流域水体分割数据集进行实验，结果表明，该文提出的优化模型相较于基准DeepLabV3+模型在性能上有显著提升。总体而言，平均交并比、平均召回率和准确率分别达到92.78%，96.31%和97.13%。在水体分割任务中效果尤为显著，相比基准网络交并比和召回率分别提升了5.13百分点和5.4百分点。该研究的优化模型在复杂水陆交界区域以及小支流分割任务中表现突出，展现出其在复杂水域场景中的卓越分割能力。

Abstract: The image segmentation of water bodies faces a range of challenges, including limited multi-scale adaptability, loss of edge details, and interference from complex backgrounds. To address these issues, this study proposed a dynamically enhanced DeepLabV3+ (DE-DeepLabV3+) model with MobileNetV2 as the backbone network. Specifically, to overcome the limitation caused by the fixed dilation rate of the atrous spatial pyramid pooling (ASPP) module, a dynamic multi-granularity context (DMGC) module was designed. Using an adaptive feature fusion mechanism, the DMGC module enabled multi-scale dynamic fusion of global, regional, and local features, thereby enhancing the adaptability of the DE-DeepLabV3+ model to water body targets at various scales. A dynamic receptive field upsampling (DRFU) module was proposed for the decoding stage, avoiding edge detail loss inherent in conventional models by combining multi-branch atrous convolution and efficient sub-pixel convolution upsampling techniques. Furthermore, a multi-scale edge-aware weighted loss (MEWL) function was designed, aiming to comprehensively improve the model's identification accuracy and boundary segmentation effectiveness for water body targets while also effectively reducing the training errors caused by class imbalance. In this study, all experiments were conducted on a water body segmentation dataset for the Yellow River Basin within Ningxia Hui Autonomous Region, which was constructed using remote sensing images from Sentinel-2. The experimental results indicate that the DE-DeepLabV3+ model significantly outperformed the baseline DeepLabV3+ model. Overall, the optimized model increased the mean intersection over union (mIoU) by 92.78% and exhibited mean recall and accuracy of 96.31% and 97.13%, respectively. Notably, this model delivered significantly high performance in the image segmentation of water bodies, increasing mIoU and recall by 5.13 percentage points and 5.4 percentage points, respectively, compared to its baseline. The DE-DeepLabV3+ model performs well when applied to image segmentation of complex water-land transition zones and small tributaries, exhibiting a remarkable segmentation ability in complex water scenarios.

HTML全文

参考文献(33)

施引文献

资源附件(0)