面向复杂背景无人机影像的水体分割网络

汪博雅; 曾微波; 张乾坤; 姚睿涵

doi:10.6046/zrzyyg.2025176

面向复杂背景无人机影像的水体分割网络

Water body segmentation network for unmanned aerial vehicle images with complex backgrounds

摘要

摘要: 针对无人机影像水体提取存在的遮挡干扰、混合像元误判及细小水体漏分等问题，提出一种融合残差结构与注意力机制的高精度水体分割网络: ①构建以ResNet50为骨干的深层编码器，通过残差连接提升语义特征表达能力; ②在跳跃连接中引入通道-空间双维度注意力机制，实现特征动态校准，通道维度重标定水体显著性，空间维度聚焦水体边界敏感区域; ③设计Focal-Dice混合损失函数，通过挖掘难例样本，降低细小水体边界重叠度，实现类别不平衡与空间结构信息的联合优化。基于沙河集水库邻近村庄的无人机影像及成都市中心城区开源数据集，综合4个主流模型FCN，SegNet，DeepLabV3+以及经典U-Net进行对比实验，结果表明: 通过定性分析与定量评估，该文网络均优于以上4种方法，精确率、召回率分别达98.29%，97.00%，可为无人机遥感影像水体提取任务提供兼顾精度、效率与鲁棒性的解决方案。

Abstract: Extracting information on water bodies from unmanned aerial vehicle (UAV) images faces challenges, such as occlusion-induced interference, misclassification caused by mixed pixels, and omission of tiny water bodies. To address these issues, this study proposed a high-precision water body segmentation network that integrates the residual structure with an attention mechanism. First, a deep encoder with ResNet50 (i.e., a residual network with 50 layers) as the architecture was constructed to enhance semantic feature representation through residual connections. Second, a channel-spatial dual-dimensional attention mechanism was introduced into skip connections to achieve dynamic feature calibration. The channel attention reweighed the water body saliency, while the spatial attention focused on areas sensitive to water body boundaries. Third, a hybrid Focal-Dice loss function was designed to reduce boundary overlaps of tiny water bodies through hard sample mining, thereby achieving co-optimization of class imbalance and spatial structural information. The proposed network was compared with four mainstream models, i.e., fully convolutional network (FCN), SegNet, DeepLabV3+, and classic U-Net. The results from qualitative analysis and quantitative evaluation demonstrate that the proposed network outperformed all the models, yielding a precision of 98.29% and a recall of 97.00%. Therefore, the proposed network can provide a novel solution that balances accuracy, efficiency, and robustness for water body information extraction from UAV remote sensing images.

HTML全文

参考文献(35)

施引文献

资源附件(0)