自然资源遥感, 2024, 36(3): 216-224 doi: 10.6046/zrzyyg.2023094

技术应用

深度语义分割网络无人机遥感松材线虫病变色木识别

张瑞瑞,1,2,3, 夏浪1,2,3, 陈立平,1,2,3, 丁晨琛1,2,3, 郑爱春4, 胡新苗4, 伊铜川1,2,3, 陈梅香1,2,3, 陈天恩1,2,3,5

1.北京市农林科学院智能装备技术研究中心,北京 100097

2.国家农业智能装备工程技术研究中心,北京 100097

3.国家农业航空应用技术国际联合研究中心,北京 100097

4.南京市浦口区林业站,南京 211899

5.农芯(南京)智慧农业研究院有限公司,南京 211899

Identifying discolored trees infected with pine wilt disease using DSSN-based UAV remote sensing

ZHANG Ruirui,1,2,3, XIA Lang1,2,3, CHEN Liping,1,2,3, DING Chenchen1,2,3, ZHENG Aichun4, HU Xinmiao4, YI Tongchuan1,2,3, CHEN Meixiang1,2,3, CHEN Tianen1,2,3,5

1. Beijing Research Center of Intelligent Equipment for Agriculture, Beijing Academy of Agricultural and Forestry Sciences, Beijing 100097, China

2. National Research Center of Intelligent Equipment for Agriculture, Beijing Academy of Agricultural and Forestry Sciences, Beijing 100097, China

3. National Center for International Research on Agricultural Aerial Application Technology, Beijing Academy of Agricultural and Forestry Sciences, Beijing 100097, China

4. Nanjing Pukou District Forestry Station, Nanjing 211899, China

5. Nongxin (Nanjing) Intelligent Agricultural Research Institute Co., Ltd., Nanjing 211899, China

通讯作者: 陈立平(1973-),女,博士,研究员,主要研究方向为农业智能装备技术及应用研究。Email:chenlp@nercita.org.cn

责任编辑: 陈昊旻

收稿日期: 2023-04-18   修回日期: 2023-09-22  

基金资助: 国家重点研发计划项目“松材线虫病灾变机制与可持续防控技术”(2021YFD1400900)
南京市企业院士工作站关键核心技术攻关项目“林业松材线虫病害智慧防控系统研发与应用”
北京市农林科学院创新能力建设项目“农林重大虫害监测预警智能化平台研究与开发”(KJCX20230205)

Received: 2023-04-18   Revised: 2023-09-22  

作者简介 About authors

张瑞瑞(1983-),男,博士,研究员,主要研究方向为农林航空应用技术研究。Email: zhangrr@nercita.org.cn

摘要

松材线虫病是危害我国林业资源的主要病害,研究深度语义分割网络无人机遥感技术可提高松材线虫病变色木识别准确率,为提升和保护林业资源质量提供技术支撑。该文以青岛崂山松林为研究区,通过固定翼无人机航拍获取区域无人机松材线虫病疑似变色木影像,以全卷积网络(fully convolutional networks,FCN),U-Net,DeepLabV3+和OCNet 4种深度语义分割模型为研究对象,选用召回率(Recall)、精确率(Precision)、交并比(intersection over union,IoU)和F1值评估各模型分割精度。航拍飞行获得2 688张无人机影像,通过手动标记和样本扩增生成训练样本28 800个。4种网络均能够较好地识别松材线虫病变色木,无显著误报,并且深度语义模型对颜色相近的地物,如岩石、黄色裸土等有较好的辨别结果。总体上,DeepLabV3+具有最高的变色木分割精度,IoU与F1值分别为0.711和0.829; FCN模型分割精度最低,IoU与F1值分别为0.699和0.812; DeepLabV3+训练耗时最低,达到27.2 ms/幅; FCN预测耗时最低,达到7.2 ms/幅,但分割变色木的边缘精度最低。以ResNet50,ResNet101和ResNet152这3种网络为前端特征提取网络构建的DeepLabV3+模型,其变色木识别IoU值分别为0.711,0.702和0.702,F1值分别为0.829,0.822和0.820。DeepLabV3+比DeepLabV3网络具有更高的变色木识别精度,DeepLabV3网络变色木识别的IoU和F1值分别为0.701和0.812。DeepLabV3+模型在测试数据中具有最高变色木识别精度,特征提取网络ResNet网络深度对变色木识别精度影响较小。DeepLabV3+引入的编码和解码结构能够显著改进DeepLabV3分割精度,同时可获得详细的分割边缘,更有利于松材线虫病变色木识别。

关键词: 无人机遥感; 变色木; 深度学习

Abstract

Pine wilt disease (PWD) is identified as a major disease endangering the forest resources in China. Investigating deep semantic segmentation network (DSSN)-based unmanned aerial vehicle (UAV) remote sensing identification can improve the identification accuracy of discolored trees infected with PWD and provide technical support for the enhancement and protection of forest resource quality. Focusing on the pine forest in Laoshan Mountain in Qingdao, this study obtained images of suspected discolored trees through aerial photography using a fixed-wing UAV. It examined four deep semantic segmentation models, namely the fully convolutional network (FCN), U-Net, DeepLabV3+, and the object context network (OCNet), and assessed their segmentation accuracies using recall, precision, IoU, and F1 score. Based on the 2 688 images acquired, 28 800 training samples were obtained through manual labeling and sample amplification. The results indicate that the four models can effectively identify the discolored trees infected with PWD, with no significant false alarms. Furthermore, these deep learning models efficiently distinguished between surface features with similar colors, such as rocks and yellow bare soils. Generally, DeepLabV3+ outperformed the remaining three models, with an IoU of 0.711 and an F1 score of 0.829. In contrast, the FCN model exhibited the lowest segmentation accuracy, with an IoU of 0.699 and an F1 score of 0.812. DeepLabV3+ required the least training time, merely 27.2 ms per image, while FCN was the least time-consuming in prediction, with only 7.2 ms needed per image; however, FCN exhibited the lowest edge segmentation accuracy for discolored trees. Three DeepLabV3+ models constructed using ResNet50, ResNet101, and ResNet152 as front-end feature extraction networks exhibited IoUs of 0.711, 0.702, and 0.702 and F1 scores of 0.829, 0.822, and 0.820, respectively. DeepLabV3+ surpassed DeepLabV3 in the identification accuracy of discolored trees, with the latter showing an IoU of 0.701 and an F1 score of 0.812. The test data revealed that DeepLabV3+ exhibited the highest identification accuracy for discolored trees, while the depth of the ResNet feature extraction network produced minor impacts on the identification accuracy. The encoding and decoding structures introduced by DeepLabV3+ can significantly improve the segmentation accuracy of DeepLabV3, yielding more detailed edges. Therefore, DeepLabV3+ is more favorable for the identification of discolored trees infected with PWD.

Keywords: UAV remote sensing; discolored tree; deep learning


本文引用格式

张瑞瑞, 夏浪, 陈立平, 丁晨琛, 郑爱春, 胡新苗, 伊铜川, 陈梅香, 陈天恩. 深度语义分割网络无人机遥感松材线虫病变色木识别[J]. 自然资源遥感, 2024, 36(3): 216-224 doi:10.6046/zrzyyg.2023094

ZHANG Ruirui, XIA Lang, CHEN Liping, DING Chenchen, ZHENG Aichun, HU Xinmiao, YI Tongchuan, CHEN Meixiang, CHEN Tianen. Identifying discolored trees infected with pine wilt disease using DSSN-based UAV remote sensing[J]. Remote Sensing for Natural Resources, 2024, 36(3): 216-224 doi:10.6046/zrzyyg.2023094

0 引言

松材线虫病是由松材线虫引起的一种毁灭性松树病害,由于尚无有效的治疗手段,染病松树会在1~3个月内枯死,造成大面积的松林毁坏,破坏生态环境并导致大量林业经济效益损失[1-3]。截至2020年,松材线虫病已在我国18个省680个县级行政区发生,累计致死松树超过6亿株,危害区域已由我国南方扩散至辽宁大连,直接威胁我国近9亿亩(1亩≈666.67 m²)松林资源安全[4-5]。

松材线虫对松树的致病机理尚不明确,暂无有效救治手段,因此,对受害疫木的快速识别与砍伐清除是防止疫情扩散的关键[1]。人工地面调查是一种有效的疫木调查方法,但耗时耗力,且受地势影响显著,在地势险要区域难以开展调查工作。高分辨率卫星遥感数据能够提供区域松林受害状况[6],但受制于较低的空间分辨率,难以识别单株受害疫木[7]。

当前,基于无人机高分辨率遥感影像并结合计算机自动识别算法,是松材线虫病疑似变色木(以下简称松材线虫病变色木)快速识别的主要手段,也是主要的研究方向之一。例如,吴琼[8]基于无人机遥感获取的数据源,针对颜色和纹理特征开展松材线虫病变色木识别研究; 曾全等[9]评估了不同无人机飞行高度下变色松树的识别效率,并采用归一化植被指数监测变色松树,取得85.7%的精度; Iordache等[10]基于机载光谱成像设备获取松树的高光谱数据,采用随机森林算法识别染病松树; Syifa等[11]使用无人机采集真彩色图像,通过人工神经网络和支持向量机识别松材线虫病变色木,模型准确率分别为86.59%和79.33%。

基于深度卷积人工神经网络的深度学习技术在图像分割和目标检测领域取得巨大突破,显著提升了计算机视觉与图像领域的识别精度[12-13]。目前基于深度学习技术的目标检测和语义分割模型已用于松材线虫病识别,例如,徐信罗等[3]使用Faster R-CNN模型开展变色木识别与定位研究; 金远航等[14]基于YOLOv4网络和改进的前端特征提取网络构成新的网络模型,相比于YOLOv4-tiny,YOLOv4和SSD算法,精度分别提升9.58%,12.57%和10.54%,能够较好地实现对枯死树木的检测。目标检测模型较语义分割模型运行效率高,但目标检测模型仅提供变色木的大致空间位置,无法基于此信息进一步获取变色木的详细形态信息,例如变色木大小、形态等。上述形态信息对于后续评估染病松树的枯死阶段,确定受感染的松树冠层大小等至关重要。语义分割模型可提供逐像素的变色木分割结果,能够基于分割结果进一步提取变色木的详细形态信息,但当前仅有少量研究使用语义分割模型开展变色木分割研究[7],主流的语义分割模型用于松材线虫病变色木识别的精度尚待研究。因此,本研究获取区域的无人机遥感影像,人工目视解译标注训练样本,研究当前主流的深度语义分割网络对松材线虫病变色木的识别精度,期望有助于当前松材线虫病变色木防控工作的实施。

1 研究区概况与数据源

1.1 研究区概况

研究区域位于青岛市即墨区西南区域(A1)与青岛市崂山区北部(A2),如图1所示。研究区西部是青岛市城市建成区,东部临海,平均海拔360 m,最高海拔1 132.7 m,年平均温度14.2~15.0 ℃,年均降水约650 mm,植被覆盖以常绿针叶林为主,主导树种为黑松和赤松。采集变色木影像试验区域位于即墨区西南区域(A1)和崂山区北部(A2),其中A1区域面积25.26 km²,A2区域面积33.97 km²。

图1

图1   研究区地理位置概况

Fig.1   Location of the study area


1.2 数据源及样本获取

航拍使用DB-II固定翼无人机,相机设备采用Sony Alpha 7R II相机,采集影像尺寸为7 952像素×4 472像素。航拍分2个架次完成,航线设计如图1所示,A1区域航拍时间为2018年10月9日,A2区域航拍时间为2018年10月6日。无人机飞行高度700 m,飞行速度100 km/h,航向重叠度不低于75%,旁向重叠度不低于50%,采集影像地面分辨率为7 cm。飞行作业期间,天气状况良好,晴空、微风、无云雾干扰,航拍飞行获得A1区域1 241张影像,A2区域1 447张影像。

航拍飞行设置较大重叠率,获取的无人机影像间重叠大,并且大部分影像不含有变色木,为此本研究选择航拍区域中变色木较多的影像开展目视解译。具体选择36幅无人机影像,使用ENVI软件的感兴趣区域工具完成对变色木的标记,如图2(a)(b)所示。本文生成训练样本的流程如下: 首先将无人机图像像元值归一化至0~1; 然后分别在尺寸为7 952像素×4 472像素的无人机图像中随机截取200张尺寸为256像素×256像素的图像,每张截取图像均包含变色木像元; 最后将无人机图像旋转5°,10°和15°,并对每个旋转图像随机剪切获得200个尺寸为256像素×256像素的图像。完成上述操作后共获得训练样本28 800个,其中60%样本用于模型训练,20%用于模型验证,20%用于模型精度测试,部分样本数据如图2(c)—(l)所示。
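下述Python代码给出按上述流程生成训练样本的一个简化示意(基于OpenCV与NumPy; 文件路径、函数名与参数组织均为示意性假设,并非原文所用代码):

```python
import cv2
import numpy as np

def generate_samples(image_path, label_path, n_crops=200, size=256,
                     angles=(0, 5, 10, 15), seed=0):
    """按文中流程生成训练样本: 像元值归一化至0~1, 随机裁剪256×256子图,
    并对旋转5°,10°,15°后的影像重复裁剪(假定原图中含有变色木像元)。"""
    rng = np.random.default_rng(seed)
    image = cv2.imread(image_path).astype(np.float32) / 255.0   # 归一化至0~1
    label = cv2.imread(label_path, cv2.IMREAD_GRAYSCALE)        # 变色木掩码(0/1)

    samples = []
    for angle in angles:
        if angle == 0:
            img_r, lab_r = image, label
        else:  # 旋转影像与掩码, 掩码使用最近邻插值以保持0/1取值
            h, w = label.shape
            m = cv2.getRotationMatrix2D((w / 2, h / 2), angle, 1.0)
            img_r = cv2.warpAffine(image, m, (w, h))
            lab_r = cv2.warpAffine(label, m, (w, h), flags=cv2.INTER_NEAREST)
        h, w = lab_r.shape
        count = 0
        while count < n_crops:
            y = rng.integers(0, h - size)
            x = rng.integers(0, w - size)
            lab_c = lab_r[y:y + size, x:x + size]
            if lab_c.sum() == 0:          # 保证每个样本均包含变色木像元
                continue
            samples.append((img_r[y:y + size, x:x + size], lab_c))
            count += 1
    return samples
```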

图2

图2   无人机影像及模型训练样本示例

Fig.2   UAV image and training samples


2 研究方法

2.1 深度语义分割模型

基于深度卷积神经网络的深度语义分割网络自2015年以来发展迅速,已形成一系列优秀的分割模型,如全卷积网络(fully convolutional networks,FCN)[15],PSPNet[16],SegNet[17],U-Net[18],Dense-ASPP[19]和DANet[20]等。根据模型设计的理念,可大致划分为4类: 语义分割模型的里程碑模型、对称结构的代表性模型、空洞卷积代表性模型和基于注意力机制的模型。具体的,本研究探究现阶段具有代表性的4种语义分割模型: 语义分割模型的里程碑模型FCN[15]、对称结构的代表性模型U-Net[18]、空洞卷积代表性模型DeepLabV3+[21]和基于注意力机制的OCNet[22]模型。

FCN是深度语义分割网络中的里程碑式分割模型,首次发表于2015年计算机视觉与模式识别会议(CVPR)。FCN的网络结构与深度分类网络VGG-Net[23]类似,其特点在于将分类模型的全连接层转换为卷积层,因此FCN是端到端、像素到像素的分割网络。FCN网络池化层pool5的输出通过上采样与另一池化层的输出融合,获得更精细的特征图。FCN通过融合不同分辨率的特征层生成不同分辨率的网络变体,例如FCN-32s,FCN-16s和FCN-8s。本研究选择具有最精细特征细节的FCN-8s作为测试模型。

U-Net是一种广泛使用的图像分割深度学习模型,最初在2015年国际医学图像计算和计算机辅助干预(medical image computing and computer-assisted intervention,MICCAI)会议发表,用于生物医学图像分割[18]。U-Net网络由下采样(编码)和上采样(解码)2个部分构成,形成“U”型架构。下采样用于提取图像特征,上采样用于恢复下采样学习获取的特征细节。具体地,U-Net网络下采样4次,累计降低图像分辨率16倍,再对称地上采样4次,将下采样生成的高级语义特征图恢复至输入图像分辨率。
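为说明U-Net“编码-解码+跳跃连接”的对称结构,下面给出一个大幅简化的PyTorch示意片段(仅含2次下采样/上采样,层数与通道数均为示意,并非本研究实际所用网络):

```python
import torch
import torch.nn as nn

def conv_block(in_ch, out_ch):
    # 两个3×3卷积 + ReLU, U-Net中典型的卷积单元
    return nn.Sequential(nn.Conv2d(in_ch, out_ch, 3, padding=1), nn.ReLU(),
                         nn.Conv2d(out_ch, out_ch, 3, padding=1), nn.ReLU())

class TinyUNet(nn.Module):
    """仅含2次下采样/上采样的简化U-Net, 用于说明编码-解码与跳跃连接。"""
    def __init__(self, in_ch=3, n_classes=1):
        super().__init__()
        self.enc1 = conv_block(in_ch, 32)
        self.enc2 = conv_block(32, 64)
        self.pool = nn.MaxPool2d(2)
        self.bottom = conv_block(64, 128)
        self.up2 = nn.ConvTranspose2d(128, 64, 2, stride=2)
        self.dec2 = conv_block(128, 64)
        self.up1 = nn.ConvTranspose2d(64, 32, 2, stride=2)
        self.dec1 = conv_block(64, 32)
        self.head = nn.Conv2d(32, n_classes, 1)

    def forward(self, x):
        e1 = self.enc1(x)                      # 高分辨率、低层级特征
        e2 = self.enc2(self.pool(e1))
        b = self.bottom(self.pool(e2))         # 最低分辨率的高级语义特征
        d2 = self.dec2(torch.cat([self.up2(b), e2], dim=1))   # 跳跃连接
        d1 = self.dec1(torch.cat([self.up1(d2), e1], dim=1))
        return self.head(d1)                   # 逐像素分割输出
```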

DeepLab是Google公司提出的一系列代表性深度语义分割模型。DeepLab运用空洞卷积提升模型的感受野,在不显著降低特征图分辨率的条件下获得深层抽象语义特征[21-24]。DeepLabV3模型[24]实现了增强的空洞空间金字塔池化(atrous spatial pyramid pooling,ASPP)模块,以检测多个尺度的卷积特征并获得全局上下文编码特征。DeepLabV3+模型在ASPP模块基础上增加了编码器-解码器架构,提高了模型性能,可获得更为精细的分割边界。
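为说明ASPP利用不同膨胀率的空洞卷积并行提取多尺度上下文特征的思路,下面给出一个简化的PyTorch示意实现(膨胀率与通道数为常见设置,并非原文实现):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SimpleASPP(nn.Module):
    """简化的ASPP模块示意: 并行的1×1卷积、多个不同膨胀率的3×3空洞卷积
    以及全局平均池化分支, 拼接后再经1×1卷积融合。"""
    def __init__(self, in_ch, out_ch=256, rates=(6, 12, 18)):
        super().__init__()
        self.branches = nn.ModuleList(
            [nn.Conv2d(in_ch, out_ch, 1, bias=False)] +
            [nn.Conv2d(in_ch, out_ch, 3, padding=r, dilation=r, bias=False)
             for r in rates])
        self.image_pool = nn.Sequential(
            nn.AdaptiveAvgPool2d(1), nn.Conv2d(in_ch, out_ch, 1, bias=False))
        self.project = nn.Conv2d(out_ch * (len(rates) + 2), out_ch, 1, bias=False)

    def forward(self, x):
        size = x.shape[-2:]
        feats = [branch(x) for branch in self.branches]          # 多尺度空洞卷积分支
        pooled = F.interpolate(self.image_pool(x), size=size,
                               mode="bilinear", align_corners=False)  # 全局上下文分支
        return self.project(torch.cat(feats + [pooled], dim=1))
```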

OCNet使用自注意力机制捕获上下文语义信息,将局部特征与全局特征融合,得到准确的多尺度特征[22]。具体地,OCNet在FCN模型架构基础上,在空间和通道维度上使用2种类型的注意力模块捕获和整合不同尺度的上下文语义信息流,构建更为稳定的语义特征。本研究所用模型的前端特征提取网络与详细信息如表1所示。

表1   模型参数表

Tab.1  Parameters of deep learning models

模型 | 来源 | 参数数量 | 前端 | 特征
FCN | Long等,2015[15] | 15 305 667 | Xception[25] | 里程碑分割网络
U-Net | Ronneberger等,2015[18] | 26 355 169 | ResNet[26] | 对称分割网络
DeepLabV3+ | Chen等,2018[21] | 74 982 817 | ResNet | 空洞卷积、ASPP
OCNet | Yuan等,2018[22] | 36 040 105 | ResNet | 注意力机制



2.2 损失函数

损失函数计算模型的输出和真值之间的差异,用于衡量模型性能,选择合理的损失函数对于获得准确的结果至关重要。当前几种损失函数在深度学习模型中广泛使用,例如,交叉熵和Dice损失[27]可用于样本分布均衡的数据集,Focal Loss用于样本不平衡的数据集[28]。在本研究中,变色木和其他地物的数量差异较大,为正确地训练模型,选择Focal Loss作为损失函数,公式为:

$FocalL(p) = -\alpha (1-p)^{\gamma} \log_2 p$

式中: FocalL(p)为损失值; α和γ为权重因子,α∈[0,1],γ∈[0,5]; p为模型预测值。
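据此,Focal Loss可按如下方式实现(以逐像素二分类分割为例的示意代码; 上式仅给出正类一项,代码按通常做法对负类作对称处理,对数底采用深度学习框架常用的自然对数,与log₂仅相差常数因子; α,γ取值仅为示例):

```python
import torch

def focal_loss(pred, target, alpha=0.25, gamma=2.0, eps=1e-7):
    """逐像素二分类Focal Loss示意实现。
    pred: 模型输出的前景(变色木)概率, 取值0~1; target: 真值掩码(0或1)。"""
    pred = pred.clamp(eps, 1.0 - eps)
    # 正样本项: -alpha*(1-p)^gamma*log(p); 负样本对称地以(1-p)为概率
    loss_pos = -alpha * (1 - pred) ** gamma * torch.log(pred) * target
    loss_neg = -(1 - alpha) * pred ** gamma * torch.log(1 - pred) * (1 - target)
    return (loss_pos + loss_neg).mean()
```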

2.3 模型训练

用于模型训练的工作站配置为: Intel Xeon E5-2630处理器,64 GB内存,1 TB磁盘,NVIDIA P100 16 GB显卡。表1中所示模型基于PyTorch 1.6构建,采用Adam优化器,学习率为0.001,训练100轮,批(batch)大小为16。每个模型单独训练3次,保存验证数据集损失(loss)值最小的模型参数作为最优模型参数。
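按上述配置,单个模型的训练流程可示意如下(数据集对象、模型与损失函数均为假设输入,损失函数可使用前文的Focal Loss示意实现; 该片段仅为流程示意,并非原文代码):

```python
import torch
from torch.utils.data import DataLoader

def train(model, criterion, train_set, val_set,
          epochs=100, lr=1e-3, batch_size=16, device="cuda"):
    """训练流程示意: Adam优化器, 学习率0.001, batch大小16, 训练100轮,
    保存验证集损失最小的模型参数。"""
    model = model.to(device)
    optimizer = torch.optim.Adam(model.parameters(), lr=lr)
    train_loader = DataLoader(train_set, batch_size=batch_size, shuffle=True)
    val_loader = DataLoader(val_set, batch_size=batch_size)
    best_val = float("inf")
    for epoch in range(epochs):
        model.train()
        for images, masks in train_loader:
            images, masks = images.to(device), masks.to(device)
            optimizer.zero_grad()
            loss = criterion(torch.sigmoid(model(images)), masks)
            loss.backward()
            optimizer.step()
        model.eval()                           # 在验证集上评估
        with torch.no_grad():
            val_loss = sum(
                criterion(torch.sigmoid(model(x.to(device))), y.to(device)).item()
                for x, y in val_loader) / len(val_loader)
        if val_loss < best_val:                # 保存验证损失最小的模型参数
            best_val = val_loss
            torch.save(model.state_dict(), "best_model.pth")
```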

2.4 精度评估

本研究采用召回率(Recall)、精确率(Precision)、F1值和交并比(intersection over union,IoU)评估各模型分割精度,计算公式分别为:

$Recall = \frac{TP}{TP+FN}$

$Precision = \frac{TP}{TP+FP}$

$F1 = \frac{2 \times Precision \times Recall}{Precision+Recall}$

$IoU = \frac{|A \cap B|}{|A \cup B|} = \frac{|A \cap B|}{|A|+|B|-|A \cap B|}$

式中: TP为正样本被正确识别为正样本的数量; FP为负样本被错误识别为正样本的数量; FN为正样本被错误识别为负样本的数量; F1值为Recall与Precision的调和平均值; IoU为预测值A与真实值B之间的交并比,因此其能够指示模型对变色木的识别精度。IoU值越高表明分割精度越高,基于此结果提取的变色木形态信息精度越高。
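基于二值化后的预测掩码与真值掩码,上述指标可按如下方式计算(示意代码,函数名与参数为假设):

```python
import numpy as np

def evaluate(pred, truth, eps=1e-7):
    """由二值预测掩码pred与真值掩码truth计算Recall, Precision, F1与IoU。"""
    pred = pred.astype(bool)
    truth = truth.astype(bool)
    tp = np.logical_and(pred, truth).sum()      # 正确识别的变色木像元数
    fp = np.logical_and(pred, ~truth).sum()     # 误报像元数
    fn = np.logical_and(~pred, truth).sum()     # 漏报像元数
    recall = tp / (tp + fn + eps)
    precision = tp / (tp + fp + eps)
    f1 = 2 * precision * recall / (precision + recall + eps)
    iou = tp / (tp + fp + fn + eps)             # |A∩B| / |A∪B|
    return recall, precision, f1, iou
```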

3 结果与分析

图3所示为各模型训练损失值和F1值。从图中可知,随着迭代次数增加,模型逐渐收敛,损失值趋近于0.1,F1值接近0.9,表明各模型均能够正确地完成训练。

图3

图3   模型训练损失值和F1值

Fig.3   Training loss and F1-score of each model


使用训练后的模型分别对待测试无人机影像进行预测,得到变色木图像分割结果。表2给出了选取的不同光照条件下、具有代表性背景地物的图像及对应分割结果。表2中5幅真彩色图像均含有变色木,具体的变色木掩码如表中“地面真值”所示。影像a由健康松树与变色木构成; b由健康松树、变色木与颜色相近的裸土构成; c由绿色植被、岩石与变色木构成; d的构成与c类似; e由绿色植被、变色木、裸土与建筑物构成,该图像在光照不充足条件下获得。

表2   模型分割结果

Tab.2  Segmentation results of models

(表2各列依次为: 序号、无人机影像、地面真值、FCN结果、U-Net结果、DeepLabV3+结果、OCNet结果; a—e各行为对应影像及分割结果,图略。)



分析表2可知,上述4种网络均能够较好地对松材线虫病变色木进行分割,无明显误报,表明深度语义分割模型对颜色相近的地物,如岩石、黄色裸土等有较强的分辨力。具体对比不同分割网络的变色木分割结果可知,FCN网络的分割边缘最为粗糙,这是由于FCN网络对低分辨率、高层级特征与高分辨率、低层级特征的融合方式较为简单,导致最终获得的特征图分辨率较低。相比FCN网络,U-Net和DeepLabV3+改进了不同深度特征的融合方式,具有更为精细的分割边缘。

不同分割模型的识别总体精度均高于0.9,具体精度如表3所示。从表中可知,DeepLabV3+具有最高的分割精度,IoU与F1值分别为0.711和0.829; FCN模型分割精度最低,IoU与F1值分别为0.699和0.812。U-Net网络具有较高精度,IoU与F1值均高于OCNet与FCN模型。因此,基于空洞卷积、编码和解码结构的DeepLabV3+模型具有最高的变色木分割精度,能够获得更为精确的变色木分割边缘。从表中还可知4种模型的训练与预测性能: DeepLabV3+训练耗时最低; 在结果预测上,FCN具有优势,DeepLabV3+耗时低于U-Net和OCNet。

表3   模型精度与性能

Tab.3  Accuracies and time usages of models

模型 | IoU | F1 | Precision | Recall | 训练/(ms·幅⁻¹) | 预测/(ms·幅⁻¹)
FCN | 0.699 | 0.812 | 0.821 | 0.804 | 53.1 | 7.2
U-Net | 0.710 | 0.825 | 0.821 | 0.828 | 44.5 | 20.5
DeepLabV3+ | 0.711 | 0.829 | 0.826 | 0.833 | 27.2 | 14.8
OCNet | 0.706 | 0.820 | 0.824 | 0.817 | 47.6 | 16.9



4 讨论

4.1 特征提取网络的深度对变色木识别精度影响

现阶段主流的深度语义分割模型多由前端和后端2部分构成,即特征提取网络与后端语义分割结构。特征提取网络提供模型开展语义分割所需的特征流,语义分割结构则处理后续的对象分割,因此前端(特征提取网络)的性能对最终的分割结果具有显著影响[7]。例如,本研究中变色木分割精度最高的DeepLabV3+模型使用ResNet网络[26]作为前端,而ResNet根据不同配置可以生成不同的网络深度,例如ResNet50,ResNet101,ResNet152等。为此,本研究使用不同深度的ResNet作为前端,比较分析在变色木分割中最适用于DeepLabV3+模型的ResNet特征提取网络。具体地,分别使用ResNet50,ResNet101,ResNet152这3种网络作为前端,对模型进行训练和预测,表4为预测结果示例。
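若采用开源库segmentation_models_pytorch(原文未说明具体实现库,此处仅作示意,库名与参数均为假设),更换DeepLabV3+的前端特征提取网络可按如下方式完成,后端结构保持不变,便于比较不同深度的ResNet前端:

```python
import segmentation_models_pytorch as smp

# 分别以ResNet50, ResNet101, ResNet152作为前端特征提取网络构建DeepLabV3+
# (单类别变色木分割, classes=1; 预训练权重与输入通道数为常见设置)
models = {
    name: smp.DeepLabV3Plus(encoder_name=name, encoder_weights="imagenet",
                            in_channels=3, classes=1)
    for name in ("resnet50", "resnet101", "resnet152")
}
```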

表4   基于不同ResNet的DeepLabV3+模型分割结果

Tab.4  Segmentation results of DeepLabV3+ with different backbones

(表4各列依次为: 序号、无人机影像、地面真值、DeepLabV3+(ResNet50)分割结果、DeepLabV3+(ResNet101)分割结果、DeepLabV3+(ResNet152)分割结果、DeepLabV3(ResNet50)分割结果; a—e各行为对应影像及分割结果,图略。)



从表4示例结果可知,随着ResNet网络深度的增加,DeepLabV3+网络的变色木分割精度并未显著提升。例如在影像e对应的分割结果中,随着ResNet网络深度的增加,土壤与变色木的分割结果未见改善,且基于ResNet152的DeepLabV3+网络对人造建筑物(影像e右下角的红色建筑物)的分割结果也未改善。不同ResNet网络深度对应的DeepLabV3+网络分割结果的测试精度如表5所示。整体上,基于ResNet50的DeepLabV3+网络具有最高的变色木分割精度,IoU与F1值分别为0.711和0.829。

表5   模型精度

Tab.5  Accuracy of models

模型 | IoU | F1 | Precision | Recall
DeepLabV3+(ResNet50) | 0.711 | 0.829 | 0.826 | 0.833
DeepLabV3+(ResNet101) | 0.702 | 0.822 | 0.847 | 0.798
DeepLabV3+(ResNet152) | 0.702 | 0.820 | 0.846 | 0.796
DeepLabV3(ResNet50) | 0.701 | 0.812 | 0.813 | 0.811



4.2 DeepLabV3系列模型编解码结构对变色木识别精度影响

在相同的特征提取网络条件下,语义分割网络的后端结构直接决定分割结果的精度以及分割边缘的精细程度。DeepLabV3网络后端设计依赖于ASPP模块,DeepLabV3+模型则在DeepLabV3网络ASPP模块基础上添加类似FCN和U-Net等模型的编码和解码结构,改进分割边缘的精度,获得更为详细的分割边缘。为此,本研究对比DeepLabV3和DeepLabV3+网络,分析DeepLabV3+网络中编码和解码结构对变色木分割结果的贡献。具体地,DeepLabV3和DeepLabV3+网络均使用ResNet50作为前端,训练模型并进行预测和精度验证,结果示例见表4。由表4可知,在相同的前端特征提取网络下,DeepLabV3+能够获得更为精细的变色木分割边缘,同时变色木分割精度也高于DeepLabV3(见表5)。这表明DeepLabV3+引入的编码和解码结构能够显著地改进DeepLabV3模型的分割精度,同时能够获得详细的分割边缘,有利于松材线虫病变色木识别。

4.3 疑似变色木识别结果受其他因素干扰

当前基于可见光无人机影像数据对疑似松材线虫变色木开展检测的原理在于松树受松材线虫侵染后发生枯萎,表观颜色发生明显变化。由于松树受其他病虫害危害或面临环境胁迫时也存在类似的颜色变化特征,因此当前基于可见光影像数据开展变色木识别获得的结果仅为疑似变色木。对于已经明确为松材线虫病疫区的林场,基于可见光影像识别的疑似变色木由松材线虫病致病的概率高,在实际工作中一般可认为是松材线虫病变色木; 对于其他地区则需要配合实验室镜检或基因检测等手段进一步确认。

5 结论

本研究采用航拍获取大区域无人机松材线虫病疑似变色木影像,通过手动解译绘制训练样本,研究当前主流语义分割网络对变色木的识别性能。研究结果表明,在不同光照、地物颜色特征相近的复杂背景条件下,当前主流的语义分割网络均可以较好地识别变色木,FCN,U-Net,DeepLabV3+和OCNet网络的IoU值分别达到0.699,0.710,0.711和0.706。在识别边缘精度上,FCN网络由于不同分辨率特征的混合方式单一,分割边缘精度最低; U-Net和DeepLabV3+保留高分辨率特征、改进了不同分辨率特征的混合方式,具有更为精细的分割边缘。

DeepLabV3+与DeepLabV3的对比分析表明,DeepLabV3+引入的编码解码结构显著地改进了变色木分割边缘的精度,有利于获得更为详细的变色木形态信息。基于ResNet50的DeepLabV3+网络具有最高的变色木分割精度,IoU与F1值分别达到0.711和0.829; 前端特征提取网络ResNet的网络深度(如ResNet50,ResNet101和ResNet152)对DeepLabV3+变色木分割精度并无显著影响。

参考文献

[1] Proença D N, Grass G, Morais P V. Understanding pine wilt disease:Roles of the pine endophytic bacteria and of the bacteria carried by the disease-causing pinewood nematode[J]. MicrobiologyOpen, 2017, 6(2):e00415.

[2] 张瑞瑞, 夏浪, 陈立平, 等. 基于U-Net网络和无人机影像的松材线虫病变色木识别[J]. 农业工程学报, 2020, 36(12):61-68.
Zhang R R, Xia L, Chen L P, et al. Recognition of wilt wood caused by pine wilt nematode based on U-Net network and unmanned aerial vehicle images[J]. Transactions of the Chinese Society of Agricultural Engineering, 2020, 36(12):61-68.

[3] 徐信罗, 陶欢, 李存军, 等. 基于Faster R-CNN的松材线虫病受害木识别与定位[J]. 农业机械学报, 2020, 51(7):228-236.
Xu X L, Tao H, Li C J, et al. Detection and location of pine wilt disease induced dead pine trees based on Faster R-CNN[J]. Transactions of the Chinese Society for Agricultural Machinery, 2020, 51(7):228-236.

[4] 叶建仁. 松材线虫病在中国的流行现状、防治技术与对策分析[J]. 林业科学, 2019, 55(9):1-10.
Ye J R. Epidemic status of pine wilt disease in China and its prevention and control techniques and counter measures[J]. Scientia Silvae Sinicae, 2019, 55(9):1-10.

[5] 国家林业和草原局. 国家林业和草原局公告(2020年第4号)(2020年松材线虫病疫区)[EB/OL]. [2020-03-16]. http://www.forestry.gov.cn/main/3457/20200326/145712092854308.html.
State Forestry and Grassland Administration. Announcement of the National Forestry and Grassland Administration (No.4 of 2020) (Pine wood nematode disease epidemic area in 2020)[EB/OL]. [2020-03-16]. http://www.forestry.gov.cn/main/3457/20200326/145712092854308.html.

[6] 许青云, 李莹, 谭靖, 等. 基于高分六号卫星数据的红树林提取方法[J]. 自然资源遥感, 2023, 35(1):41-48. doi:10.6046/zrzyyg.2022048.
Xu Q Y, Li Y, Tan J, et al. Information extraction method of mangrove forests based on GF-6 data[J]. Remote Sensing for Natural Resources, 2023, 35(1):41-48. doi:10.6046/zrzyyg.2022048.

[7] Xia L, Zhang R, Chen L, et al. Evaluation of deep learning segmentation models for detection of pine wilt disease in unmanned aerial vehicle images[J]. Remote Sensing, 2021, 13(18):3594.

[8] 吴琼. 基于遥感图像的松材线虫病区域检测算法研究[D]. 合肥: 安徽大学, 2013.
Wu Q. Research on Bursaphelenchus xylophilus area detection based on remote sensing image[D]. Hefei: Anhui University, 2013.

[9] 曾全, 孙华富, 杨远亮, 等. 无人机监测松材线虫病的精度比较[J]. 四川林业科技, 2019, 40(3):92-95,114.
Zeng Q, Sun H F, Yang Y L, et al. Precision comparison for pine wood nematode disease monitoring by UAV[J]. Journal of Sichuan Forestry Science and Technology, 2019, 40(3):92-95,114.

[10] Iordache M D, Mantas V, Baltazar E, et al. A machine learning approach to detecting pine wilt disease using airborne spectral imagery[J]. Remote Sensing, 2020, 12(14):2280.

[11] Syifa M, Park S J, Lee C W. Detection of the pine wilt disease tree candidates for drone remote sensing using artificial intelligence techniques[J]. Engineering, 2020, 6(8):919-926.

[12] Xia L, Zhao F, Chen J, et al. A full resolution deep learning network for paddy rice mapping using Landsat data[J]. ISPRS Journal of Photogrammetry and Remote Sensing, 2022, 194:91-107.

[13] 胡建文, 汪泽平, 胡佩. 基于深度学习的空谱遥感图像融合综述[J]. 自然资源遥感, 2023, 35(1):1-14. doi:10.6046/zrzyyg.2021433.
Hu J W, Wang Z P, Hu P. A review of pansharpening methods based on deep learning[J]. Remote Sensing for Natural Resources, 2023, 35(1):1-14. doi:10.6046/zrzyyg.2021433.

[14] 金远航, 徐茂林, 郑佳媛. 基于改进YOLOv4-tiny的无人机影像枯死树木检测算法[J]. 自然资源遥感, 2023, 35(1):90-98. doi:10.6046/zrzyyg.2022018.
Jin Y H, Xu M L, Zheng J Y. A dead tree detection algorithm based on improved YOLOv4-tiny for UAV images[J]. Remote Sensing for Natural Resources, 2023, 35(1):90-98. doi:10.6046/zrzyyg.2022018.

[15] Shelhamer E, Long J, Darrell T. Fully convolutional networks for semantic segmentation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(4):640-651.

[16] Zhao H, Shi J, Qi X, et al. Pyramid scene parsing network[C]// 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Honolulu, HI, USA: IEEE, 2017:6230-6239.

[17] Badrinarayanan V, Kendall A, Cipolla R. SegNet:A deep convolutional encoder-decoder architecture for image segmentation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(12):2481-2495.

[18] Ronneberger O, Fischer P, Brox T. U-Net:Convolutional networks for biomedical image segmentation[C]// International Conference on Medical Image Computing and Computer-Assisted Intervention. Cham: Springer International Publishing, 2015:234-241.

[19] Yang M, Yu K, Zhang C, et al. DenseASPP for semantic segmentation in street scenes[C]// 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City, UT, USA: IEEE, 2018:3684-3692.

[20] Fu J, Liu J, Tian H, et al. Dual attention network for scene segmentation[C]// 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Long Beach, CA, USA: IEEE, 2019:3141-3149.

[21] Chen L C, Zhu Y, Papandreou G, et al. Encoder-decoder with atrous separable convolution for semantic image segmentation[C]// Proceedings of the European Conference on Computer Vision (ECCV). Munich: ACM, 2018:833-851.

[22] Yuan Y, Huang L, Guo J, et al. OCNet:Object context network for scene parsing[EB/OL]. 2018: arXiv:1809.00916. https://arxiv.org/abs/1809.00916.pdf.

[23] Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition[EB/OL]. arXiv:1409.1556. https://arxiv.org/abs/1409.1556.pdf.

[24] Chen L C, Papandreou G, Schroff F, et al. Rethinking atrous convolution for semantic image segmentation[EB/OL]. arXiv:1706.05587. https://arxiv.org/abs/1706.05587.pdf.

[25] Chollet F. Xception:Deep learning with depthwise separable convolutions[C]// 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Honolulu, HI, USA: IEEE, 2017:1800-1807.

[26] He K, Zhang X, Ren S, et al. Deep residual learning for image recognition[C]// 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Las Vegas, NV, USA: IEEE, 2016:770-778.

[27] Huang Q, Sun J, Ding H, et al. Robust liver vessel extraction using 3D U-Net with variant dice loss function[J]. Computers in Biology and Medicine, 2018, 101:153-162.

[28] Lin T Y, Goyal P, Girshick R, et al. Focal loss for dense object detection[C]// 2017 IEEE International Conference on Computer Vision (ICCV). Venice, Italy: IEEE, 2017:2999-3007.

