自适应感受野机制遥感图像分割模型被引量：4

Remote sensing image segmentation model based on an adaptive receptive field mechanism

导出

摘要目的遥感图像中存在大小、形态不一的目标,增加了目标分割任务的困难性。感受野代表了特征图中每个像素对应输入图像的区域,若感受野与目标形状的契合度较高,则特征图中包含的目标特征更加完整,有利于分割。在现有的分割方法中,通常采用的是正方形的感受野,而遥感图像中目标形状多变,导致感受野无法较好地契合目标形状,在提取目标特征时会引入过多的无用特征,从而影响分割精度。为此,本文提出基于自适应感受野机制的遥感图像分割模型。方法在编码—解码网络结构的基础上,引入自适应感受野机制。首先在编码器上提取不同大小和宽高比的感受野特征,然后在特征融合时使用通道注意力模块自适应地获取通道权重,通过加权强化与目标形状契合度高的感受野的特征,弱化与目标形状契合度低的感受野的特征,在保留目标特征的同时减少背景特征的干扰,进而提升模型的分割精度。结果在Inria Aerial Image Labeling数据集与Deep Globe Road Extraction数据集上进行实验并与相关方法比较,在两个数据集上的平均交并比分别为76.1%和61.9%,平均F1值分别为86.5%和76.5%。结论本文模型能够提取不同形状感受野的特征,并自适应地获取通道权重,使模型能提取更加完整的目标特征,从而提升目标分割效果。 Objective Remote sensing image segmentation is a technique for segmenting the target of interest.In the field of deep learning,convolutional neural networks(CNNs)are typically used to extract image features and then classify each pixel of the image.Remote sensing image segmentation has a wide range of applications,including environmental monitoring,urban construction,and crop classification.It is highly significant in the extraction and analysis of image information.However,high-resolution remote sensing images have a large number of targets with different shapes and sizes,and thus,many difficulties are encountered in achieving image segmentation.A receptive field is an important attribute of CNNs,and the matching degree between the receptive field and target size is related to the completeness and robustness of the extracted target features.If the receptive field matches the target shape well,then the target features contained in the feature map will be complete;otherwise,the feature map will contain many useless features that will interfere with the segmentation task.In existing methods,the square receptive field is used to extract features.However,the shape of targets in remote sensing images are different,and thus,the square receptive field cannot fit the shape of the target well.If the mismatched receptive field is used to extract target features,then useless features will interfere with segmentation.To solve this problem,this study proposes a remote sensing image segmentation model(RSISM)based on an adaptive receptive field mechanism(ARFM),referred to as RSISM-ARFM hereafter.Method RSISM-ARFM can extract receptive fields with different sizes and ratios while simultaneously channel weighting the features of different receptive fields during feature fusion.In this manner,the receptive field features that match the target shape can be strengthened;otherwise,they are weakened,reducing the interference of useless features while retaining target features.RSISM-ARFM uses an encoder-decoder network as its backbone network.This backbone network consists of an encoder and a decoder.The encoder is used to extract basic convolution features while reducing the size of the feature map to extract deep semantic information.The extracted features in the shallow layer of the encoder contain rich detailed information,such as target location and edge.Meanwhile,the extracted features in the deep layer of the encoder contain semantic information that can help the model identify the target better.To fuse the two parts of information,the decoder concatenates feature maps at different layers to improve the feature extraction capability of the model.On the basis of the backbone network,this study introduces an ARFM.First,the features of different receptive fields are extracted from the encoder.Then,the channel attention module is used to calculate the dependency relationship among the channels of the feature map to generate channel weights.Finally,the feature maps of different receptive fields are weighted.After the aforementioned operations,the model can adaptively adjust the relationship among different receptive fields and select appropriate receptive fields to extract the features of the target.Result In this study,we conducted ablation and comparative experiments on the Inria Image Labeling and Deep Globe Road Extraction datasets.Given the large size of the original images in the datasets,they cannot be used directly in the experiments.Therefore,the training and test sets were cropped to 256×256 pixel images during the experiments.The model was trained first using the training set and then tested using the test set.To verify the effectiveness of RSISM-ARFM,we conducted ablation and comparative experiments using the two aforementioned datasets.Simultaneously,we used different evaluation indexes in the experiments to evaluate the segmentation performance of the model from multiple perspectives.Experimental results show that the proposed method can effectively improve the segmentation accuracy of targets with different shapes.The segmentation result of RSISM-ARFM is the closest to the labeled image,and the details of the targets are the clearest.The intersection over union on the two datasets reaches 76.1%and 61.9%,and the average F1 score reaches 86.5%and76.5%,respectively.Segmentation performance is better than that of the comparison model.Conclusion The model proposed in this study adds an ARFM based on an encoder-decoder network.It extracts the features of the receptive fields of different target shapes and sizes and then uses the channel attention module to perform channel weighting adaptively on the features during the feature fusion process.Accordingly,the model extracts complete target features and reduces the introduction of useless features,improving segmentation accuracy.

作者刘航汪西莉 Liu Hang;Wang Xili(School of Computer Science,Shaanxi Normal Inirersity,Xi'an 710119,China)

机构地区陕西师范大学计算机科学学院

出处《中国图象图形学报》 CSCD 北大核心 2021年第2期464-474,共11页 Journal of Image and Graphics

基金国家自然科学基金项目(41471280,61701290,61701289)。

关键词遥感图像卷积神经网络(CNN) 图像分割自适应感受野机制(ARFM) 通道注意力模块(CAM) remote sensing image convolutional neural network(CNN) image segmentation adaptive receptive field mechanism(ARFM) channel attention module(CAM)

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献1

1刘航,汪西莉.基于注意力机制的遥感图像分割模型[J].激光与光电子学进展,2020,57(4):162-172. 被引量：19

二级参考文献4

1徐岩,孙美双.基于多特征融合的卷积神经网络图像去雾算法[J].激光与光电子学进展,2018,55(3):254-263. 被引量：17
2吴晨玥,易本顺,章云港,黄松,冯雨.基于改进卷积神经网络的视网膜血管图像分割[J].光学学报,2018,38(11):125-131. 被引量：46
3贺浩,王仕成,杨东方,王舒洋,刘星.基于Encoder-Decoder网络的遥感影像道路提取方法[J].测绘学报,2019,48(3):330-338. 被引量：50
4张芳,吴玥,肖志涛,耿磊,吴骏,刘彦北,王雯.基于U-Net卷积神经网络的纳米颗粒分割[J].激光与光电子学进展,2019,56(6):129-135. 被引量：11

共引文献18

1刘昕宇,闫铮,段放,戴中颖.基于Kanade-Lucas-Tomasi算法的人体体表呼吸运动追踪[J].激光与光电子学进展,2020,57(22):50-58. 被引量：3
2李宇昕,杨帆,刘钊,司亚中.基于改进残差网络的道口车辆分类方法[J].激光与光电子学进展,2021,58(4):376-382. 被引量：7
3曹春红,段鸿轩,曹玲,张乐乐,胡凯,肖芬.基于多级特征级联的遥感图像实时语义分割[J].山东大学学报（工学版）,2021,51(2):19-25. 被引量：4
4伍玉彬,刘浩然.改进活动轮廓波模型的激光全息图像分割研究[J].激光杂志,2021,42(6):73-77. 被引量：1
5宋廷强,刘童心,宗达,蒋晓旭,黄腾杰,范海生.改进U-Net网络的遥感影像道路提取方法研究[J].计算机工程与应用,2021,57(14):209-216. 被引量：20
6何青,孟洋洋,李华智.多层次编码—解码网络遥感图像建筑物分割[J].计算机应用研究,2021,38(8):2510-2514. 被引量：6
7张颖,李小霞,李永龙,吕念祖,王皓冉,顾书豪,王学渊.结合密集注意力和并行上采样的遥感图像道路分割[J].小型微型计算机系统,2021,42(11):2356-2361. 被引量：2
8李喆雨,丁坤,张经炜,李辰阳,黎彰,刘永杰.基于选择增强的光伏阵列遮挡分割研究[J].激光与光电子学进展,2021,58(24):193-201. 被引量：3
9于丽丽,于海洋,何子鑫,陈良轩.基于双注意力机制和多尺度特征的点云场景分割[J].激光与光电子学进展,2021,58(24):463-471. 被引量：4
10耿欣,雷丽珍,花卉,胡睿飏,杨钰灵.基于深度学习方法的耕地违建自动提取[J].地理空间信息,2022,20(3):18-24. 被引量：3

同被引文献31

1唐英鹏,黄圣君.主动学习研究进展[J].中国基础科学,2022(3):18-26. 被引量：2
2李恒,赵广社,王鼎衡,刘美兰,马凡波.加权聚合深度卷积特征的图像检索方法[J].信息与控制,2020,49(1):55-61. 被引量：1
3袁培森,任守纲,翟肇裕,徐焕良.基于半监督主动学习的菊花表型分类研究[J].农业机械学报,2018,49(9):27-34. 被引量：4
4许磊,黎智辉,李志刚,班茂森,王富强,黄威,郭晶晶,谢兰迟,张宁,晏于文.视频侦查模拟实验在案件侦破中的应用[J].刑事技术,2018,43(4):330-333. 被引量：10
5张鑫禄,张崇涛,戴晨光,季虹良,王映雪.基于DeepLabv3架构的高分辨率遥感图像分类[J].海洋测绘,2019,39(2):40-44. 被引量：12
6侯媛媛,何儒汉,李敏,陈佳.结合卷积神经网络多层特征融合和K-Means聚类的服装图像检索方法[J].计算机科学,2019,46(B06):215-221. 被引量：19
7王中宇,倪显扬,尚振东.利用卷积神经网络的自动驾驶场景语义分割[J].光学精密工程,2019,27(11):2429-2438. 被引量：33
8朱杰,赵相坤,谢博鋆,吴树芳.基于深度特征加权的图像表示方法[J].郑州大学学报（理学版）,2020,52(1):47-53. 被引量：2
9刘文祥,舒远仲,唐小敏,刘金梅.采用双注意力机制Deeplabv3+算法的遥感影像语义分割[J].热带地理,2020,40(2):303-313. 被引量：43
10余帅,汪西莉.基于多级通道注意力的遥感图像分割方法[J].激光与光电子学进展,2020,57(4):134-143. 被引量：9

引证文献4

1张家钧,唐云祁,杨智雄.融合自适应感受野与多支路特征的鞋型识别算法[J].计算机工程,2022,48(6):295-303.
2郭新,张斌,程坤.面向小目标提取的改进DeepLabV3+模型遥感图像分割[J].遥感信息,2022,37(2):34-44. 被引量：3
3叶宽,杨博,谢欢,朱戎,赵蕾,张青月,赵杰.基于渐进生长Transformer Unet的遥感图像建筑物分割[J].无线电工程,2023,53(2):424-430. 被引量：1
4袁培森,丁毅飞,徐焕良.基于深度主动学习与CBAM的细粒度菊花表型识别[J].农业机械学报,2024,55(2):258-267. 被引量：3

二级引证文献7

1郝朝阳,于晓,叶健.基于改进U-Net的刑侦红外手印目标提取[J].红外,2023,44(5):46-52.
2周羿,刘德儿.融合注意力机制及DenseASPP改进的DeeplabV3+遥感图像分割方法[J].遥感信息,2023,38(3):85-92. 被引量：1
3李华,李国.无人机可见光遥感影像地物目标提取技术研究[J].计算机测量与控制,2024,32(2):250-255. 被引量：1
4韩久春,袁啸宇,杜海峰,张玉明,邱征.基于TransU-Net网络的高分辨率遥感影像水体提取[J].计算机应用文摘,2024,40(10):45-47.
5张家瑜,朱锐,邱威,陈坤杰.基于选择性注意力神经网络的木薯叶病害检测算法[J].农业机械学报,2024,55(5):254-262.
6郑俊键,兰玉彬,熊万杰,李硕,杨润娜,董昕.基于YOLOv5s改进模型的小白菜虫害识别方法[J].农业工程学报,2024,40(13):124-133.
7岳继博,冷梦蝶,田庆久,郭伟,刘杨,冯海宽,乔红波.叶片多理化参数的高光谱遥感与深度学习估算[J].光谱学与光谱分析,2024,44(10):2873-2883.

1Hao HE,Shuyang WANG,Shicheng WANG,Dongfang YANG,Xing LIU.A Road Extraction Method for Remote Sensing Image Based on Encoder-Decoder Network[J].Journal of Geodesy and Geoinformation Science,2020,3(2):16-25. 被引量：23
2Ronnie Lins.Vision for the Globe[J].Beijing Review,2021,64(7):20-21.
3余奕杉,卫平.中国城市绿色全要素生产率测度研究[J].生态经济,2021,37(3):43-52. 被引量：47
4Andrzej Pawuła.The Phenomenon of a Natural Thermonuclear Reactor[J].Journal of Geoscience and Environment Protection,2021,9(2):92-109. 被引量：3
5Jieyin BAI,Jie ZHU,Rui ZHAO,Fengqiang GU,Jiao WANG.Area-based non-maximum suppression algorithm for multi-object fault detection[J].Frontiers of Optoelectronics,2020,13(4):425-432. 被引量：5
6Philippe Ciais,Yitong Yao,Thomas Gasser,Alessandro Baccini,Yilong Wang,Ronny Lauerwald,Shushi Peng,Ana Bastos,Wei Li,Peter A.Raymond,Josep G.Canadell,Glen P.Peters,Rob J.Andres,Jinfeng Chang,Chao Yue,A.Johannes Dolman,Vanessa Haverd,Jens Hartmann,Goulven Laruelle,Alexandra G.Konings,Anthony W.King,Yi Liu,Sebastiaan Luyssaert,Fabienne Maignan,Prabir K.Patra,Anna Peregon,Pierre Regnier,Julia Pongratz,Benjamin Poulter,Anatoly Shvidenko,Riccardo Valentini,Rong Wang,Grégoire Broquet,Yi Yin,Jakob Zscheischler,Bertrand Guenet,Daniel SGoll,Ashley-P.Ballantyne,Hui Yang,Chunjing Qiu,Dan Zhu.Empirical estimates of regional carbon budgets imply reduced global soil heterotrophic respiration[J].National Science Review,2021,8(2):55-68. 被引量：5
7BAO Bei-hua,YAN Xiao-jing,CAO Yu-dan,YAO Wei-feng,CHENG Fang-fang,CHEN Pei-dong,SHAN Ming-qiu,ZHANG Li,DING An-wei.Radix Kansui Stir-Fried with Vinegar Reduces Radix Kansui-Related Hepatotoxicity in Mice via Mitochondrial Pathway[J].Chinese Journal of Integrative Medicine,2021,27(3):192-197. 被引量：1
8Li-qing Zhu,Li Zhang,Jia Zhang,Guo-lin Chang,Gang Liu,Dan-dan Yu,Xiao-min Yu,Mi-sheng Zhao,Bin Ye.Evodiamine inhibits high-fat diet-induced colitis-associated cancer in mice through regulating the gut microbiota[J].Journal of Integrative Medicine,2021,19(1):56-65. 被引量：16

中国图象图形学报

2021年第2期

浏览历史

内容加载中请稍等...

自适应感受野机制遥感图像分割模型被引量：4

参考文献1

二级参考文献4

共引文献18

同被引文献31

引证文献4

二级引证文献7

相关作者

相关机构

相关主题

浏览历史

自适应感受野机制遥感图像分割模型 被引量：4

参考文献1

二级参考文献4

共引文献18

同被引文献31

引证文献4

二级引证文献7

相关作者

相关机构

相关主题

浏览历史

自适应感受野机制遥感图像分割模型被引量：4