Aerial Images Categorization with Deep Learning (深度学习在航拍场景分类中的应用)

Cited by: 9


Abstract: In recent decades, aerial images and videos have been widely used in urban planning, coastal monitoring, and military tasks. Understanding the content of aerial images and identifying the scene types captured in aerial videos is therefore very important. However, most popular scene classification algorithms target natural scenes, and few address high-resolution aerial scene classification. This paper proposes a hierarchical scene classification algorithm for high-resolution aerial images. First, the scale-invariant feature transform (SIFT) is used to extract robust local patch features. Then, on top of a bag-of-visual-words representation, a deep belief network (DBN) initialized by restricted Boltzmann machines (RBM) is used to obtain latent variables that describe the relationship between low-level region features and high-level global features. The DBN also serves as the classifier. Experimental results show that the proposed method performs slightly better than current mainstream scene classification algorithms on high-resolution aerial images.
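The bag-of-visual-words step named in the abstract can be illustrated with a minimal sketch. The abstract does not say how the codebook is built or how descriptors are assigned to words, so the hard nearest-codeword assignment below is an assumption, and the names `nearest_codeword` and `bow_histogram` are hypothetical. The toy descriptors are 2-dimensional for readability; real SIFT descriptors have 128 dimensions. The resulting histogram is the kind of fixed-length vector that could then be fed to a DBN.

```python
import math
from typing import List, Sequence


def nearest_codeword(descriptor: Sequence[float],
                     codebook: Sequence[Sequence[float]]) -> int:
    """Return the index of the codeword closest to the descriptor
    (squared Euclidean distance; hard assignment is an assumption)."""
    best_index, best_dist = 0, math.inf
    for index, codeword in enumerate(codebook):
        dist = sum((d - c) ** 2 for d, c in zip(descriptor, codeword))
        if dist < best_dist:
            best_index, best_dist = index, dist
    return best_index


def bow_histogram(descriptors: Sequence[Sequence[float]],
                  codebook: Sequence[Sequence[float]]) -> List[float]:
    """Quantize local descriptors against a codebook and return a
    normalized visual-word histogram for one image."""
    counts = [0] * len(codebook)
    for descriptor in descriptors:
        counts[nearest_codeword(descriptor, codebook)] += 1
    total = sum(counts)
    if total == 0:
        # No descriptors were extracted: return an all-zero histogram.
        return [0.0] * len(codebook)
    return [count / total for count in counts]


# Toy data (hypothetical): two visual words, four local descriptors.
toy_codebook = [[0.0, 0.0], [10.0, 10.0]]
toy_descriptors = [[1.0, 0.0], [0.0, 1.0], [9.0, 10.0], [11.0, 9.0]]
toy_histogram = bow_histogram(toy_descriptors, toy_codebook)
```

In this toy case two descriptors fall nearest to each codeword, so the histogram assigns equal weight to both visual words. In practice the codebook would typically be learned by clustering descriptors from training images, which this sketch leaves out.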

Authors: LI Xiaolong (李晓龙), ZHANG Zhaoxiang (张兆翔), WANG Yunhong (王蕴红), LIU Qingjie (刘庆杰)

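Affiliation: Laboratory of Intelligent Recognition and Image Processing, School of Computer Science, Beihang University (北京航空航天大学)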

Source: Journal of Frontiers of Computer Science and Technology (《计算机科学与探索》), CSCD, 2014, No. 3, pp. 305-312 (8 pages)

Funding: National Natural Science Foundation of China, No. 61005016; National Basic Research Program of China (973 Program), No. 2010CB327902

Keywords: aerial image scene classification; bag of visual words (bag of features); deep learning; high resolution

Classification code: TP181 [Automation and Computer Technology: Control Theory and Control Engineering]

