基于多视觉码本的图像表示被引量：1

Image Representation Based on Multiple Visual Codebooks

下载PDF

导出

摘要基于词袋模型的图像表示方法的有效性主要受限于局部特征的量化误差.文中提出一种基于多视觉码本的图像表示方法,通过综合考虑码本构建和编码方法这两个方面的因素加以改进.具体包括:1)多视觉码本构建,以迭代方式构建多个紧凑且具有互补性的视觉码本;2)图像表示,首先针对多码本的情况,依次从各码本中选择相应的视觉单词并采用线性回归估计编码系数,然后结合图像的空间金字塔结构形成最终的图像表示.在一些标准测试集合的图像分类结果验证文中方法的有效性. The effectiveness of the image representation based on bag-of-visual words （BoW） model is majorly limited by the quantization error. To address this issue, an improved image representation based on multiple visual codebooks is proposed in this paper, which considers both visual codebook construction and feature coding. The proposed method specifically consists of 1 ） multiple visual codebooks construction, in which the compact and complementary visual codebooks are iteratively generated; 2） image representation, in which the visual words are firstly selected from each individual visual codebook, then the coding coefficients are determined by using the regularized linear regression method, and finally the image is represented by combining the spatial pyramid structure. The experimental results on several benchmark image classification datasets demonstrate the consistent and significant improvement of the proposed method.

作者宋彦蒋兵戴礼荣

机构地区中国科学技术大学电子工程与信息科学系科大讯飞语音实验室

出处《模式识别与人工智能》 EI CSCD 北大核心 2013年第10期909-915,共7页 Pattern Recognition and Artificial Intelligence

基金国家自然科学基金资助项目(No.61172158)

关键词图像分类视觉码本聚类分析图像表示 Image Classification, Visual Codebook, Clustering Analysis, Image Representation

分类号 TP391.41 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献15

1Lazebnik S, Schmid C, PonceJ. Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories//Proc of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. New York, USA, 2006, II: 2169-2178.
2Boureau Y, Bach F, LeCun Y, et al. Learning Mid-Level Features for Recognition//Proc of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. San Francisco, USA, 2010: 2559-2566.
3Lowe D. Distinctive Image Features from Scale-Invariant Keypoints. InternationalJournal of Computer Vision, 2004, 60(2): 91-110.
4SivicJ, Zisserman A. Video Coogle: A Text Retrieval Approach to Object Matching in Videos//Proc of the 9th IEEE International Conference on Computer Vision. Nice, France, 2003, II: 1470-1477.
5Aharon M, Elad M, Bruckstein A. K-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation. IEEE Trans on Signal Processing, 2006, 54 ( 11 ) : 4311-4322.
6Jiang Yuguang, Ngo C W. Visual Word Proximity and Linguistics for Semantic Video Indexing and Near-Duplicate Retrieval. Compu?ter Vision and Image Understanding, 2009,113(3): 405-414.
7Jurie F, Triggs B. Creating Efficient Codebooks for Visual Recogni?tion//Proc of the 10th International Conference on Computer Vision. Beijing, China, 2005, I: 604-610.
8Boiman 0, Shechtman E, Irani M. In Defense of Nearest-Neighbor Based Image Classification//Proc of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Ancho?rage, USA, 2008: 1-8.
9GernertJ, GeusebroekJ, Veenman C, et al. Kernel Codebooks for Scene Categorization//Proc of the 10th European Conference on Computer Vision. Marseille, France, 2008: 696-709.
10Coates A, Ng A Y. The Importance of Encoding versus Training with Sparse Coding and Vector Quantization//Proc of the 28th International Conference on Machine Learning. Bellevue, USA, 2011 : 921-928.

同被引文献9

1袁非牛,夏雪,李钢,章琳,史劲亭.面向烟雾识别与纹理分类的Gabor网络[J].中国图象图形学报,2019,24(2):269-281. 被引量：19
2陶华伟,赵力,奚吉,虞玲,王彤.基于颜色及纹理特征的果蔬种类识别方法[J].农业工程学报,2014,30(16):305-311. 被引量：51
3生海迪,段会川,孔超.基于语义短语的空间金字塔词袋模型图像分类方法[J].小型微型计算机系统,2015,36(4):877-881. 被引量：8
4郭礼华,罗材.食材数据库统计与对比实验性能分析[J].中国图象图形学报,2017,22(8):1079-1088. 被引量：3
5汤勃,孔建益,伍世虔.机器视觉表面缺陷检测综述[J].中国图象图形学报,2017,22(12):1640-1663. 被引量：278
6李志欣,李艳红,张灿龙.一种多特征融合的场景分类方法[J].小型微型计算机系统,2018,39(5):1085-1091. 被引量：7
7朱善玮,李玉惠.基于SVM和VAR/LBP的车脸识别[J].电子科技,2018,31(7):7-10. 被引量：6
8白鑫,卫琳.基于双级特征提取与度量的图像检索算法[J].包装工程,2018,39(21):198-205. 被引量：3
9周柱,甘屹,孙福佳.一种融合Gabor+SIFT特征的人脸识别算法[J].电子科技,2019,32(4):1-5. 被引量：5

引证文献1

1张泽晨,巨志勇.基于BoF模型的多特征融合果蔬图像分类方法[J].电子科技,2020,33(7):41-45. 被引量：3

二级引证文献3

1裴晓芳,胡敏.杜鹃花各生长期识别与监测研究[J].电子科技,2021,34(1):17-22.
2毛颖颖,袁浩.基于深度学习的果蔬智能分类识别研究[J].长江信息通信,2021,34(7):91-93. 被引量：2
3吴冀豪,常玉祥,汪宇玲,彭思绘.基于机器学习的果蔬识别研究综述[J].机器人技术与应用,2022(4):29-31. 被引量：2

1刘琴.基于二维图像表示的人脸识别算法研究[J].无线互联科技,2016,13(23):111-112. 被引量：1
2李春华,秦志英.图像的超小波稀疏表示[J].电视技术,2012,36(13):44-47. 被引量：2
3卢朝阳,颜尧平,吴成柯.多分辨率DT模型基图像表示方法[J].西安电子科技大学学报,1999,26(3):277-280. 被引量：10
4王养利,吴成柯.基于V氏图的图像分割及表示[J].西安电子科技大学学报,2000,27(3):340-343. 被引量：1
5Barba.,S 陶建义.利用小波分析对合成孔径雷达图像分类[J].空载雷达,1995(4):25-30.
6方绍峡,金德鹏,苏厉,曾烈光.基于线性回归的UWB系统频偏估计算法[J].数据采集与处理,2012,27(1):101-104. 被引量：4
7钱钧,杨恒,刘培桢,姜文涛,周锋飞.一种基于词袋模型的大规模图像层次化分组算法[J].应用光学,2014,35(5):799-805. 被引量：4
8郭平,胡明.基于RBF网络图像表示的CT重建算法研究[J].电子学报,2007,35(6):1183-1186. 被引量：1
9黄巍,陈传波,郑运平,吴雪丽.梯形子模式非对称逆布局二值图像表示方法[J].计算机科学,2008,35(8):213-217. 被引量：1
10视频技术[J].电子科技文摘,2002,0(4):79-93.

模式识别与人工智能

2013年第10期

浏览历史

内容加载中请稍等...

基于多视觉码本的图像表示被引量：1

参考文献15

同被引文献9

引证文献1

二级引证文献3

相关作者

相关机构

相关主题

浏览历史

基于多视觉码本的图像表示 被引量：1

参考文献15

同被引文献9

引证文献1

二级引证文献3

相关作者

相关机构

相关主题

浏览历史

基于多视觉码本的图像表示被引量：1