期刊文献+

基于内容相关性的场景图像分类方法 被引量:13

Scene Image Categorization Based on Content Correlation
下载PDF
导出
摘要 场景图像分类是计算机视觉领域中的一个基本问题.提出一种基于内容相关性的场景图像分类方法.首先从图像上提取视觉单词,并把图像表示成视觉单词的词频矢量;然后利用产生式模型来学习训练集合中包含的主题,和每一幅图像所包含的相关主题;最后用判定式分类器进行多类学习.提出的方法利用logistic正态分布对主题的相关性进行建模,使得学习得到的类别的主题分布更准确.并且在学习过程中不需要对图像内容进行人工标注.还提出了一种新的局部区域描述方法,它结合了局部区域的梯度信息和彩色信息.在自然场景图像集合和人造场景图像集合上实验了提出的方法,它相对于传统方法取得了更好的结果. Scene image categorization is a basic problem in the field of computer vision. A content correlation based scene image categorization method is proposed in this paper. First of all, dense local features are extracted from images. The local features are quantized to form visual words, and images are represented by the " bag-of-visual words" vector. Then a logistic-normal distribution-based generative model is used to learn themes in the training set, and themes distribution on each image in the training set. Finally, an SVM based discriminative model is used to train the multi-classifier. The proposed approach has the following advantages. Firstly, the approach uses logistic normal distribution as the prior distribution of themes. The correlation of themes is induced by the covariance matrix of logistic normal distribution, which makes the theme distribution of subjects more accurate. Secondly, manually tagging image content is not required in learning process, so as to avoid the heavy human labor and subjective uncertainty introduced in the process of labeling. A new local descriptor is proposed in this paper, which combines the gradient and color information of local area. Experimental results on natural scene dataset and manmade scene dataset show that the proposed scene image categorization method achieves better results than traditional methods.
作者 秦磊 高文
出处 《计算机研究与发展》 EI CSCD 北大核心 2009年第7期1198-1205,共8页 Journal of Computer Research and Development
基金 国家"八六三"高技术研究发展计划基金项目(2006AA01Z117) 国家"九七三"重点基础发展研究计划基金项目(2009CB320900)~~
关键词 场景图像分类 产生式模型 logistic正态分布 内容相关性 计算机视觉 scene image categorization generative model logistic normal distribution content correlation computer vision
  • 相关文献

参考文献19

  • 1Treisman A.Gelade G.A featureintcgration theory of attention[J].Cognitive Psychology.1980.12(1):97-136.
  • 2Olira A,Torralba A.Modeling the shape of the scene:A holistic representation of the spatial envelope[J].International Journal on Computer Vision,2001,42 (3):145-175.
  • 3Vogel J,Schiele B.Natural scene retrieval based on a semantic modeling step[C] //Proc of the ACM Int Conf on Image and Video Retrieval.New York:ACM,2004:207-215.
  • 4Blei D,Ng A,Jordan M.Latent Dirichlet allocation[J].Journal of Machine Learning Re,arch.2003.3 (5):993-1022.
  • 5Hofmann T.Unsupervised learning by probabilistie latent semantic analysis[J].Machine l.earning.2001,42(1):177-196.
  • 6Li Feifei,Perona P.A Bayesian hierarchical model for learning natural scene categories[C] //Proc of the IEEE Computer Vision and Pattern Recognition.Piscataway,NJ:IEEE,2005:524-531.
  • 7Anna Bosch,Andrew Zisserman,Xavier Munoz.Scene classification via pLSA[C] //Proc of the European Conf on Computer Vision.Berlin:Springer,2006:517-530.
  • 8in Lei,Zeng Wei,Gap Wen,et al.Local invariant descriptor for image matching[C] //Proc of the IEEE Int Conf on Acoustics,Speech,and Signal Processing.Piscataway.NJ:IEEE,2005:1025-1028.
  • 9Sivic J,Russell B C,Eiros A A,et al.Discovering objects and their localization in images[C] //Proc of the Int Conf on Computer Vision.Washington,DC:IEEE Computer Society,2005:370-377.
  • 10Sivic J,Zisserman A.Video Googlee A text retrieval approach to object matching in videos[C] //Proc of the lnt Conf on Computer Vision.Washington,OC:IEEE Computer.Society,2003:1470-1477.

共引文献3

同被引文献126

引证文献13

二级引证文献35

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部