期刊文献+

Study of Human Action Recognition Based on Improved Spatio-temporal Features 被引量:7

Study of Human Action Recognition Based on Improved Spatio-temporal Features
原文传递
导出
摘要 Most of the exist action recognition methods mainly utilize spatio-temporal descriptors of single interest point while ignoring their potential integral information, such as spatial distribution information. By combining local spatio-temporal feature and global positional distribution information(PDI) of interest points, a novel motion descriptor is proposed in this paper. The proposed method detects interest points by using an improved interest point detection method. Then, 3-dimensional scale-invariant feature transform(3D SIFT) descriptors are extracted for every interest point. In order to obtain a compact description and efficient computation, the principal component analysis(PCA) method is utilized twice on the 3D SIFT descriptors of single frame and multiple frames. Simultaneously, the PDI of the interest points are computed and combined with the above features. The combined features are quantified and selected and finally tested by using the support vector machine(SVM) recognition algorithm on the public KTH dataset. The testing results have showed that the recognition rate has been significantly improved and the proposed features can more accurately describe human motion with high adaptability to scenarios. Most of the exist action recognition methods mainly utilize spatio-temporal descriptors of single interest point while ignoring their potential integral information, such as spatial distribution information. By combining local spatio-temporal feature and global positional distribution information(PDI) of interest points, a novel motion descriptor is proposed in this paper. The proposed method detects interest points by using an improved interest point detection method. Then, 3-dimensional scale-invariant feature transform(3D SIFT) descriptors are extracted for every interest point. In order to obtain a compact description and efficient computation, the principal component analysis(PCA) method is utilized twice on the 3D SIFT descriptors of single frame and multiple frames. Simultaneously, the PDI of the interest points are computed and combined with the above features. The combined features are quantified and selected and finally tested by using the support vector machine(SVM) recognition algorithm on the public KTH dataset. The testing results have showed that the recognition rate has been significantly improved and the proposed features can more accurately describe human motion with high adaptability to scenarios.
出处 《International Journal of Automation and computing》 EI CSCD 2014年第5期500-509,共10页 国际自动化与计算杂志(英文版)
基金 supported by National Natural Science Foundation of China(No.61103123) Scientific Research Foundation for the Returned Overseas Chinese Scholars,State Education Ministry
关键词 Action recognition spatio-temporal interest points 3-dimensional scale-invariant feature transform (3D SIFT) positional distribution information dimension reduction Action recognition spatio-temporal interest points 3-dimensional scale-invariant feature transform (3D SIFT) positional distribution information dimension reduction
  • 相关文献

参考文献6

  • 1Alexandros Andre Chaaraoui,Pau Climent-Pérez,Francisco Flórez-Revuelta.Silhouette-based human action recognition using sequences of key poses[J].Pattern Recognition Letters.2013
  • 2Jose M. Chaquet,Enrique J. Carmona,Antonio Fernández-Caballero.A survey of video datasets for human action and activity recognition[J].Computer Vision and Image Understanding.2013(6)
  • 3Chih-Chung Chang,Chih-Jen Lin.LIBSVM[J]ACM Transactions on Intelligent Systems and Technology (TIST),2011(3).
  • 4Daniel Weinland,Remi Ronfard,Edmond Boyer.A survey of vision-based methods for action representation, segmentation and recognition[J].Computer Vision and Image Understanding.2010(2)
  • 5Ronald Poppe.A survey on vision-based human action recognition[J].Image and Vision Computing.2009(6)
  • 6Juan Carlos Niebles,Hongcheng Wang,Li Fei-Fei.Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words[J].International Journal of Computer Vision.2008(3)

共引文献6

同被引文献34

  • 1Wang H, Klaser A, Schmid C. Dense trajectories and mo- tion boundary descriptors for action recognition[J]. Internation- al Journal of Computer Vision, 2013, 103(1): 60-79.
  • 2Hoai M, Zisserrnan A. Improving human action recognition us- ing score distribution and ranking[M]//Lecture Notes in Com- puter Science, vol.9003. Berlin, Germany: Springer-Veflag, 2015: 3-20.
  • 3顾颖嫒.稀疏表示在基于视频的人体动作识别中的应用研究[D].北京:清华大学,2013.
  • 4Simonyan K, Zisserman A. Two-stream convolutional networks for action recognition in videos[C]//Advances in Neural Infor- mation Processing Systems. Montreal, Canada: Morgan Kauf- mann, 2014: 568-576.
  • 5Warade S, Aghav J, Claude P, et al. Real-time detection and tracking with Kinect[C]//12th IEEE International Conference on Computer and Information Technology. Piscataway, USA: IEEE, 2012: 86-89.
  • 6Yao B, Hagras H, Alhaddad M J, et al. A fuzzy logic-based system for the automation of human behavior recognition using machine vision in intelligent environments[J]. Soft Computing, 2015(19): 499-506.
  • 7朱岩,赵旭,刘允才.基于稀疏编码和局部时空特征的人体动作识别[C]//第十五届全国图象图形学学术会议论文集.北京:清华大学出版社,2010:237-241.
  • 8姬晓飞,王策,李一波.基于一种视角鲁棒性特征的人体动作识别方法研究[C]//中国自动化学会控制理论专业委员会、中国系统工程学会第三十二届中国控制会议论文集(c卷).北京:北京航空航天大学出版社,2013:3877-3881.
  • 9叶喜勇,陶霖密,王国建,等.视角无关的人体躯干动作识别[C]//第六届和谐人机环境联合学术会议(HHME2010)、第19届全国多媒体学术会议(NCMT2010)、第6届全国人机交互学术会议(CHCI2010)、第5届全国普适计算学术会议(PCC2010)论文集.2010.
  • 10李毅,孙正兴,远博,张岩.一种改进的帧差和背景减相结合的运动检测方法[J].中国图象图形学报,2009,14(6):1162-1168. 被引量:33

引证文献7

二级引证文献33

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部