Study of Human Action Recognition Based on Improved Spatio-temporal Features 被引量：7

Study of Human Action Recognition Based on Improved Spatio-temporal Features

导出

摘要 Most of the exist action recognition methods mainly utilize spatio-temporal descriptors of single interest point while ignoring their potential integral information, such as spatial distribution information. By combining local spatio-temporal feature and global positional distribution information(PDI) of interest points, a novel motion descriptor is proposed in this paper. The proposed method detects interest points by using an improved interest point detection method. Then, 3-dimensional scale-invariant feature transform(3D SIFT) descriptors are extracted for every interest point. In order to obtain a compact description and efficient computation, the principal component analysis(PCA) method is utilized twice on the 3D SIFT descriptors of single frame and multiple frames. Simultaneously, the PDI of the interest points are computed and combined with the above features. The combined features are quantified and selected and finally tested by using the support vector machine(SVM) recognition algorithm on the public KTH dataset. The testing results have showed that the recognition rate has been significantly improved and the proposed features can more accurately describe human motion with high adaptability to scenarios. Most of the exist action recognition methods mainly utilize spatio-temporal descriptors of single interest point while ignoring their potential integral information, such as spatial distribution information. By combining local spatio-temporal feature and global positional distribution information(PDI) of interest points, a novel motion descriptor is proposed in this paper. The proposed method detects interest points by using an improved interest point detection method. Then, 3-dimensional scale-invariant feature transform(3D SIFT) descriptors are extracted for every interest point. In order to obtain a compact description and efficient computation, the principal component analysis(PCA) method is utilized twice on the 3D SIFT descriptors of single frame and multiple frames. Simultaneously, the PDI of the interest points are computed and combined with the above features. The combined features are quantified and selected and finally tested by using the support vector machine(SVM) recognition algorithm on the public KTH dataset. The testing results have showed that the recognition rate has been significantly improved and the proposed features can more accurately describe human motion with high adaptability to scenarios.

作者 Xiao-Fei Ji Qian-Qian Wu Zhao-Jie Ju Yang-Yang Wang

机构地区 School of Automation School of Computing

出处《International Journal of Automation and computing》 EI CSCD 2014年第5期500-509,共10页 国际自动化与计算杂志（英文版）

基金 supported by National Natural Science Foundation of China(No.61103123) Scientific Research Foundation for the Returned Overseas Chinese Scholars,State Education Ministry

关键词 Action recognition spatio-temporal interest points 3-dimensional scale-invariant feature transform (3D SIFT) positional distribution information dimension reduction Action recognition spatio-temporal interest points 3-dimensional scale-invariant feature transform (3D SIFT) positional distribution information dimension reduction

分类号 TP391.41 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献6

1Alexandros Andre Chaaraoui,Pau Climent-Pérez,Francisco Flórez-Revuelta.Silhouette-based human action recognition using sequences of key poses[J].Pattern Recognition Letters.2013
2Jose M. Chaquet,Enrique J. Carmona,Antonio Fernández-Caballero.A survey of video datasets for human action and activity recognition[J].Computer Vision and Image Understanding.2013(6)
3Chih-Chung Chang,Chih-Jen Lin.LIBSVM[J]ACM Transactions on Intelligent Systems and Technology (TIST),2011(3).
4Daniel Weinland,Remi Ronfard,Edmond Boyer.A survey of vision-based methods for action representation, segmentation and recognition[J].Computer Vision and Image Understanding.2010(2)
5Ronald Poppe.A survey on vision-based human action recognition[J].Image and Vision Computing.2009(6)
6Juan Carlos Niebles,Hongcheng Wang,Li Fei-Fei.Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words[J].International Journal of Computer Vision.2008(3)

共引文献6

1魏丽娟,李咏,陈渊,袁哲明.三代棉铃虫幼虫发生量的非线性分析——以河北、山西等六省为例[J].湖南农业科学,2013(11):62-65.
2徐勤军,吴镇扬.视频序列中的行为识别研究进展[J].电子测量与仪器学报,2014,28(4):343-351. 被引量：20
3马洪亮,王伟,韩臻.基于JavaScript的轻量级恶意网页异常检测方法[J].华中科技大学学报（自然科学版）,2014,42(11):34-38. 被引量：8
4陈蔼祥,陈智锋.ADST:用机器学习方法鉴别结节病和肺结核[J].计算机科学,2014,41(S1):103-109. 被引量：5
5刘广东.基于支持向量机的地面驱动螺杆泵井工况诊断技术[J].排灌机械工程学报,2014,32(2):125-129. 被引量：5
6刘笑楠,苑玮琦,张波.一种虹膜色素块检测与分类方法[J].沈阳工业大学学报,2014,36(6):688-693. 被引量：1

同被引文献34

1Wang H, Klaser A, Schmid C. Dense trajectories and mo- tion boundary descriptors for action recognition[J]. Internation- al Journal of Computer Vision, 2013, 103(1): 60-79.
2Hoai M, Zisserrnan A. Improving human action recognition us- ing score distribution and ranking[M]//Lecture Notes in Com- puter Science, vol.9003. Berlin, Germany: Springer-Veflag, 2015: 3-20.
3顾颖嫒.稀疏表示在基于视频的人体动作识别中的应用研究[D].北京:清华大学,2013.
4Simonyan K, Zisserman A. Two-stream convolutional networks for action recognition in videos[C]//Advances in Neural Infor- mation Processing Systems. Montreal, Canada: Morgan Kauf- mann, 2014: 568-576.
5Warade S, Aghav J, Claude P, et al. Real-time detection and tracking with Kinect[C]//12th IEEE International Conference on Computer and Information Technology. Piscataway, USA: IEEE, 2012: 86-89.
6Yao B, Hagras H, Alhaddad M J, et al. A fuzzy logic-based system for the automation of human behavior recognition using machine vision in intelligent environments[J]. Soft Computing, 2015(19): 499-506.
7朱岩,赵旭,刘允才.基于稀疏编码和局部时空特征的人体动作识别[C]//第十五届全国图象图形学学术会议论文集.北京:清华大学出版社,2010:237-241.
8姬晓飞,王策,李一波.基于一种视角鲁棒性特征的人体动作识别方法研究[C]//中国自动化学会控制理论专业委员会、中国系统工程学会第三十二届中国控制会议论文集(c卷).北京:北京航空航天大学出版社,2013:3877-3881.
9叶喜勇,陶霖密,王国建,等.视角无关的人体躯干动作识别[C]//第六届和谐人机环境联合学术会议(HHME2010)、第19届全国多媒体学术会议(NCMT2010)、第6届全国人机交互学术会议(CHCI2010)、第5届全国普适计算学术会议(PCC2010)论文集.2010.
10李毅,孙正兴,远博,张岩.一种改进的帧差和背景减相结合的运动检测方法[J].中国图象图形学报,2009,14(6):1162-1168. 被引量：33