
融合多姿势估计特征的动作识别 (Cited by: 5)

Fusing multiple pose estimations for still image action recognition
Abstract Objective: To improve the accuracy and robustness of still-image action recognition under occlusion and other complex conditions, a method is proposed that fuses the feature information obtained from multiple pose estimations. Method: An arbitrary image is pose-estimated under several pre-trained action models, yielding multiple groups of pose features; each group contains key-point positions and a pose score. For each action, the distinguishing key points of all training images are extracted, and the relative distances between those key points are computed for every image; the features of all training images of an action together form that action's template. A test image is pose-estimated under the multiple action models, and from each group of pose features the information consistent with the corresponding template is extracted. The extracted feature groups are matched against their respective templates, the match values are optimized using the pose scores, and the final match values determine the action class. Result: On two datasets, the proposed method was compared with five popular action recognition methods and achieved better average accuracy: at least about 2% higher than other recent state-of-the-art methods on PASCAL VOC 2011-val, and at least about 6% higher on Stanford 40 Actions. Conclusion: By fusing multiple pose features and capturing occlusion information for key body parts, the method copes well with occlusion and other complex conditions and attains a high average recognition accuracy.
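The classification step described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the action names, the use of pairwise key-point distances as the feature vector, and the exponential similarity used as the match value are all assumptions made for the example; the paper's actual matching and score-optimization details are not given in this record.

```python
import numpy as np

def relative_distances(keypoints):
    """Pairwise Euclidean distances between distinguishing key points.

    keypoints: (K, 2) array of (x, y) positions from a pose estimator.
    Returns a flat vector of the K*(K-1)/2 upper-triangle distances.
    """
    diffs = keypoints[:, None, :] - keypoints[None, :, :]
    dists = np.linalg.norm(diffs, axis=-1)
    rows, cols = np.triu_indices(len(keypoints), k=1)
    return dists[rows, cols]

def classify(pose_estimates, templates):
    """Match each action model's pose estimate against that action's template.

    pose_estimates: {action: (keypoints, pose_score)}, one estimate per model.
    templates:      {action: relative-distance vector built from training images}.
    The match value is weighted by the pose score; the best value wins.
    """
    best_action, best_value = None, -np.inf
    for action, (keypoints, score) in pose_estimates.items():
        feature = relative_distances(keypoints)
        # Hypothetical match value: exp(-L2 distance) lies in (0, 1].
        match = np.exp(-np.linalg.norm(feature - templates[action]))
        value = score * match  # pose score optimizes the match value
        if value > best_value:
            best_action, best_value = action, value
    return best_action
```

Usage mirrors the pipeline in the abstract: build one template per action from training images, run every action model's pose estimator on the test image, then call `classify` with the resulting estimates.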
Source: Journal of Image and Graphics (《中国图象图形学报》), CSCD, Peking University Core Journal, 2015, Issue 11, pp. 1462-1472 (11 pages).
Funding: National Natural Science Foundation of China (61105042, 61462035); National Basic Research Program of China (973 Program, 2010CB327900); Jiangxi Province Young Scientists ("Jinggang Star") Cultivation Program.
Keywords: action recognition; multiple pose estimations; template matching; occlusion

