Abstract
To address the high cost of traditional action recognition algorithms for color video and the poor recognition performance caused by insufficient two-dimensional information, a new human action recognition method based on three-dimensional depth image sequences is proposed. In the temporal dimension, a Temporal Depth Model (TDM) is proposed to describe the action. Specifically, the depth image sequence is projected onto three orthogonal Cartesian planes and divided into several sub-actions; for each sub-action, the absolute differences between consecutive projected maps are accumulated to form a depth motion map that describes the dynamic features of the action. In the spatial dimension, the Spatial Pyramid Histogram of Oriented Gradient (SPHOG) is computed from the TDM to obtain the final descriptor. Finally, a Support Vector Machine (SVM) is used to classify the actions. The method was tested on two authoritative datasets, MSR Action3D and MSRGesture3D, and achieved recognition rates of 94.90% (cross-subject test) and 94.86%, respectively. The experimental results show that the method processes depth image sequences quickly, achieves a high recognition rate, and basically meets the real-time requirements of depth video sequence systems.
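The temporal part of the pipeline described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the binary side and top projections, the depth quantization (`n_bins`, `max_depth`), the equal-length split into sub-actions (`n_segments`), and all function names are assumptions made for illustration. The SPHOG encoding and SVM classification stages are not covered.

```python
import numpy as np

def project_views(depth, n_bins, max_depth):
    """Project one depth frame (H x W, 0 = background) onto three planes.

    Assumption: side and top views are binary occupancy maps over
    quantized depth; the paper may define the projections differently.
    """
    h, w = depth.shape
    ys, xs = np.nonzero(depth)
    # Quantize depth values into n_bins slices along the z axis.
    zs = (depth[ys, xs].astype(np.float64) / max_depth * n_bins).astype(int)
    zs = np.minimum(zs, n_bins - 1)
    front = depth.astype(np.float64)   # x-y plane
    side = np.zeros((h, n_bins))       # y-z plane
    side[ys, zs] = 1.0
    top = np.zeros((n_bins, w))        # z-x plane
    top[zs, xs] = 1.0
    return front, side, top

def depth_motion_maps(frames, n_bins=64, max_depth=4000.0):
    """Accumulate absolute differences between consecutive projected maps."""
    prev = project_views(frames[0], n_bins, max_depth)
    dmms = [np.zeros_like(p) for p in prev]
    for frame in frames[1:]:
        cur = project_views(frame, n_bins, max_depth)
        for acc, c, p in zip(dmms, cur, prev):
            acc += np.abs(c - p)       # motion energy for this view
        prev = cur
    return dmms                        # [front, side, top]

def temporal_depth_model(frames, n_segments=3, n_bins=64, max_depth=4000.0):
    """Split the sequence into contiguous sub-actions, one DMM triple each.

    Assumption: sub-actions are equal-length, non-overlapping segments.
    """
    if len(frames) < n_segments:
        raise ValueError("need at least one frame per sub-action")
    bounds = np.linspace(0, len(frames), n_segments + 1).astype(int)
    return [depth_motion_maps(frames[s:e], n_bins, max_depth)
            for s, e in zip(bounds[:-1], bounds[1:])]
```

Calling `temporal_depth_model(frames)` on an array of shape `(T, H, W)` returns one list of three maps per sub-action. Each map could then be passed to a HOG-style encoder before classification.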
Source
Journal of Computer Applications (《计算机应用》)
CSCD
Peking University Core Journals (北大核心)
2016, Issue 2, pp. 568-573, 579 (7 pages in total)
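Funding
Major International Cooperation Project of the National Natural Science Foundation of China (61210005)
Key Program of the National Natural Science Foundation of China (61331021)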
Keywords
action recognition
three-dimensional depth image
Histogram of Oriented Gradient (HOG)
spatio-temporal pyramid
depth motion map