Motion Enhanced Model Based on High-Level Spatial Features

Abstract: Action recognition has become a research hotspot in computer vision. Compared with other deep learning methods, the two-stream convolutional network achieves better performance in action recognition: it divides the network into a spatial stream and a temporal stream, feeding video frame images and dense optical flow into the two streams, respectively, to obtain category labels. However, the two-stream network has a notable drawback: it uses dense optical flow as the input of the temporal stream, and current optical flow extraction algorithms are computationally expensive and extremely time-consuming, so they cannot meet the requirements of real-time tasks. In this paper, instead of dense optical flow, Motion Vectors (MVs) extracted from the compressed domain are used as temporal features, which greatly reduces extraction time. However, the motion pattern that MVs contain is coarser, which leads to lower accuracy. We propose two strategies to improve accuracy: first, an accumulation strategy is used to enhance the motion information and continuity of MVs; second, knowledge distillation is used to fuse spatial information into the temporal stream so that more information (e.g., motion details, colors) becomes available. Experimental results show that the accuracy of the MV stream is greatly improved by the proposed strategies, and the final human action recognition accuracy is maintained without using optical flow.
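
The abstract names two concrete strategies: accumulating motion vectors (MVs) to strengthen motion continuity, and distilling knowledge from the spatial (RGB) stream into the MV-based temporal stream. The minimal PyTorch sketch below illustrates both ideas under stated assumptions; the cumulative-sum accumulation rule, the temperature, and the loss weighting are illustrative choices, not the authors' implementation.

```python
# Illustrative sketch only (assumed details, not the paper's code):
# - accumulate_motion_vectors: one plausible accumulation rule (running sum).
# - distillation_loss: standard soft-target distillation so the MV (temporal)
#   student learns from the spatial-stream teacher.
import torch
import torch.nn.functional as F

def accumulate_motion_vectors(mv_frames: torch.Tensor) -> torch.Tensor:
    """mv_frames: (T, 2, H, W) horizontal/vertical MV components per frame.
    Returns a tensor of the same shape holding the running (accumulated) motion,
    which carries longer-range, more continuous movement than raw per-frame MVs."""
    return torch.cumsum(mv_frames, dim=0)

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature: float = 4.0, alpha: float = 0.5):
    """Blend hard-label cross-entropy with a soft-target KL term so spatial-stream
    knowledge (teacher) is fused into the MV-based temporal stream (student)."""
    hard = F.cross_entropy(student_logits, labels)
    soft = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=1),
        F.softmax(teacher_logits / temperature, dim=1),
        reduction="batchmean",
    ) * (temperature ** 2)
    return alpha * hard + (1.0 - alpha) * soft
```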
Source: Computers, Materials & Continua (SCIE, EI), 2022, No. 12, pp. 5911-5924 (14 pages).
Funding: This work is supported by the Inner Mongolia Natural Science Foundation of China under Grant No. 2021MS06016 and the CERNET Innovation Project (NGII20190625).