Advances in Deep Learning Methods for Visual Tracking:Literature Review and Fundamentals 被引量：5

导出

摘要 Recently,deep learning has achieved great success in visual tracking tasks,particularly in single-object tracking.This paper provides a comprehensive review of state-of-the-art single-object tracking algorithms based on deep learning.First,we introduce basic knowledge of deep visual tracking,including fundamental concepts,existing algorithms,and previous reviews.Second,we briefly review existing deep learning methods by categorizing them into data-invariant and data-adaptive methods based on whether they can dynamically change their model parameters or architectures.Then,we conclude with the general components of deep trackers.In this way,we systematically analyze the novelties of several recently proposed deep trackers.Thereafter,popular datasets such as Object Tracking Benchmark(OTB)and Visual Object Tracking(VOT)are discussed,along with the performances of several deep trackers.Finally,based on observations and experimental results,we discuss three different characteristics of deep trackers,i.e.,the relationships between their general components,exploration of more effective tracking frameworks,and interpretability of their motion estimation components.

作者 Xiao-Qin Zhang Run-Hua Jiang Chen-Xiang Fan Tian-Yu Tong Tao Wang Peng-Cheng Huang

机构地区 College of Computer Science and Artificial Intelligence

出处《International Journal of Automation and computing》 EI CSCD 2021年第3期311-333,共23页 国际自动化与计算杂志（英文版）

基金 supported by National Natural Science Foundation of China(Nos.61922064 and U2033210) Zhejiang Provincial Natural Science Foundation(Nos.LR17F030001 and LQ19F020005) the Project of Science and Technology Plans of Wenzhou City(Nos.C20170008 and ZG2017016)。

关键词 Deep learning visual tracking data-invariant data-adaptive general components

分类号 TP391.41 [自动化与计算机技术—计算机应用技术] TP18 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献5

1Fu-Qiang Liu,Zong-Yi Wang.Automatic"Ground Truth"Annotation and Industrial Workpiece Dataset Generation for Deep Learning[J].International Journal of Automation and computing,2020,17(4):539-550. 被引量：2
2卢湖川,李佩霞,王栋.目标跟踪算法综述[J].模式识别与人工智能,2018,31(1):61-76. 被引量：163
3李玺,查宇飞,张天柱,崔振,左旺孟,侯志强,卢湖川,王菡子.深度学习的目标跟踪算法综述[J].中国图象图形学报,2019,24(12):2057-2080. 被引量：108
4Guo-Bing Zhou,Jianxin Wu,Chen-Lin Zhang,Zhi-Hua Zhou.Minimal Gated Unit for Recurrent Neural Networks[J].International Journal of Automation and computing,2016,13(3):226-234. 被引量：38
5Qiang Fu,Xiang-Yang Chen,Wei He.A Survey on 3D Visual Tracking of Multicopters[J].International Journal of Automation and computing,2019,16(6):707-719. 被引量：5

二级参考文献37

1侯志强,韩崇昭.视觉跟踪技术综述[J].自动化学报,2006,32(4):603-617. 被引量：255
2王亮,吴福朝.基于一维标定物的多摄像机标定[J].自动化学报,2007,33(3):225-231. 被引量：38
3Y. LeCun, L. Bottou, Y. Bengio, P. Haffner. Gradient-based learning applied to document recognition. Proceedings of the 1EEE, vol. 86, no. 11, pp. 2278-2324, 1998.
4A. Krizhevsky, I. Sutskever, G. E. Hinton. ImageNet clas- sification with deep convolutional neural networks. In Pro- ceedings of Advances in Neural Information Processing Sys- tems 25, NIPS, Lake Tahoe, Nevada, USA, pp. 1091105, 2012.
5K. Cho, B. van Merinboer, C. Gulcehre, D. Bahdanau, F. Bougares, H. Schwenk, Y. Bengio. Learning phrase repre- sentations using RNN encoder-decoder for statistical ma- chine translation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Doha, Qatar, pp. 1721734, 2014.
6I. Sutskever, O. Vinyals, Q. V. Le. Sequence to sequence learning with neural networks. In Proceedings of Advances in Neural Information Processing Systems 27, NIPS, Mon- treal, Canada, pp. 3104-3112, 2014.
7D. Bahdanau, K. Cho, Y. Bengio. Neural machine transla- tion by jointly learning to align and translate. In Interna- tional Conference on Learning Representations 2015, San Diego, USA, 2015.
8A. Graves, A. R. Mohamed, G. Hinton. Speech recogni- tion with deep recurrent neural networks. In Proceedings of International Conference on Acoustics, Speech and Sig- nal Processing, IEEE, Vancouver, Canada, pp. 6645-6649, 2013.
9K. Xu, J. L. Ba, R. Kiros, K. Cho, A. Courville, R. Salakhudinov, R. S. Zemel, Y. Bengio. Show, attend and tell: Neural image caption generation with visual atten- tion. In Proceedings of the 32nd International Conference on Machine Learning, Lille, prance, vol. 37, pp. 2048 2057, 2015.
10A. Karpathy, F. F. Li. Deep visual-semantic alignments for generating image descriptions. In Proceedings of IEEE In- ternational Conference on Computer Vision and Pattern Recognition, IEEE, Boston, USA, pp. 3128 3137, 2015.

共引文献294

1李零,杨捷,段明明.基于长短时记忆网络的电力故障维修效果情感分析[J].云南大学学报（自然科学版）,2020,42(S02):44-48. 被引量：2
2付兴武,杨哲,姜文涛.因式分解卷积运算的多尺度目标跟踪算法[J].辽宁工程技术大学学报（自然科学版）,2019,38(5):463-471.
3张兴国,周英迪,石新雨,罗霄月,顾杨旸.一种球机视频全景拼接及空间化方法[J].测绘科学,2022,47(5):203-211. 被引量：1
4马素刚,赵祥模,侯志强,王忠民,孙韩林.一种基于ResNet网络特征的视觉目标跟踪算法[J].北京邮电大学学报,2020(2):129-134. 被引量：8
5丁明远,蔡靖,周冕,薛彦兵,温显斌.跟踪状态自适应的判别式行人单目标跟踪算法研究[J].光电子．激光,2022,33(9):940-947. 被引量：1
6陈逸博.鲜花装扮迷人的巴黎[J].花卉,2000(3):34-34.
7安晓卫,崔丽菊.有限元图形的快速显示和消隐处理[J].沈阳工业学院学报,2000,19(1):12-16. 被引量：1
8李惠峰,易文峰,程晓明.基于近似动态规划的目标追踪控制算法[J].北京航空航天大学学报,2019,45(3):597-605. 被引量：3
9冯棐,吴小俊,徐天阳.基于子空间和直方图的多记忆自适应相关滤波目标跟踪算法[J].模式识别与人工智能,2018,31(7):612-624. 被引量：10
10范文兵,赵周鼎,王诗.多特征融合的自适应相关滤波跟踪算法[J].计算机工程与应用,2018,54(14):19-25. 被引量：8

同被引文献9

1Quan-shi ZHANG,Song-chun ZHU.Visual interpretability for deep learning：a survey[J].Frontiers of Information Technology & Electronic Engineering,2018,19(1):27-39. 被引量：49
2李玺,查宇飞,张天柱,崔振,左旺孟,侯志强,卢湖川,王菡子.深度学习的目标跟踪算法综述[J].中国图象图形学报,2019,24(12):2057-2080. 被引量：108
3罗元,肖航,欧俊雄.基于深度学习的目标跟踪技术的研究综述[J].半导体光电,2020,41(6):757-767. 被引量：18
4Shuo Huang,Wei Shao,Mei-Ling Wang,Dao-Qiang Zhang.fMRI-based Decoding of Visual Information from Human Brain Activity: A Brief Review[J].International Journal of Automation and computing,2021,18(2):170-184. 被引量：3
5Dong Chen,Fan Tang,Weiming Dong,Hanxing Yao,Changsheng Xu.SiamCPN:Visual tracking with the Siamese center-prediction network[J].Computational Visual Media,2021,7(2):253-265. 被引量：2
6Li-Fang Wu,Qi Wang,Meng Jian,Yu Qiao,Bo-Xuan Zhao.A Comprehensive Review of Group Activity Recognition in Videos[J].International Journal of Automation and computing,2021,18(3):334-350. 被引量：3
7Min Ren,Yun-Long Wang,Zhao-Feng He.Towards Interpretable Defense Against Adversarial Attacks via Causal Inference[J].Machine Intelligence Research,2022,19(3):209-226. 被引量：1
8Zhangdong Wang,Jiaohua Qin,Xuyu Xiang,Yun Tan,Neal N.Xiong.Criss-Cross Attentional Siamese Networks for Object Tracking[J].Computers, Materials & Continua,2022(11):2931-2946. 被引量：1
9陈旭,孟朝晖.基于深度学习的目标视频跟踪算法综述[J].计算机系统应用,2019,28(1):1-9. 被引量：23

引证文献5

1曹建荣,张玉婷,朱亚琴,武欣莹,杨红娟.基于改进MDNet的视频目标跟踪算法[J].计算机系统应用,2022,31(5):277-284. 被引量：1
2Qiongyi Zhou,Changde Du,Huiguang He.Exploring the Brain-like Properties of Deep Neural Networks:A Neural Encoding Perspective[J].Machine Intelligence Research,2022,19(5):439-455. 被引量：1
3Yang Liu,Yu-Shen Wei,Hong Yan,Guan-Bin Li,Liang Lin.Causal Reasoning Meets Visual Representation Learning: A Prospective Study[J].Machine Intelligence Research,2022,19(6):485-511. 被引量：4
4Chang Liu,Xiao-Fan Chen,Chun-Juan Bo,Dong Wang.Long-term Visual Tracking: Review and Experimental Comparison[J].Machine Intelligence Research,2022,19(6):512-530. 被引量：1
5贺泽民,曾俊涛,袁宝玺,梁德建,苗宗成.视觉跟踪技术中孪生网络的研究进展[J].液晶与显示,2024,39(2):192-204. 被引量：1

二级引证文献7

1ZHANG KeXuan,SUN QiYu,ZHAO ChaoQiang,TANG Yang.Causal reasoning in typical computer vision tasks[J].Science China(Technological Sciences),2024,67(1):105-120.
2Haitao Wang,Wei Jia.Enhance the Performance of Directional Feature-based Palmprint Recognition by Directional Response Stability Measurement[J].Machine Intelligence Research,2024,21(3):597-614.
3冯文亮,孟凡宝,余川,游安清.基于多重注意力机制与响应融合的孪生单目标跟踪算法[J].强激光与粒子束,2024,36(8):140-148. 被引量：1
4Xiangwei Kong,Shujie Liu,Luhao Zhu.Toward Human-centered XAIin Practice:A survey[J].Machine Intelligence Research,2024,21(4):740-770.
5Alhassan Mumuni,Fuseini Mumuni,Nana Kobina Gerrar.A Survey of Synthetic Data Augmentation Methods in Machine Vision[J].Machine Intelligence Research,2024,21(5):831-869.
6Yongxian Wei,Xiu-Shen Wei.Task-specific Part Discovery for Fine-grained Few-shot Classification[J].Machine Intelligence Research,2024,21(5):954-965.
7杜秀芝,许跃.基于对抗样本的负图片对分类网络的影响探究[J].黑河学院学报,2024,15(10):182-185.

1何情祖,钟传奇,李翔,帅建伟,韩家淮.数据不依赖获取的质谱数据的深度学习分析方法[J].厦门大学学报（自然科学版）,2021,60(1):97-103. 被引量：1
2朱均安,陈涛,曹景太.基于显著性区域加权的相关滤波目标跟踪[J].光学精密工程,2021,29(2):363-373. 被引量：4
3周岳,杨湘云,黄敏,李想,关锋.Orbitrap Exploris 480质谱在定量蛋白质组学应用中的优化和评测[J].生物化学与生物物理进展,2021,48(2):214-226. 被引量：2
4周岳,黄敏,李想,关锋.3种轨道阱质谱仪数据非依赖性扫描的定量分析蛋白质组学性能评测[J].分析化学,2021,49(5):820-829. 被引量：2
5Dong Chen,Fan Tang,Weiming Dong,Hanxing Yao,Changsheng Xu.SiamCPN:Visual tracking with the Siamese center-prediction network[J].Computational Visual Media,2021,7(2):253-265. 被引量：2
6Wen Huang,Xuwen Xia,Chen Zhu,Parker Steichen,Weidong Quan,Weiwei Mao,Jianping Yang,Liang Chu,Xing’ao Li.Memristive Artificial Synapses for Neuromorphic Computing[J].Nano-Micro Letters,2021,13(5):218-245. 被引量：8
7Azidine Guezzaz,Younes Asimi,Mourade Azrour,Ahmed Asimi.Mathematical Validation of Proposed Machine Learning Classifier for Heterogeneous Traffic and Anomaly Detection[J].Big Data Mining and Analytics,2021,4(1):18-24. 被引量：4
8Chandra Sekhar Bhusal.Systematic Review on Social Engineering: Hacking by Manipulating Humans[J].Journal of Information Security,2021,12(1):104-114.
9王献海,宋慧慧,张开华,刘青山.增强二阶网络调制的目标跟踪[J].中国图象图形学报,2021,26(3):516-526. 被引量：2
10Adam J.Hepworth,Daniel P.Baxter,Aya Hussein,Kate J.Yaxley,Essam Debie,Hussein A.Abbass.Human-Swarm-Teaming Transparency and Trust Architecture[J].IEEE/CAA Journal of Automatica Sinica,2021,8(7):1281-1295. 被引量：1

International Journal of Automation and computing

2021年第3期

浏览历史

内容加载中请稍等...