矩阵低秩表示的目标跟踪算法

Object tracking algorithm based on matrix low-rank representation

导出

摘要目的目标跟踪中,遮挡、强烈光照及运动模糊等干扰对跟踪精度的影响较大,其为目标外观的观测建模精度带来一定的困难。此外,很多现有算法在观测建模中都以向量形式表示样本数据,使得样本数据原有结构及其各像素的潜在关系被有意改变,从而导致观测模型数据维度及计算复杂度的提高。方法本文通过深入研究跟踪框架的观测建模问题,提出一种新颖的基于矩阵低秩表示的观测建模方法及其相应的似然度测度函数,使得跟踪算法能够充分挖掘样本数据的潜在特征结构,从而更加精确探测目标在遮挡或强烈光照等各种复杂干扰下的外观变化。同时,以矩阵形式表述样本信号的数据格式,使得其视觉特征的空间分布保留完好,并有效降低数据维度和计算复杂度。结果本文跟踪算法在富有挑战性干扰因素的跟踪环境中体现出更为鲁棒的跟踪性能,能够较好地解决跟踪中遮挡或强烈光照所引起的模型退化和漂移等问题。在10个经典测试视频中,本文跟踪算法的平均中心点误差为5.29像素,平均跟踪重叠率为78%,平均跟踪成功率为98.28%,均优于其他同类算法。结论本文以2维矩阵数据原型为载体,提出了一种新的多任务观测建模框架和最大似然度估计模型。实验数据的定性与定量分析结果表明,本文算法与一些优秀的同类算法相比,其跟踪建模精度达到相同甚至更高的水平。 Objective Visual object tracking is a significant computer vision task that can be applied to many domains,such as military,robotics,intelligent visual surveillance,human-computer interaction,and medical diagnosis. A large variety of trackers that have been proposed in the literature in the past decades have delivered satisfactory performances. Despite the success of researching on this topic,visual object tracking still suffers from difficulties in handling complex object appearance changes caused by factors such as illumination,partial occlusion,shape deformation,background clutter,low contrast,specularities,camera motion,and at least seven more aspects. Generally,visual tracking is a search（or classification） problem that continuously infers the state of a target in video sequences,aims to identify the candidate while it matches to the target template accurately,and returns it as a tracking result. Constructing an effective and high-performance tracker has two core issues. The first is the issue of representative feature learning and high-level modeling. The second is the problem of filtering and efficient searching. Given that the target states in every video frame are represented using several online learned feature templates,the modeling capability of the tracker will significantly depend on the generalizability of template data and accurate model representation with error estimation precision because of the complex interference factors caused by the target itself or the scene conditions. In addition,the relationship between each data pixel is significantly damaged while its original data structures are being changed because the sample data are intentionally forced into vector form in most existing algorithms. Moreover,the computational complexity with high data dimensionality must be increased.Therefore,designing an effective model representation mechanism of the 2D appearance of moving objects with the appropriate data expression is the key issue for the success of a visual tracker. Method In this study,the appearance model representation problem of generative-model-based visual object tracking algorithm is investigated in depth. In a prior work,we formulated the observation model via tensor（3D array） nuclear norm regularization. The tracker is called tensor nuclear norm regression-based tracker（TNRT） and has achieved favorable results in many tracking environments. However,the TNRT requires high hardware conditions and graphics processing unit computing demands,which will lead to slow tracking speeds if some practical uses require low hardware conditions. Therefore,we redesign a novel matrix low-rank representation-based observation model and its corresponding likelihood measurement function,as well as maintain several good properties of the TNRT algorithm,such as multitask joint learning,nuclear norm regularization-based model representation,and original data structures of sample signals. In the proposed tracking framework,several critical feature templates（dictionary or subspace） are learned from online data using the incremental principal component analysis algorithm. Then,in accordance with the appearance information of an incoming video frame,the proposed appearance modeling mechanism will use the feature templates to represent the target candidate linearly with independent and identically distributed Gaussian-Laplacian mixture noise by adopting the multitask joint learning strategy. Subsequently,the matrix nuclear norm and weighted L_1-norm-based joint maximum likelihood function measure the distances between target candidates and feature subspace scrupulously. Given that the intrinsic data structures of samples are guaranteed using the matrix form and the spatial distributions of visual features remain intact,the proposed multitask observation modeling via matrix low-rank regularizationbased objective function will construct more accurate and flexible sample signals than L_1,L_2,or other hybrid regularizationbased model representation methods. Then,in every frame,the identical likelihood measurement function of our algorithm measures each candidate sample with obvious comparability. Finally,the tracker is able to explore the potential characteristics of the sample data fully and further detect the complex appearance changes of the target with some challenging disturbances,such as occlusion or strong illuminations. Meanwhile,the observation model,which formulates matrix-form-based data prototypes,can improve the tracking speed remarkably with its distinctly reduced data dimensionality and low computational complexity. Result Although the pixels of residual data always show similar grayscale intensities and share some spatial information with 2D data prototypes,such as block-shaped linking areas,the conventional observation model using L_1,L_2,or other hybrid regularization-based model representation methods cannot fully examine the potential structure of residual data. In comparison to these traditional methods,the matrix low-rank regression model（MLRM） more precisely explores the residual data and further detects the spatial characteristics of reconstruction error. In other words,the MLRM significantly discovers the low-rank characteristics of the residual matrix. In this study,we aim to evaluate our proposed tracking algorithm systematically and experimentally on 10 public video fragments that cover the previously mentioned challenging noisy factors and compare it with several state-of-the-art algorithms commonly cited in influential literature. We indicate that each tracker can be evaluated objectively using survival curves,such as average center point error（ACE）,average overlap rate（AOR）,and average success rate（ASR）. Our tracking algorithm reflects the favorable robustness in these noisy environments and obtains the best results in each video sequence,with ACE,AOR,and ASR of 5.29 pixels, 78%, and 98.28%,respectively. Conclusion In this study,a novel multitask matrix low-rank model representation method and its corresponding maximum likelihood estimation function are designed. The analysis of a large variety of circumstances in several public video sequences provides objective insight into the strengths and weaknesses of each tracker. The appearance modeling mechanism and maximum likelihood estimation function of the proposed MLRM algorithm play critical roles and achieve favorable tracking results in several challenging video sequences. Qualitative and quantitative experimental evaluations of a number of challenging noisy environments indicate that the proposed MLRM algorithm can reflect the best robustness to elevate the model degradation or drifting problem caused by occlusion and strong illumination and can achieve the same or even better results when compared with several state-of-the-art algorithms.

作者亚森江.木沙木合塔尔.克力木 Yasin Musa;Muhtar Kerim(School of Mechanical Engineering, Xinjiang University, Urumqi 830046, Chin)

机构地区新疆大学机械工程学院

出处《中国图象图形学报》 CSCD 北大核心 2018年第5期674-687,共14页 Journal of Image and Graphics

基金国家自然科学基金项目(51365052)~~

关键词数据原型矩阵低秩表示多任务观测建模似然度估计目标跟踪 data prototypes matrix low-rank representation multi-task observation modeliug likelihood estimation ob-ject tracking

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献3

1王海军,葛红娟,张圣燕.在线低秩表示的目标跟踪算法[J].西安电子科技大学学报,2016,43(5):98-104. 被引量：4
2陈芸,吴飞,荆晓远.鲁棒低秩稀疏表示的在线目标跟踪[J].计算机工程与设计,2016,37(4):1062-1066. 被引量：4
3亚森江·木沙,木合塔尔·克力木,赵春霞.张量核范数回归的目标跟踪[J].中国图象图形学报,2016,21(6):781-795. 被引量：1

二级参考文献44

1Ross D A,Lim J,Lin R S,et al.Incremental learning for robust visual tracking[J].International Journal of Computer Vision,2008,77(1-3):125-141.
2Babenko B,Yang M H,Belongie S.Robust object tracking with online multiple instance learning[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2011,33(8):1619-1632.
3Mei X,Ling H,Wu Y,et al.Minimum error bounded efficient L1tracker with occlusion detection[C]//IEEE International Conference on Computer Vision and Pattern Recognition.Colorado,USA IEEE,2011:1257-1264.
4Mei X,Ling H.Robust visual tracking and vehicle classification via sparse representation[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2011,33(11):2259-2272.
5Zhang T,Ghanem B,Liu S,et al.Low-rank sparse learning for robust visual tracking[C]//Proceedings of the 12th European Conference on Computer Vision.Florence:Springer,2012:470-484.
6Candès E J,Li X,Ma Y,et al.Robust principal component analysis?[J].Journal of the ACM,2011,58(3):11.
7Peng Y,Ganesh A,Wright J,et al.Robust alignment by sparse and low-rank decomposition for linearly correlated images[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2012,34(11):2233-2246.
8Zhang Z,Ganesh A,Liang X,et al.TILT:Transform-invariant low-rank textures[J].International Journal of Computer Vision,2012,99(1):1-24.
9Lin Z,Chen M,Ma Y.The augmented Lagrange multiplier method for exact recovery of corrupted low-rank matrix[J].arXiv preprint arXiv:1009.5055,2010.
10Wang D,Lu H,Yang M H.Online object tracking with sparse prototypes[J].IEEE Transactions on Image Processing,2013,22(1):314-325.

共引文献6

1孔繁锵,王丹丹,沈秋,卞陈鼎,严小乐.在线低秩稀疏表示的鲁棒视觉跟踪[J].工程科学与技术,2017,49(4):151-157. 被引量：2
2宫海洋,任红格,史涛,李福进.基于改进粒子滤波的稀疏子空间单目标跟踪算法[J].现代电子技术,2018,41(13):10-13. 被引量：5
3李新卫,吴飞,荆晓远.基于协同矩阵分解的单标签跨模态检索[J].计算机技术与发展,2018,28(11):99-102.
4王海军,张圣燕.自适应权值卷积特征的鲁棒目标跟踪算法[J].西安电子科技大学学报,2019,46(1):117-123. 被引量：7
5李福进,李军,宫海洋.基于稀疏子空间的卷积神经网络目标跟踪[J].中国测试,2019,45(7):122-127.
6李长江,肖文显,王俊阁.基于相似度优化的混合式视觉跟踪方法[J].太赫兹科学与电子信息学报,2022,20(11):1198-1204.

1代翔.张量分解及其在推荐系统中的应用[J].信息与电脑,2016,28(22):34-37. 被引量：1
2马秀峰,郭顺利,宋凯.基于LDA主题模型的“内容-方法”共现分析研究——以情报学领域为例[J].情报科学,2018,36(4):69-74. 被引量：15
3曹太云.我国服务业发展因素分析[J].北方经贸,2018(4):17-18. 被引量：1
4王婧,朱虹.基于多模态词典学习的目标跟踪算法[J].西安理工大学学报,2017,33(3):259-264.
5王海霞,周迎春.初中生参加课外体育活动内部动机的调查与分析[J].武夷学院学报,2017,36(12):83-88. 被引量：1
6金鼎林.量子力学教程中一个习题的纠正[J].承德民族师专学报,1999,19(2):71-72.
7杨嘉骏,沙征,张嵩,冯文俊,胡海敏,蔡斌,施俊,左亚南.配电站定位系统电子地图设计研究[J].电力与能源,2018,39(2):206-208. 被引量：3
8曹健,秦荣环,孙会清,毕长泉.基于Hadoop的高校图书馆数字资源整合利用研究[J].图书馆工作与研究,2018(3):74-78. 被引量：27
9吴俊绒.数字图书馆数据查询影响因子及用户兴趣模型设计[J].微型电脑应用,2018,34(3):53-54. 被引量：1
10陈玉芳,白雪彬,杨清辉.城市河道整治中生态护坡技术的运用[J].内蒙古水利,2018(3):63-64. 被引量：6

中国图象图形学报

2018年第5期

浏览历史

内容加载中请稍等...

矩阵低秩表示的目标跟踪算法

参考文献3

二级参考文献44

共引文献6

相关作者

相关机构

相关主题

浏览历史