Occlusion Detection Based on Optical Flow and Multiscale Context

Abstract: To improve the accuracy and robustness of motion occlusion detection under non-rigid motion and large displacements, we propose an occlusion detection method for image sequences based on optical flow and multiscale context. First, we design a multiscale context aggregation network based on dilated convolution, which captures image features over a wider range by exploiting the multiscale context of the image sequence. Second, we use a feature pyramid to build an end-to-end motion occlusion detection network that combines multiscale context with optical flow, using the optical flow to refine occlusion estimates in regions of non-rigid motion and large displacement. Finally, we construct a training loss function based on motion edges to obtain accurate motion occlusion boundaries. We compare the proposed method with representative existing methods on the MPI-Sintel and KITTI test datasets. The experimental results show that the proposed method effectively improves the accuracy and robustness of motion occlusion detection, and that it is particularly robust in difficult scenes involving non-rigid motion and large displacements.
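
The abstract does not give the network's layer configuration, so the following is only a minimal sketch of why dilated convolution gives access to "a wider range of image features". It is not the authors' architecture. The 3-tap kernel, the dilation schedule (1, 2, 4), and the function names are assumptions chosen for illustration, and the example is one-dimensional for brevity.

```python
"""Sketch: how stacked dilated convolutions widen the receptive field.

Assumptions (not from the paper): 3-tap kernels, stride 1, and the
dilation schedules used below are illustrative only.
"""


def receptive_field(kernel_size, dilations):
    """Receptive field (in samples) of stacked stride-1 dilated convolutions."""
    field = 1
    for d in dilations:
        # Each layer extends the field by (kernel_size - 1) * dilation samples.
        field += (kernel_size - 1) * d
    return field


def dilated_conv1d(signal, kernel, dilation):
    """'Valid' 1-D dilated convolution (cross-correlation form, no padding)."""
    span = (len(kernel) - 1) * dilation  # distance between first and last tap
    return [
        sum(w * signal[i + j * dilation] for j, w in enumerate(kernel))
        for i in range(len(signal) - span)
    ]


def aggregate_context(signal, kernel, dilations):
    """Apply the dilated layers one after another, as a context module would."""
    out = signal
    for d in dilations:
        out = dilated_conv1d(out, kernel, d)
    return out


if __name__ == "__main__":
    # Three ordinary 3-tap layers versus three dilated 3-tap layers.
    print(receptive_field(3, [1, 1, 1]))  # 7 samples
    print(receptive_field(3, [1, 2, 4]))  # 15 samples
    # With dilations (1, 2, 4), one output value covers all 15 inputs.
    print(aggregate_context([1] * 15, [1, 1, 1], [1, 2, 4]))
```

With the same number of layers and weights, the dilated stack covers 15 input samples instead of 7. Growing the context without adding parameters is the property that makes dilated convolution attractive when displacements between frames are large.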

Authors: FENG Cheng; ZHANG Cong-Xuan; CHEN Zhen; LI Bing; LI Ming (School of Measuring and Optical Engineering, Nanchang Hangkong University, Nanchang 330063; National Key Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190; School of Information Engineering, Nanchang Hangkong University, Nanchang 330063)

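Affiliations: School of Measuring and Optical Engineering, Nanchang Hangkong University; National Key Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences; School of Information Engineering, Nanchang Hangkong University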

Source: Acta Automatica Sinica, 2024, Issue 9, pp. 1854-1865 (12 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.

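Funding: Supported by the National Key Research and Development Program of China (2020YFC2003800), the National Natural Science Foundation of China (61866026, 61772255, 62222206), the Jiangxi Provincial Outstanding Young Talents Program (20192BCB23011), the Key Project of the Natural Science Foundation of Jiangxi Province (20202ACB214007), and the Advantage Science and Technology Innovation Team Program of Jiangxi Province (20165BCB19007).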

Keywords: image sequence; occlusion detection; deep learning; multiscale context; non-rigid motion

Classification number: TP391.41 [Automation and Computer Technology: Computer Application Technology]

