Funding: Supported by the National Natural Science Foundation of China (Grant No. 62005049), the Natural Science Foundation of Fujian Province (Grant Nos. 2020J01451, 2022J05113), and the Education and Scientific Research Program for Young and Middle-aged Teachers in Fujian Province (Grant No. JAT210035).
Abstract: Camouflaged people are highly skilled at actively concealing themselves by exploiting cover and the surrounding environment. Despite advances in optical detection through spectral, polarization, and infrared imaging systems, there is still no effective real-time method for accurately detecting small, well-camouflaged people in complex real-world scenes. This study proposes a snapshot multispectral image-based camouflaged-people detection model, multispectral YOLO (MS-YOLO), which uses the SPD-Conv and SimAM modules to represent targets effectively and suppress background interference by exploiting spatial-spectral target information. The study also constructs the first real-shot multispectral camouflaged people dataset (MSCPD), which covers diverse scenes, target scales, and poses. To minimize information redundancy, MS-YOLO takes as input an optimal subset of 12 bands with strong feature representation and minimal inter-band correlation. In experiments on the MSCPD, MS-YOLO achieves a mean Average Precision of 94.31% and real-time detection at 65 frames per second, which confirms the effectiveness and efficiency of the method in detecting camouflaged people in typical desert and forest scenes. The approach offers valuable support for improving the perception capabilities of unmanned aerial vehicles when detecting enemy forces and rescuing personnel on the battlefield.
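The band-selection idea in the abstract above (few bands, low inter-band correlation) can be illustrated with a minimal sketch. The abstract does not say how MS-YOLO chooses its 12 bands, so the greedy rule below is an assumption for illustration only: it starts from the highest-variance band and repeatedly adds the band whose largest absolute Pearson correlation with the already chosen bands is smallest. The names `pearson` and `select_bands` are hypothetical.

```python
import math


def pearson(a, b):
    """Pearson correlation between two equal-length sequences of pixel values."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0  # a constant band carries no correlation information
    return cov / math.sqrt(var_a * var_b)


def variance(values):
    """Population variance, used here as a crude proxy for information content."""
    mean = sum(values) / len(values)
    return sum((x - mean) ** 2 for x in values) / len(values)


def select_bands(bands, k):
    """Greedy selection of k band indices (assumed heuristic, not the paper's method).

    bands: list of flattened bands, each a list of pixel values.
    """
    remaining = list(range(len(bands)))
    first = max(remaining, key=lambda i: variance(bands[i]))
    selected = [first]
    remaining.remove(first)
    while remaining and len(selected) < k:
        # Pick the band least correlated with everything chosen so far.
        best = min(
            remaining,
            key=lambda i: max(abs(pearson(bands[i], bands[j])) for j in selected),
        )
        selected.append(best)
        remaining.remove(best)
    return sorted(selected)


if __name__ == "__main__":
    toy_bands = [[1, 2, 3, 4], [2, 4, 6, 8], [1, -1, 1, -1], [4, 3, 2, 1]]
    print(select_bands(toy_bands, 2))
```

In the toy data, bands 0 and 3 are perfectly (anti-)correlated with band 1, so the sketch keeps band 1 and the weakly correlated band 2.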
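Abstract: Open-pit mining scenes are complex, and driving-obstacle detection is severely disturbed by dust noise such as raised dust and particulate matter. This makes obstacles hard to identify accurately, especially at night when lighting is poor, which hinders correct decision-making and affects the safety and overall efficiency of unmanned operations. To address these problems, an obstacle detection algorithm for open-pit mine driving based on the YOLOv8n model, YOLOv8n-Enhanced, is proposed. The algorithm is improved in three respects. First, to cope with severe dust-noise interference and insufficient light at night, a C2fCA module structure is proposed, which improves the feature extraction capability of the model. Second, the lightweight convolution technique GSConv and the VoV-GSCSP module are used to reduce model complexity and make the detector more computationally cost-effective. Finally, the WIOU loss function is used to improve the generalization ability of the model. Experimental results show that, while maintaining real-time performance, the improved algorithm raises the mean Average Precision (mAP) of YOLOv8n by 1.8% and 2.6%, respectively, and recognizes obstacles of different scales in daytime and nighttime scenes.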
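Abstract: To solve the problem that current visual SLAM (simultaneous localization and mapping) algorithms suffer localization failure in complex traffic environments with scarce nearby texture and occlusion by dynamic objects, a visual SLAM algorithm based on key targets is proposed. First, taking as a basis the static targets, such as traffic signals and signs, detected by environment perception algorithms for typical traffic scenes, features are extracted from the static targets and key targets are selected. Second, key targets are matched between adjacent frames using their categories and geometric parameters. Then, the SLAM system is initialized and tracked on the basis of the key targets, and the current camera pose is solved by minimizing the reprojection error. Finally, the camera pose and the three-dimensional coordinates of the key targets are jointly optimized in the local mapping thread and updated in the local map. Experiments verify that the proposed algorithm effectively resolves localization failure in environments lacking nearby texture, maintains high localization accuracy, and adapts well to different environments.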
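The pose-solving step in the abstract above minimizes a reprojection error. As a minimal sketch of the quantity being minimized, the code below evaluates the summed squared reprojection error for a candidate pose under a plain pinhole camera model. The pinhole model without distortion, the parameter names (`fx`, `fy`, `cx`, `cy`), and the function names are assumptions for illustration; the abstract does not specify the camera model or the optimizer.

```python
def project(point, rotation, translation, fx, fy, cx, cy):
    """Project a 3D world point to pixel coordinates with a pinhole model."""
    # Transform the point into the camera frame: X_c = R * X_w + t.
    cam = [
        sum(rotation[r][c] * point[c] for c in range(3)) + translation[r]
        for r in range(3)
    ]
    x, y, z = cam
    if z <= 0:
        raise ValueError("point is behind the camera")
    return (fx * x / z + cx, fy * y / z + cy)


def reprojection_error(points, observations, rotation, translation, fx, fy, cx, cy):
    """Sum of squared pixel distances between projected and observed points."""
    total = 0.0
    for point, (u_obs, v_obs) in zip(points, observations):
        u, v = project(point, rotation, translation, fx, fy, cx, cy)
        total += (u - u_obs) ** 2 + (v - v_obs) ** 2
    return total


if __name__ == "__main__":
    identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
    landmarks = [(0.0, 0.0, 2.0), (1.0, 0.0, 2.0)]
    pixels = [(50.0, 40.0), (100.0, 40.0)]
    # A pose shifted by 0.1 along x no longer explains the observations.
    print(reprojection_error(landmarks, pixels, identity, (0.1, 0.0, 0.0),
                             100.0, 100.0, 50.0, 40.0))
```

A pose solver would search over `rotation` and `translation` to drive this value down; the search itself (for example Gauss-Newton or Levenberg-Marquardt) is omitted here.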
Abstract: To detect small, dim infrared targets rapidly and accurately in infrared images of complex scenes collected by a virtual prototype of space-based downward-looking multiband detection, this paper proposes an improved detection algorithm for small, dim infrared targets. First, the original infrared images are converted into a new infrared patch-tensor model through data reconstruction. Then, based on the characteristics of the patch tensor, the detection problem is converted into a low-rank tensor recovery problem based on the tensor nuclear norm, and an inverse-variance-weighted entropy is defined to adjust sparsity adaptively. Finally, the noisy low-rank tensor recovery problem is solved by the alternating direction method to obtain the sparse target image, and the final small target is extracted by a simple segmentation algorithm. Test results in various space-based downward-looking complex scenes show that the method suppresses complex backgrounds well and runs fast, with a high detection probability and a low false alarm rate. It is a well-performing detection method for small, dim infrared targets.
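The first step described above, rearranging an image into a patch tensor, can be sketched as follows. The patch size and stride are illustrative assumptions, since the abstract gives neither. The sketch also includes the scalar soft-thresholding operator, which is the shrinkage step commonly used for the sparse component in alternating-direction solvers; whether the paper uses exactly this operator alongside its entropy weighting is an assumption.

```python
def build_patch_tensor(image, patch_size, stride):
    """Stack sliding-window patches of a 2D image into a list of 2D patches.

    The result has shape (number of patches, patch_size, patch_size).
    """
    rows, cols = len(image), len(image[0])
    patches = []
    for top in range(0, rows - patch_size + 1, stride):
        for left in range(0, cols - patch_size + 1, stride):
            patch = [row[left:left + patch_size]
                     for row in image[top:top + patch_size]]
            patches.append(patch)
    return patches


def soft_threshold(value, tau):
    """Shrink a value toward zero by tau; values within [-tau, tau] become 0."""
    if value > tau:
        return value - tau
    if value < -tau:
        return value + tau
    return 0.0


if __name__ == "__main__":
    # 4x4 toy image with values 0..15.
    image = [[r * 4 + c for c in range(4)] for r in range(4)]
    tensor = build_patch_tensor(image, patch_size=2, stride=2)
    print(len(tensor), tensor[0], tensor[-1])
    print(soft_threshold(5.0, 2.0), soft_threshold(-1.0, 2.0))
```

In a low-rank plus sparse decomposition, small targets are expected to end up in the sparse component, which is why a shrinkage step of this kind matters for detection.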
Funding: Funded by the Basic Science Major Foundation (Natural Science) of the Jiangsu Higher Education Institutions of China (Grant: 22KJA520012), the Xuzhou Science and Technology Plan Project (Grant: KC21303, KC22305), and the sixth "333 project" of Jiangsu Province.
Abstract: A single correlation filter model is sensitive to complex scenes involving background interference and occlusion. To address this, a tracking algorithm based on multi-time-space perception and instance-specific proposals is proposed to optimize the mathematical model of the correlation filter (CF). First, based on the consistency of the changes between the object frames and the filter frames, a mask matrix is introduced into the objective function of the filter to extract background-aware spatio-temporal information about the object. Second, an objective function with multi-feature fusion is constructed for object localization; it is optimized by the Lagrange method and solved iteratively in closed form. During filter optimization, a time-space perception constraint term is designed to enhance the learning ability of the CF and improve the final tracking results. Finally, when the tracking results fluctuate, a boundary suppression factor is introduced into the instance-specific proposals to reduce the risk of model drift. The accuracy and success rate of the proposed algorithm are verified by simulation on two popular benchmarks, the object tracking benchmark 2015 (OTB2015) and Temple Color 128 (TC-128). Extensive experimental results show that the optimized appearance model of the proposed algorithm is effective. On the OTB2015 benchmark, the distance precision rate and overlap success rate of the proposed algorithm are 0.756 and 0.656, better than those of the competing algorithms. The results of this study can be applied to real-time object tracking in real traffic environments and provide a reference for detecting traffic abnormalities.
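The two reported metrics, distance precision and overlap success, can be sketched per sequence as below. Boxes are assumed to be `(x, y, width, height)` tuples. The 20-pixel distance threshold follows the usual OTB convention, and the 0.5 overlap threshold is an illustrative assumption; the abstract does not state which thresholds, or whether an area-under-curve score, produced the figures 0.756 and 0.656.

```python
import math


def center_error(box_a, box_b):
    """Euclidean distance between the centers of two (x, y, w, h) boxes."""
    ax, ay = box_a[0] + box_a[2] / 2, box_a[1] + box_a[3] / 2
    bx, by = box_b[0] + box_b[2] / 2, box_b[1] + box_b[3] / 2
    return math.hypot(ax - bx, ay - by)


def iou(box_a, box_b):
    """Intersection over union of two (x, y, w, h) boxes."""
    inter_w = max(0.0, min(box_a[0] + box_a[2], box_b[0] + box_b[2])
                  - max(box_a[0], box_b[0]))
    inter_h = max(0.0, min(box_a[1] + box_a[3], box_b[1] + box_b[3])
                  - max(box_a[1], box_b[1]))
    inter = inter_w * inter_h
    union = box_a[2] * box_a[3] + box_b[2] * box_b[3] - inter
    return inter / union if union > 0 else 0.0


def precision_and_success(predicted, ground_truth, dist_thresh=20.0, iou_thresh=0.5):
    """Fraction of frames within the distance threshold and above the IoU threshold."""
    n = len(ground_truth)
    pairs = list(zip(predicted, ground_truth))
    precision = sum(center_error(p, g) <= dist_thresh for p, g in pairs) / n
    success = sum(iou(p, g) > iou_thresh for p, g in pairs) / n
    return precision, success


if __name__ == "__main__":
    truth = [(0, 0, 10, 10), (0, 0, 10, 10)]
    tracked = [(0, 0, 10, 10), (100, 100, 10, 10)]  # second frame has drifted
    print(precision_and_success(tracked, truth))
```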
Funding: Supported by the National Natural Science Foundation of China (Nos. 61308099, 61304032).
Abstract: Because of its limited dynamic range, a camera cannot reveal all the details in a high-dynamic-range scene. To solve this problem, this paper presents a multi-exposure fusion method for obtaining high-quality images of high-dynamic-range scenes. First, a set of multi-exposure images is obtained by taking multiple exposures of the same scene, and their brightness conditions are analyzed. Then, the multi-exposure images of the same scene are decomposed using the dual-tree complex wavelet transform (DT-CWT) to obtain their low- and high-frequency components. Weight maps based on the brightness conditions are assigned to the low-frequency components for fusion. The high-frequency components are fused by maximizing the regional Sum Modified-Laplacian (SML). Finally, the fused image is obtained by applying the inverse DT-CWT to the fused low- and high-frequency coefficients. Experimental results show that the proposed approach generates high-quality results with uniformly distributed brightness and rich details. The proposed method is efficient and robust in various scenes.
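The high-frequency fusion rule above, keeping the coefficient with the larger regional Sum Modified-Laplacian, can be sketched on plain real-valued 2D arrays. The abstract applies the rule to DT-CWT coefficients, which this sketch does not compute. The window radius, the replicated-border handling, and the function names are assumptions for illustration.

```python
def modified_laplacian(img):
    """Per-pixel modified Laplacian: |2I - left - right| + |2I - up - down|."""
    rows, cols = len(img), len(img[0])

    def at(r, c):
        # Replicate border pixels (an assumed boundary treatment).
        r = min(max(r, 0), rows - 1)
        c = min(max(c, 0), cols - 1)
        return img[r][c]

    return [
        [
            abs(2 * at(r, c) - at(r - 1, c) - at(r + 1, c))
            + abs(2 * at(r, c) - at(r, c - 1) - at(r, c + 1))
            for c in range(cols)
        ]
        for r in range(rows)
    ]


def sml(img, radius=1):
    """Sum of the modified Laplacian over a (2*radius+1)^2 window around each pixel."""
    ml = modified_laplacian(img)
    rows, cols = len(ml), len(ml[0])
    out = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            total = 0.0
            for dr in range(-radius, radius + 1):
                for dc in range(-radius, radius + 1):
                    rr, cc = r + dr, c + dc
                    if 0 <= rr < rows and 0 <= cc < cols:
                        total += ml[rr][cc]
            out[r][c] = total
    return out


def fuse_by_max_sml(a, b, radius=1):
    """At each pixel keep the value from whichever input has the larger SML."""
    sml_a, sml_b = sml(a, radius), sml(b, radius)
    return [
        [a[r][c] if sml_a[r][c] >= sml_b[r][c] else b[r][c]
         for c in range(len(a[0]))]
        for r in range(len(a))
    ]


if __name__ == "__main__":
    flat = [[1, 1, 1], [1, 1, 1], [1, 1, 1]]
    sharp = [[0, 0, 0], [0, 9, 0], [0, 0, 0]]
    print(fuse_by_max_sml(flat, sharp))
```

SML is a local sharpness measure, so the rule favors whichever exposure carries more detail in each neighborhood.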
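Abstract: Deep learning-based object detection is currently a hot topic in computer vision and plays an important role in fields such as object recognition and tracking. As research has progressed, deep learning-based object detection methods have come to be divided mainly into anchor-based and anchor-free methods. Anchor-free methods do not need a large number of predefined anchor boxes, have lower model complexity and more stable detection performance, and are among the more advanced methods in object detection today. Based on a survey of the relevant literature in China and abroad, this paper reviews anchor-free object detection methods and the datasets commonly used in various scenarios. According to the differences in sample assignment, the overall structures are analyzed and summarized from four perspectives: keypoint combination, center-point regression, Transformer, and the fusion of anchor-based and anchor-free approaches. The methods are further compared using performance metrics on the COCO (Common Objects in Context) dataset. On this basis, the paper introduces applications of anchor-free object detection in complex scenes involving overlapping, small, and rotated objects, focuses on key problems such as object occlusion, very small object size, and multiple orientations, and reviews the strengths, weaknesses, and difficulties of existing methods. Finally, the remaining problems of anchor-free object detection are summarized, and future development and application trends are discussed.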
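To make the anchor-free idea concrete, here is a minimal sketch of one common formulation in which each feature-map location predicts its distances to the four sides of a box, so no predefined anchor boxes are needed. This is an illustrative assumption about what a point-based regression head looks like, not a description of any specific method covered by the survey; the function names are hypothetical.

```python
def encode_ltrb(px, py, box):
    """Regression targets for a location (px, py) inside box = (x1, y1, x2, y2)."""
    x1, y1, x2, y2 = box
    return (px - x1, py - y1, x2 - px, y2 - py)


def decode_ltrb(px, py, distances):
    """Recover (x1, y1, x2, y2) from a location and its (left, top, right, bottom)."""
    left, top, right, bottom = distances
    return (px - left, py - top, px + right, py + bottom)


def is_positive(px, py, box):
    """A location counts as a positive sample only if it lies inside the box."""
    return all(d > 0 for d in encode_ltrb(px, py, box))


if __name__ == "__main__":
    gt_box = (10, 20, 50, 80)
    targets = encode_ltrb(30, 40, gt_box)
    print(targets, decode_ltrb(30, 40, targets), is_positive(5, 40, gt_box))
```

The sample-assignment question the survey raises shows up here in `is_positive`: which locations are treated as positives for a given ground-truth box is a design choice, and this sketch uses the simplest rule.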