摘要
针对现有煤矸检测模型存在的特征提取不充分、参数量大、检测精度低且实时性差等问题,提出了一种基于YOLOv5s−FSW模型的选煤厂煤矸检测方法。该模型在YOLOv5s的基础上进行改进,首先将主干网络的C3模块替换为FasterNet Block结构,通过降低模型的参数量和计算量提高检测速度;然后,在颈部网络引入无参型SimAM注意力机制,增强模型对复杂环境下重要目标的关注,进一步提高模型的特征提取能力;最后,在输出端用Wise−IoU替换CIoU边界框损失函数,使模型聚焦普通质量锚框,提高收敛速度和边框的检测精度。消融实验结果表明:与YOLOv5s模型相比,YOLOv5s−FSW模型的平均精度均值(mAP)提高了1.9%,模型权重减少了0.6 MiB,参数量减少了4.7%,检测速度提高了19.3%。对比实验结果表明:YOLOv5s−FSW模型的mAP达95.8%,较YOLOv5s−CBC,YOLOv5s−ASA,YOLOv5s−SDE模型分别提高了1.1%,1.5%和1.2%,较YOLOv5m,YOLOv6s模型分别提高了0.3%,0.6%;检测速度达36.4帧/s,较YOLOv5s−CBC,YOLOv5s−ASA模型分别提高了28.2%和20.5%,较YOLOv5m,YOLOv6s,YOLOv7模型分别提高了16.3%,15.2%,45.0%。热力图可视化实验结果表明:YOLOv5s−FSW模型对煤矸目标特征区域更加敏感且关注度更高。检测实验结果表明:在环境昏暗、图像模糊、目标相互遮挡的复杂场景下,YOLOv5s−FSW模型对煤矸目标检测的置信度得分高于YOLOv5s模型,且有效避免了误检和漏检现象的发生。
A coal gangue detection method in coal preparation plant based on YOLOv5s-FSW model is proposed to address the problems of insufficient feature extraction,large parameter quantity,low detection precision,and poor real-time performance in existing coal gangue detection models.This model is improved on the basis of YOLOv5s.Firstly,the C3 module in the Backbone section is replaced with a FasterNet Block structure,which improves detection speed by reducing the number of model parameters and computation.Secondly,in the Neck section,a parameter free SimAM attention mechanism is introduced to enhance the model's attention to important targets in complex environments,further improving the model's feature extraction capability.Finally,in the Prediction layer,the CIoU bounding box loss function is replaced with Wise-IoU,and the model focuses on ordinary quality anchor boxes to improve convergence speed and bounding box detection precision.The results of the ablation experiment indicate that compared with the YOLOv5s model,The mean average precision(mAP)of the YOLOv5s-FSW model has been improved by 1.9%,the model weight has been reduced by 0.6 MiB,the number of parameters has been reduced by 4.7%,and the detection speed has been improved by 19.3%.The comparative experimental results show that the YOLOv5s-FSW model has a mAP of 95.8%,which is 1.1%,1.5%,and 1.2%higher compared to the YOLOv5s-CBC,YOLOv5s-ASA,and YOLOv5s-SDE models,respectively,and compared to YOLOv5m,YOLOv6s improved by 0.3%,0.6%respectively.The detection speed of the YOLOv5s-FSW reaches 36.4 frames per second,which is 28.2%and 20.5%higher than the YOLOv5s-CBC and YOLOv5s-ASA models,respectively.Compared to YOLOv5m,YOLOv6s and YOLOv7,the detection speed of the YOLOv5s-FSW has increased by 16.3%,15.2%,and 45.0%,respectively.The visualization experiment results of the thermal map show that the YOLOv5s-FSW model is more sensitive to the target feature areas of coal gangue and has higher attention.The detection experiment results show that in complex scenes with dim environments,blurred images,and mutual occlusion of targets,the YOLOv5s-FSW model has a higher confidence score for coal gangue target detection than the YOLOv5s model,and effectively avoids the occurrence of false positives and missed detection.
作者
燕碧娟
王凯民
郭鹏程
郑馨旭
董浩
刘勇
YAN Bijuan;WANG Kaimin;GUO Pengcheng;ZHENG Xinxu;DONG Hao;LIU Yong(College of Mechanical Engineering,Taiyuan University of Science and Technology,Taiyuan 030024,China;Shanxi Artificial Intelligence Mine Innovation Laboratory Co.,Ltd.,Taiyuan 030032,China)
出处
《工矿自动化》
CSCD
北大核心
2024年第5期36-43,66,共9页
Journal Of Mine Automation
基金
山西省重点研发计划项目(202102010101010)。