基于改进YOLOv5的眼睛及瞳孔检测算法

Eye and Pupil Detection Algorithm Based on Improved YOLOv5

下载PDF

导出

摘要针对眼睛图像易受光照干扰导致的眼睛部位和瞳孔部位检测不准确及误检漏检的问题,提出基于改进YOLOv5的眼睛及瞳孔检测算法。首先,进行图像预处理,对比了三种图像增强方法,决定运用效果较好的CLAHE(限制对比度自适应直方图均衡化)方法进行图像增强,提高对比度;其次,在YOLOv5网络中引入Swin Transformer模块代替骨干网络的最后一个C3模块和三个预测头中的三个C3模块,提高网络的特征提取能力,提升眼睛部位的检测精度;最后,在YOLOv5网络中通过引入多尺度特征跨层融合机制的方法,增加两个目标预测头,降低网络对眼睛部位和瞳孔部位的漏检率。该文从ELSE标准数据集中的Data setXVIII中选取了受光照程度不同的眼睛数据集2 400张,其中,1 600张为训练集,800张为测试集。实验结果表明,改进后的YOLOv5网络能检测出眼睛整体部位及完整的瞳孔部位,检测置信度也较高,mAP提高了3.2百分点,Recall提高了2.7百分点,且具有较好的实时性。 To address the issue of inaccurate and missed eye and pupil detection caused by the susceptibility of eye images to light interference,an improved YOLOv5 based eye and pupil detection algorithm is proposed.First of all,image pre-processing is carried out,and three image enhancement methods are compared.It is decided to use CLAHE(limited contrast Adaptive histogram equalization)method with good effect to enhance the image and improve the contrast;Secondly,the Swin Transformer module is introduced into YOLOv5 network to replace the last C3 module of the backbone network and three C3 modules in the three prediction heads,so as to improve the feature extraction ability of the network and improve the detection accuracy of eye parts;Finally,by introducing a multi-scale feature cross layer fusion mechanism in the YOLOv5 network,two target prediction heads are added to reduce the network's missed detection rate for eye and pupil regions.This article selected 2400 eye datasets with different levels of illumination from the Data setXVIII in the ELSE standard dataset,of which 1600 were training sets and 800 were testing sets.The experimental results show that the improved YOLOv5 network can detect the entire part of the eye and the complete pupil,with a high detection confidence.The mAP has increased by 3.2 percentage points,the Recall has increased by 2.7 percentage points,and has good real-time performance.

作者韩慧妍范鑫茹 HAN Hui-yan;FAN Xin-ru(School of Data Science and Technology,North University of China,Taiyuan 030051,China;Shanxi Key Laboratory of Machine Vision and Virtual Reality,Taiyuan 030051,China;Shanxi Province’s Vision Information Processing and Intelligent Robot Engineering Research Center,Taiyuan 030051,China)

机构地区中北大学计算机科学与技术学院机器视觉与虚拟现实山西省重点实验室山西省视觉信息处理及智能机器人工程研究中心

出处《计算机技术与发展》 2024年第4期76-81,共6页 Computer Technology and Development

基金国家自然科学基金(62106238) 山西省科技重大专项计划“揭榜挂帅”项目(202201150401021) 山西省自然科学基金项目(202303021211153) 山西省科技成果转化引导专项(202104021301055)。

关键词眼睛及瞳孔检测 YOLOv5 CLAHE Swin Transformer 多尺度特征跨层融合机制 eye part detection YOLOv5 CLAHE Swin Transformer Multi scale feature cross layer fusion mechanism

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

1薛少雄,张海军,张煦岩.基于直方图和二次椭圆拟合的瞳孔高效检测[J].中国体视学与图像分析,2023,28(4):401-409.
2郭喜华.VR环境下SET GO教学法的构建及在耳鼻喉临床教学的应用[J].中国教育技术装备,2024(4):42-44.
3Peter B.Brown.Letter FROM YOUR Editor[J].空中英语教室（高级版．彭蒙惠英语）,2024(3):1-1.
4致读者[J].Journal of Traditional Chinese Medical Sciences,2023,10(4):513-516.
5云霄,褚菲,张晓光,程小舟.基于沙漏注意力高分辨率网络的人体姿态评估实验[J].实验室研究与探索,2024,43(1):204-208.
6郗涛,王锴,王莉静.基于优化VMD-GRU的滚动轴承剩余使用寿命预测[J].中国工程机械学报,2024,22(1):101-106.
7刘飞翔,李泽荃,赵嘉良,李靖.基于ERNIE-BiGRU-CRF模型的煤矿安全隐患命名实体智能识别研究[J].煤炭工程,2024,56(2):206-212.
8李树萍,李世民,郭焕仙,孙丽娟,董琼.光照对树番茄幼苗碳、氮、磷、钾元素积累与分配的影响[J].西南农业学报,2024,37(2):313-319.
9李紫宣,官赛萍,靳小龙,白龙,郭嘉丰,程学旗.基于多历史序列联合演化建模的两阶段时序知识图谱推理[J].中文信息学报,2024,38(2):46-53.
10王梅,张天时,王志宝,任怡果.基于空间投影和聚类划分的SVR加速算法[J].计算机技术与发展,2024,34(4):24-29.

计算机技术与发展

2024年第4期

浏览历史

内容加载中请稍等...

基于改进YOLOv5的眼睛及瞳孔检测算法

相关作者

相关机构

相关主题

浏览历史