期刊文献+
共找到4,221篇文章
< 1 2 212 >
每页显示 20 50 100
基于Yolopose的挖掘机检测与工作状态识别
1
作者 黄健 赵小飞 +1 位作者 王虎 胡其胜 《计算机系统应用》 2024年第2期299-307,共9页
针对光缆、高压油气管道等地下基础设施周边容易受到挖掘机的野蛮入侵问题.本文提出了一种结合Yolopose和多层感知机的挖掘机检测与工作状态判别方法.首先,设计了基于Yolopose的挖掘机6点姿势的提取网络Yolopose-ex;其次,利用Yolopose-e... 针对光缆、高压油气管道等地下基础设施周边容易受到挖掘机的野蛮入侵问题.本文提出了一种结合Yolopose和多层感知机的挖掘机检测与工作状态判别方法.首先,设计了基于Yolopose的挖掘机6点姿势的提取网络Yolopose-ex;其次,利用Yolopose-ex模型提取视频中挖掘机工作姿态的变化信息,构建了挖掘机的工作状态特征向量(MSV);最后,利用深度学习算法多层感知机(multilayer perceptron,MLP)分析了视频中的挖掘机的工作状态.实验结果表明,所提出的方法克服了复杂背景难以识别的问题,对挖掘机工作状态识别准确率达到了96.6%,具有较高的推理速度和泛化能力. 展开更多
关键词 挖掘机 Yolopose 姿势估计 工作状态识别
下载PDF
Movement Function Assessment Based on Human Pose Estimation from Multi-View
2
作者 Lingling Chen Tong Liu +1 位作者 Zhuo Gong Ding Wang 《Computer Systems Science & Engineering》 2024年第2期321-339,共19页
Human pose estimation is a basic and critical task in the field of computer vision that involves determining the position(or spatial coordinates)of the joints of the human body in a given image or video.It is widely u... Human pose estimation is a basic and critical task in the field of computer vision that involves determining the position(or spatial coordinates)of the joints of the human body in a given image or video.It is widely used in motion analysis,medical evaluation,and behavior monitoring.In this paper,the authors propose a method for multi-view human pose estimation.Two image sensors were placed orthogonally with respect to each other to capture the pose of the subject as they moved,and this yielded accurate and comprehensive results of three-dimensional(3D)motion reconstruction that helped capture their multi-directional poses.Following this,we propose a method based on 3D pose estimation to assess the similarity of the features of motion of patients with motor dysfunction by comparing differences between their range of motion and that of normal subjects.We converted these differences into Fugl–Meyer assessment(FMA)scores in order to quantify them.Finally,we implemented the proposed method in the Unity framework,and built a Virtual Reality platform that provides users with human–computer interaction to make the task more enjoyable for them and ensure their active participation in the assessment process.The goal is to provide a suitable means of assessing movement disorders without requiring the immediate supervision of a physician. 展开更多
关键词 Human pose estimation 3D pose reconstruction assessment of movement function plane of features of human motion
下载PDF
基于YOLO-Pose的城市街景小目标行人姿态估计算法
3
作者 马明旭 马宏 宋华伟 《计算机工程》 CAS CSCD 北大核心 2024年第4期177-186,共10页
现有的姿态估计算法在城市街景中对小目标行人的检测效果不佳。针对该问题,提出一种基于YOLO-Pose的小目标行人姿态估计算法YOLO-Pose-CBAM。通过引入CBAM注意力机制模块,在不增加过多计算量的前提下,增强网络聚焦小目标行人区域的能力... 现有的姿态估计算法在城市街景中对小目标行人的检测效果不佳。针对该问题,提出一种基于YOLO-Pose的小目标行人姿态估计算法YOLO-Pose-CBAM。通过引入CBAM注意力机制模块,在不增加过多计算量的前提下,增强网络聚焦小目标行人区域的能力,提升算法对小目标行人的敏感度,同时在主干网络中使用4个不同尺寸的检测头,丰富算法对图片中不同大小行人的检测手段;在骨干网络和颈部之间架设2条跨层级联通道,提升浅层网络与深层网络之间的特征融合能力,进一步增强信息交流,降低小目标行人漏检率;引入SIoU重新定义边界框回归的定位损失函数,加快训练的收敛速度,提高检测精度;采用k-means++算法代替k-means算法对数据集中标注的锚框进行聚类,避免聚类中心初始化时导致的局部最优解问题,从而选择出更适合检测小目标行人的锚框。对比实验结果表明,在小目标行人Wider Keypoints数据集上,所提算法相较于YOLO-Pose和YOLOv7-Pose在平均精度上分别提升了4.6和6.5个百分比。 展开更多
关键词 YOLO-pose算法 姿态估计 跨层级联 CBAM注意力机制 SIo U损失函数 k-means++算法
下载PDF
基于MediaPipe Pose的人体动作识别方法研究
4
作者 张恒博 刘大铭 +1 位作者 伏娜娜 邢霄海 《宁夏工程技术》 CAS 2024年第1期79-84,91,共7页
针对已有人体动作识别方法存在识别效率低、检测速度慢等问题,提出了基于MediaPipe Pose算法的人体动作识别方法。具体内容:将摄像头实时采集数据输入到检测网络以获取人体33个关键点的坐标信息,然后通过关键点的空间位置组合来确定人... 针对已有人体动作识别方法存在识别效率低、检测速度慢等问题,提出了基于MediaPipe Pose算法的人体动作识别方法。具体内容:将摄像头实时采集数据输入到检测网络以获取人体33个关键点的坐标信息,然后通过关键点的空间位置组合来确定人体动作类别;采用COCO数据集格式标定动作类别,并且对动作标签进行onehot编码,训练人体动作识别模型;利用单目RGB摄像头对8类动作进行实验验证。结果表明,基于MediaPipe Pose算法的人体动作识别方法其帧率达到30帧/s,识别精确率为96.67%,能够实时、准确地识别人体动作。 展开更多
关键词 MediaPipe pose 人体动作识别 深度学习
下载PDF
Lightweight Multi-Resolution Network for Human Pose Estimation
5
作者 Pengxin Li Rong Wang +2 位作者 Wenjing Zhang Yinuo Liu Chenyue Xu 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第3期2239-2255,共17页
Human pose estimation aims to localize the body joints from image or video data.With the development of deeplearning,pose estimation has become a hot research topic in the field of computer vision.In recent years,huma... Human pose estimation aims to localize the body joints from image or video data.With the development of deeplearning,pose estimation has become a hot research topic in the field of computer vision.In recent years,humanpose estimation has achieved great success in multiple fields such as animation and sports.However,to obtainaccurate positioning results,existing methods may suffer from large model sizes,a high number of parameters,and increased complexity,leading to high computing costs.In this paper,we propose a new lightweight featureencoder to construct a high-resolution network that reduces the number of parameters and lowers the computingcost.We also introduced a semantic enhancement module that improves global feature extraction and networkperformance by combining channel and spatial dimensions.Furthermore,we propose a dense connected spatialpyramid pooling module to compensate for the decrease in image resolution and information loss in the network.Finally,ourmethod effectively reduces the number of parameters and complexitywhile ensuring high performance.Extensive experiments show that our method achieves a competitive performance while dramatically reducing thenumber of parameters,and operational complexity.Specifically,our method can obtain 89.9%AP score on MPIIVAL,while the number of parameters and the complexity of operations were reduced by 41%and 36%,respectively. 展开更多
关键词 LIGHTWEIGHT human pose estimation keypoint detection high resolution network
下载PDF
基于改进YOLOv5s-pose的多人人体姿态估计
6
作者 蒋锦华 庄丽萍 +2 位作者 陈锦 姚洪泽 蔡志明 《软件工程》 2024年第1期74-78,共5页
为了提高多人人体姿态检测的准确率,本研究采用YOLOv5s模型用于多人人体姿态检测并对模型进行改进。首先,引入坐标注意力(Coordinate Attention)模块改进骨干网络,将注意力资源分配给关键区域,降低复杂环境中的背景干扰,增强模型对多人... 为了提高多人人体姿态检测的准确率,本研究采用YOLOv5s模型用于多人人体姿态检测并对模型进行改进。首先,引入坐标注意力(Coordinate Attention)模块改进骨干网络,将注意力资源分配给关键区域,降低复杂环境中的背景干扰,增强模型对多人目标的精准定位能力。其次,使用双向特征金字塔网络改进YOLOv5s的特征融合网络,增强网络的信息表达能力。实验结果表明:在多人人体姿态MS COCO2017验证集上,经改进的YOLOv5s算法的检测平均精度高达61.9%,相比原始YOLOv5s网络,平均精度提升了1.5%。由此可见,改进后的网络能更加精准、有效地检测多人人体姿态。 展开更多
关键词 多人人体姿态检测 YOLOv5s 双向特征金字塔网络 检测精度
下载PDF
基于改进YOLO v8-Pose的红熟期草莓识别和果柄检测 被引量:1
7
作者 刘莫尘 褚镇源 +3 位作者 崔明诗 杨庆璐 王金星 杨化伟 《农业机械学报》 EI CAS CSCD 北大核心 2023年第S02期244-251,共8页
针对高架栽培模式下的大棚草莓,借鉴人体姿态检测算法,建立了改进YOLO v8-Pose模型对红熟期草莓进行识别与果柄关键点检测。通过对比YOLO v5-Pose、YOLO v7-Pose、YOLO v8-Pose模型,确定使用YOLO v8-Pose模型作为对红熟期草莓识别与关... 针对高架栽培模式下的大棚草莓,借鉴人体姿态检测算法,建立了改进YOLO v8-Pose模型对红熟期草莓进行识别与果柄关键点检测。通过对比YOLO v5-Pose、YOLO v7-Pose、YOLO v8-Pose模型,确定使用YOLO v8-Pose模型作为对红熟期草莓识别与关键点预测的模型。以YOLO v8-Pose为基础,对其网络结构添加Slim-neck模块与CBAM注意力机制模块,提高模型对小目标物体的特征提取能力,以适应草莓数据集的特点。改进YOLO v8-Pose能够有效检测红熟期草莓并准确标记出果柄关键点,P、R、mAP-kp分别为98.14%、94.54%、97.91%,比YOLO v8-Pose分别提高5.41、5.31、8.29个百分点。模型内存占用量为22 MB,比YOLO v8-Pose的占用量小6 MB。此外,针对果园非结构化的特征,探究了光线、遮挡与拍摄角度对模型预测的影响。对比改进前后的模型在复杂环境下对红熟期草莓的识别与果柄预测情况,改进YOLO v8-Pose在受遮挡、光线和角度影响情况下的mAPkp分别为94.52%、95.48%、94.63%,较YOLO v8-Pose分别提高8.9、10.75、5.17个百分点。改进YOLO v8-Pose可在保证网络模型精度的同时对遮挡、光线和拍摄角度等影响均具有较好的鲁棒性,能够实现对复杂环境下红熟期草莓识别与果柄关键点预测。 展开更多
关键词 红熟期草莓识别 关键点预测 YOLO v8-pose 注意力机制
下载PDF
基于改进YOLO-Pose的复杂环境下拖拉机驾驶员关键点检测
8
作者 徐红梅 杨浩 +3 位作者 李亚林 张文杰 赵亚兵 吴擎 《农业工程学报》 EI CAS CSCD 北大核心 2023年第16期139-149,共11页
为解决农田复杂作业环境下拖拉机驾驶员因光照、背景及遮挡造成的关键点漏检、误检等难识别问题,该研究提出了一种基于改进YOLO-Pose的复杂环境下驾驶员关键点检测方法。首先,在主干网络的顶层C3模块中嵌入Swin Transformer编码器,提高... 为解决农田复杂作业环境下拖拉机驾驶员因光照、背景及遮挡造成的关键点漏检、误检等难识别问题,该研究提出了一种基于改进YOLO-Pose的复杂环境下驾驶员关键点检测方法。首先,在主干网络的顶层C3模块中嵌入Swin Transformer编码器,提高遮挡状况下关键点的检测效率。其次,采用高效层聚合网络RepGFPN作为颈部网络,通过融合高层语义信息和低层空间信息,增强多尺度检测能力,同时在颈部网络采用金字塔卷积替换标准3×3卷积,在减少模型参数量的同时有效地捕获不同层级的特征信息。最后,嵌入坐标注意力机制优化关键点解耦头,增强预测过程对关键点空间位置的敏感程度。试验结果表明,改进后算法mAP0.5(目标关键点相似度Loks阈值取0.5时平均精度均值)为89.59%,mAP0.5:0.95(目标关键点相似度Loks阈值取0.5,0.55,…,0.95时的平均精度均值)为62.58%,相比于基线模型分别提高了4.24和4.15个百分点,单张图像平均检测时间为21.9 ms,与当前主流关键点检测网络Hourglass、HRNet-W32及DEKR相比,mAP0.5分别提升了7.94、5.27、2.66个百分点,模型大小分别减少了257.5、8.2、9.3 M。改进后的关键点检测算法具有较高的检测精度和推理速度,可为拖拉机驾驶员的异常行为识别和状态监测提供技术支持。 展开更多
关键词 拖拉机 深度学习 检测 驾驶员 YOLO-pose 关键点
下载PDF
基于YOLOPose的人体姿态估计轻量级网络
9
作者 王红霞 李枝峻 顾鹏 《沈阳理工大学学报》 CAS 2023年第6期10-16,共7页
为解决人体姿态估计模型在提升预测精度时参数量和计算量增多导致模型运行效率低下的问题,在YOLOPose模型基础上设计出一种轻量级人体姿态估计网络MWE-YOLOPose。选择轻量级MobileNetV3网络重新构建骨干网络,保持特征丰富性同时加快特... 为解决人体姿态估计模型在提升预测精度时参数量和计算量增多导致模型运行效率低下的问题,在YOLOPose模型基础上设计出一种轻量级人体姿态估计网络MWE-YOLOPose。选择轻量级MobileNetV3网络重新构建骨干网络,保持特征丰富性同时加快特征提取速度;调整特征融合层通道数并添加ECA注意力机制进行跨通道交互,实现模型轻量化与准确度的平衡;引用WIOUV2损失函数降低几何因素的惩罚,增强模型的鲁棒性和泛化能力。实验结果显示,在OC_Human数据集上,改进后模型对比原始YOLOPose模型,在保持一定准确度的情况下,模型参数量和计算量分别降低86.8%和71.2%,有效降低了模型运算复杂度。 展开更多
关键词 人体姿态估计 YOLOpose MobileNetV3 WIOUV2 ECA注意力机制
下载PDF
Overview of 3D Human Pose Estimation 被引量:1
10
作者 Jianchu Lin Shuang Li +5 位作者 Hong Qin Hongchang Wang Ning Cui Qian Jiang Haifang Jian Gongming Wang 《Computer Modeling in Engineering & Sciences》 SCIE EI 2023年第3期1621-1651,共31页
3D human pose estimation is a major focus area in the field of computer vision,which plays an important role in practical applications.This article summarizes the framework and research progress related to the estimat... 3D human pose estimation is a major focus area in the field of computer vision,which plays an important role in practical applications.This article summarizes the framework and research progress related to the estimation of monocular RGB images and videos.An overall perspective ofmethods integrated with deep learning is introduced.Novel image-based and video-based inputs are proposed as the analysis framework.From this viewpoint,common problems are discussed.The diversity of human postures usually leads to problems such as occlusion and ambiguity,and the lack of training datasets often results in poor generalization ability of the model.Regression methods are crucial for solving such problems.Considering image-based input,the multi-view method is commonly used to solve occlusion problems.Here,the multi-view method is analyzed comprehensively.By referring to video-based input,the human prior knowledge of restricted motion is used to predict human postures.In addition,structural constraints are widely used as prior knowledge.Furthermore,weakly supervised learningmethods are studied and discussed for these two types of inputs to improve the model generalization ability.The problem of insufficient training datasets must also be considered,especially because 3D datasets are usually biased and limited.Finally,emerging and popular datasets and evaluation indicators are discussed.The characteristics of the datasets and the relationships of the indicators are explained and highlighted.Thus,this article can be useful and instructive for researchers who are lacking in experience and find this field confusing.In addition,by providing an overview of 3D human pose estimation,this article sorts and refines recent studies on 3D human pose estimation.It describes kernel problems and common useful methods,and discusses the scope for further research. 展开更多
关键词 3D human pose estimation monocular camera deep learning MULTI-VIEW INDICATOR
下载PDF
Squirrel Search Optimization with Deep Convolutional Neural Network for Human Pose Estimation 被引量:1
11
作者 K.Ishwarya A.Alice Nithya 《Computers, Materials & Continua》 SCIE EI 2023年第3期6081-6099,共19页
Human pose estimation(HPE)is a procedure for determining the structure of the body pose and it is considered a challenging issue in the computer vision(CV)communities.HPE finds its applications in several fields namel... Human pose estimation(HPE)is a procedure for determining the structure of the body pose and it is considered a challenging issue in the computer vision(CV)communities.HPE finds its applications in several fields namely activity recognition and human-computer interface.Despite the benefits of HPE,it is still a challenging process due to the variations in visual appearances,lighting,occlusions,dimensionality,etc.To resolve these issues,this paper presents a squirrel search optimization with a deep convolutional neural network for HPE(SSDCNN-HPE)technique.The major intention of the SSDCNN-HPE technique is to identify the human pose accurately and efficiently.Primarily,the video frame conversion process is performed and pre-processing takes place via bilateral filtering-based noise removal process.Then,the EfficientNet model is applied to identify the body points of a person with no problem constraints.Besides,the hyperparameter tuning of the EfficientNet model takes place by the use of the squirrel search algorithm(SSA).In the final stage,the multiclass support vector machine(M-SVM)technique was utilized for the identification and classification of human poses.The design of bilateral filtering followed by SSA based EfficientNetmodel for HPE depicts the novelty of the work.To demonstrate the enhanced outcomes of the SSDCNN-HPE approach,a series of simulations are executed.The experimental results reported the betterment of the SSDCNN-HPE system over the recent existing techniques in terms of different measures. 展开更多
关键词 Parameter tuning human pose estimation deep learning squirrel search algorithm activity recognition
下载PDF
Optimal Deep Convolutional Neural Network with Pose Estimation for Human Activity Recognition 被引量:1
12
作者 S.Nandagopal G.Karthy +1 位作者 A.Sheryl Oliver M.Subha 《Computer Systems Science & Engineering》 SCIE EI 2023年第2期1719-1733,共15页
Human Action Recognition(HAR)and pose estimation from videos have gained significant attention among research communities due to its applica-tion in several areas namely intelligent surveillance,human robot interaction... Human Action Recognition(HAR)and pose estimation from videos have gained significant attention among research communities due to its applica-tion in several areas namely intelligent surveillance,human robot interaction,robot vision,etc.Though considerable improvements have been made in recent days,design of an effective and accurate action recognition model is yet a difficult process owing to the existence of different obstacles such as variations in camera angle,occlusion,background,movement speed,and so on.From the literature,it is observed that hard to deal with the temporal dimension in the action recognition process.Convolutional neural network(CNN)models could be used widely to solve this.With this motivation,this study designs a novel key point extraction with deep convolutional neural networks based pose estimation(KPE-DCNN)model for activity recognition.The KPE-DCNN technique initially converts the input video into a sequence of frames followed by a three stage process namely key point extraction,hyperparameter tuning,and pose estimation.In the keypoint extraction process an OpenPose model is designed to compute the accurate key-points in the human pose.Then,an optimal DCNN model is developed to classify the human activities label based on the extracted key points.For improving the training process of the DCNN technique,RMSProp optimizer is used to optimally adjust the hyperparameters such as learning rate,batch size,and epoch count.The experimental results tested using benchmark dataset like UCF sports dataset showed that KPE-DCNN technique is able to achieve good results compared with benchmark algorithms like CNN,DBN,SVM,STAL,T-CNN and so on. 展开更多
关键词 Human activity recognition pose estimation key point extraction classification deep learning RMSProp
下载PDF
基于AlphaPose的行人重识别姿态评价方法 被引量:1
13
作者 刘立名 马传香 《湖北大学学报(自然科学版)》 CAS 2023年第5期702-711,共10页
行人重识别旨在不同时间、不同摄像头拍摄范围中检索特定目标行人,在实际应用场景中,可能会存在行人被严重遮挡的图像,不仅不利于行人检测,还会消耗大量的时间.行人姿态检测可以通过定位行人关键点位置判断行人是否存在遮挡,因此,本研... 行人重识别旨在不同时间、不同摄像头拍摄范围中检索特定目标行人,在实际应用场景中,可能会存在行人被严重遮挡的图像,不仅不利于行人检测,还会消耗大量的时间.行人姿态检测可以通过定位行人关键点位置判断行人是否存在遮挡,因此,本研究提出在重识别检测之前,对行人姿态进行分析,提出一种基于AlphaPose的重识别行人姿态评价方法.首先,利用AlphaPose进行姿态检测,得到行人各个关键点的置信度;然后,利用各个关键点的置信度得到各个行人的姿态评分;最后,根据姿态评分结果筛选出多个测试集进行验证分析.利用torchreid框架在数据集DukeMTMC-reID及Market1501进行实验,实验结果表明,与初始测试集相比,筛选后的测试集检测效率明显提高,且mAP和rank-n值也有所提高. 展开更多
关键词 姿态检测 行人重识别 Alphapose检测 姿态评分 torchreid
下载PDF
基于改进OpenPose算法的矿工危险行为识别研究 被引量:1
14
作者 刘斌 贾浩强 +3 位作者 杨一 申佳 盖美辰 宋天霖 《电视技术》 2023年第2期20-23,共4页
针对现有危险行为检测中原有的人体姿态行为识别算法OpenPose存在参数量大、算力要求高的缺点,在保证准确度的情况下,提出采用MobileNet v3网络的1—18层代替OpenPose原有的VGG19网络前10层,同时采用ST-GCN时空图卷积网络完成动作的分类... 针对现有危险行为检测中原有的人体姿态行为识别算法OpenPose存在参数量大、算力要求高的缺点,在保证准确度的情况下,提出采用MobileNet v3网络的1—18层代替OpenPose原有的VGG19网络前10层,同时采用ST-GCN时空图卷积网络完成动作的分类,实现对危险行为的识别。实验结果表明,本次改进的算法对于摔倒、危险攀爬等行为的识别准确率达到94%以上,较原有的模型准确率提升了5%以上。同时,经过对模型的轻量化改变,使得模型体积缩小、参数较少,提高了模型的运算速度。 展开更多
关键词 Openpose MobileNet v3 姿态检测 ST-GCN
下载PDF
基于AlphaPose姿态识别模型的武术评价研究与实验
15
作者 王帅 姚翠莉 +2 位作者 杨雨龙 刘昊 魏维 《现代计算机》 2023年第15期50-54,共5页
以传统武术项目为切入点设计实验,利用深度学习模型智能完成武术评价矫正任务,解决使用者时间个性化和评价稳定性的需求。针对相关难点和痛点,设计了准确有效的动作评估方法、迁移向量自适应被测试目标身体比例的匹配机制;对于具有预存... 以传统武术项目为切入点设计实验,利用深度学习模型智能完成武术评价矫正任务,解决使用者时间个性化和评价稳定性的需求。针对相关难点和痛点,设计了准确有效的动作评估方法、迁移向量自适应被测试目标身体比例的匹配机制;对于具有预存模型的单人评价与矫正提示任务,解决了基准动作向量与被测试目标动作向量之间的时间序列和空间序列匹配问题,测试实验结果表明了动作评价矫正方法的有效性。 展开更多
关键词 Alphapose模型 姿态估计 武术评价 分数拟合
下载PDF
Spacecraft Pose Estimation Based on Different Camera Models
16
作者 Lidong Mo Naiming Qi Zhenqing Zhao 《Chinese Journal of Mechanical Engineering》 SCIE EI CAS CSCD 2023年第3期262-268,共7页
Spacecraft pose estimation is an important technology to maintain or change the spacecraft orientation in space.For spacecraft pose estimation,when two spacecraft are relatively distant,the depth information of the sp... Spacecraft pose estimation is an important technology to maintain or change the spacecraft orientation in space.For spacecraft pose estimation,when two spacecraft are relatively distant,the depth information of the space point is less than that of the measuring distance,so the camera model can be seen as a weak perspective projection model.In this paper,a spacecraft pose estimation algorithm based on four symmetrical points of the spacecraft outline is proposed.The analytical solution of the spacecraft pose is obtained by solving the weak perspective projection model,which can satisfy the requirements of the measurement model when the measurement distance is long.The optimal solution is obtained from the weak perspective projection model to the perspective projection model,which can meet the measurement requirements when the measuring distance is small.The simulation results show that the proposed algorithm can obtain better results,even though the noise is large. 展开更多
关键词 Spacecraft pose estimation Weak perspective projection Optimal solution
下载PDF
A Survey on Deep Learning-Based 2D Human Pose Estimation Models
17
作者 Sani Salisu A.S.A.Mohamed +2 位作者 M.H.Jaafar Ainun S.B.Pauzi Hussain A.Younis 《Computers, Materials & Continua》 SCIE EI 2023年第8期2385-2400,共16页
In this article,a comprehensive survey of deep learning-based(DLbased)human pose estimation(HPE)that can help researchers in the domain of computer vision is presented.HPE is among the fastest-growing research domains... In this article,a comprehensive survey of deep learning-based(DLbased)human pose estimation(HPE)that can help researchers in the domain of computer vision is presented.HPE is among the fastest-growing research domains of computer vision and is used in solving several problems for human endeavours.After the detailed introduction,three different human body modes followed by the main stages of HPE and two pipelines of twodimensional(2D)HPE are presented.The details of the four components of HPE are also presented.The keypoints output format of two popular 2D HPE datasets and the most cited DL-based HPE articles from the year of breakthrough are both shown in tabular form.This study intends to highlight the limitations of published reviews and surveys respecting presenting a systematic review of the current DL-based solution to the 2D HPE model.Furthermore,a detailed and meaningful survey that will guide new and existing researchers on DL-based 2D HPE models is achieved.Finally,some future research directions in the field of HPE,such as limited data on disabled persons and multi-training DL-based models,are revealed to encourage researchers and promote the growth of HPE research. 展开更多
关键词 Human pose estimation deep learning 2D DATASET MODELS body parts
下载PDF
3D Human Pose Estimation Using Two-Stream Architecture with Joint Training
18
作者 Jian Kang Wanshu Fan +2 位作者 Yijing Li Rui Liu Dongsheng Zhou 《Computer Modeling in Engineering & Sciences》 SCIE EI 2023年第10期607-629,共23页
With the advancement of image sensing technology, estimating 3Dhuman pose frommonocular video has becomea hot research topic in computer vision. 3D human pose estimation is an essential prerequisite for subsequentacti... With the advancement of image sensing technology, estimating 3Dhuman pose frommonocular video has becomea hot research topic in computer vision. 3D human pose estimation is an essential prerequisite for subsequentaction analysis and understanding. It empowers a wide spectrum of potential applications in various areas, suchas intelligent transportation, human-computer interaction, and medical rehabilitation. Currently, some methodsfor 3D human pose estimation in monocular video employ temporal convolutional network (TCN) to extractinter-frame feature relationships, but the majority of them suffer from insufficient inter-frame feature relationshipextractions. In this paper, we decompose the 3D joint location regression into the bone direction and length, wepropose the TCG, a temporal convolutional network incorporating Gaussian error linear units (GELU), to solvebone direction. It enablesmore inter-frame features to be captured andmakes the utmost of the feature relationshipsbetween data. Furthermore, we adopt kinematic structural information to solve bone length enhancing the use ofintra-frame joint features. Finally, we design a loss function for joint training of the bone direction estimationnetwork with the bone length estimation network. The proposed method has extensively experimented on thepublic benchmark dataset Human3.6M. Both quantitative and qualitative experimental results showed that theproposed method can achieve more accurate 3D human pose estimations. 展开更多
关键词 3D human pose improved TCN GELU kinematic structure
下载PDF
Exploiting Human Pose and Scene Information for Interaction Detection
19
作者 Manahil Waheed Samia Allaoua Chelloug +4 位作者 Mohammad Shorfuzzaman Abdulmajeed Alsufyani Ahmad Jalal Khaled Alnowaiser Jeongmin Park 《Computers, Materials & Continua》 SCIE EI 2023年第3期5853-5870,共18页
Identifying human actions and interactions finds its use in manyareas, such as security, surveillance, assisted living, patient monitoring, rehabilitation,sports, and e-learning. This wide range of applications has at... Identifying human actions and interactions finds its use in manyareas, such as security, surveillance, assisted living, patient monitoring, rehabilitation,sports, and e-learning. This wide range of applications has attractedmany researchers to this field. Inspired by the existing recognition systems,this paper proposes a new and efficient human-object interaction recognition(HOIR) model which is based on modeling human pose and scene featureinformation. There are different aspects involved in an interaction, includingthe humans, the objects, the various body parts of the human, and the backgroundscene. Themain objectives of this research include critically examiningthe importance of all these elements in determining the interaction, estimatinghuman pose through image foresting transform (IFT), and detecting the performedinteractions based on an optimizedmulti-feature vector. The proposedmethodology has six main phases. The first phase involves preprocessing theimages. During preprocessing stages, the videos are converted into imageframes. Then their contrast is adjusted, and noise is removed. In the secondphase, the human-object pair is detected and extracted from each image frame.The third phase involves the identification of key body parts of the detectedhumans using IFT. The fourth phase relates to three different kinds of featureextraction techniques. Then these features are combined and optimized duringthe fifth phase. The optimized vector is used to classify the interactions in thelast phase. TheMSRDaily Activity 3D dataset has been used to test this modeland to prove its efficiency. The proposed system obtains an average accuracyof 91.7% on this dataset. 展开更多
关键词 Artificial intelligence daily activities human interactions human pose information image foresting transform scene feature information
下载PDF
ER-Net:Efficient Recalibration Network for Multi-ViewMulti-Person 3D Pose Estimation
20
作者 Mi Zhou Rui Liu +1 位作者 Pengfei Yi Dongsheng Zhou 《Computer Modeling in Engineering & Sciences》 SCIE EI 2023年第8期2093-2109,共17页
Multi-view multi-person 3D human pose estimation is a hot topic in the field of human pose estimation due to its wide range of application scenarios.With the introduction of end-to-end direct regression methods,the fi... Multi-view multi-person 3D human pose estimation is a hot topic in the field of human pose estimation due to its wide range of application scenarios.With the introduction of end-to-end direct regression methods,the field has entered a new stage of development.However,the regression results of joints that are more heavily influenced by external factors are not accurate enough even for the optimal method.In this paper,we propose an effective feature recalibration module based on the channel attention mechanism and a relative optimal calibration strategy,which is applied to themulti-viewmulti-person 3D human pose estimation task to achieve improved detection accuracy for joints that are more severely affected by external factors.Specifically,it achieves relative optimal weight adjustment of joint feature information through the recalibration module and strategy,which enables the model to learn the dependencies between joints and the dependencies between people and their corresponding joints.We call this method as the Efficient Recalibration Network(ER-Net).Finally,experiments were conducted on two benchmark datasets for this task,Campus and Shelf,in which the PCP reached 97.3% and 98.3%,respectively. 展开更多
关键词 Multi-view multi-person pose estimation attention mechanism computer vision
下载PDF
上一页 1 2 212 下一页 到第
使用帮助 返回顶部