期刊文献+
共找到15篇文章
< 1 >
每页显示 20 50 100
CAW-YOLO:Cross-Layer Fusion and Weighted Receptive Field-Based YOLO for Small Object Detection in Remote Sensing
1
作者 Weiya Shi Shaowen Zhang Shiqiang Zhang 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第6期3209-3231,共23页
In recent years,there has been extensive research on object detection methods applied to optical remote sensing images utilizing convolutional neural networks.Despite these efforts,the detection of small objects in re... In recent years,there has been extensive research on object detection methods applied to optical remote sensing images utilizing convolutional neural networks.Despite these efforts,the detection of small objects in remote sensing remains a formidable challenge.The deep network structure will bring about the loss of object features,resulting in the loss of object features and the near elimination of some subtle features associated with small objects in deep layers.Additionally,the features of small objects are susceptible to interference from background features contained within the image,leading to a decline in detection accuracy.Moreover,the sensitivity of small objects to the bounding box perturbation further increases the detection difficulty.In this paper,we introduce a novel approach,Cross-Layer Fusion and Weighted Receptive Field-based YOLO(CAW-YOLO),specifically designed for small object detection in remote sensing.To address feature loss in deep layers,we have devised a cross-layer attention fusion module.Background noise is effectively filtered through the incorporation of Bi-Level Routing Attention(BRA).To enhance the model’s capacity to perceive multi-scale objects,particularly small-scale objects,we introduce a weightedmulti-receptive field atrous spatial pyramid poolingmodule.Furthermore,wemitigate the sensitivity arising from bounding box perturbation by incorporating the joint Normalized Wasserstein Distance(NWD)and Efficient Intersection over Union(EIoU)losses.The efficacy of the proposedmodel in detecting small objects in remote sensing has been validated through experiments conducted on three publicly available datasets.The experimental results unequivocally demonstrate the model’s pronounced advantages in small object detection for remote sensing,surpassing the performance of current mainstream models. 展开更多
关键词 small object detection attention mechanism cross-layer fusion discrete cosine transform
下载PDF
DSAFF-Net:A Backbone Network Based on Mask R-CNN for Small Object Detection
2
作者 Jian Peng Yifang Zhao +2 位作者 Dengyong Zhang Feng Li Arun Kumar Sangaiah 《Computers, Materials & Continua》 SCIE EI 2023年第2期3405-3419,共15页
Recently,object detection based on convolutional neural networks(CNNs)has developed rapidly.The backbone networks for basic feature extraction are an important component of the whole detection task.Therefore,we presen... Recently,object detection based on convolutional neural networks(CNNs)has developed rapidly.The backbone networks for basic feature extraction are an important component of the whole detection task.Therefore,we present a new feature extraction strategy in this paper,which name is DSAFF-Net.In this strategy,we design:1)a sandwich attention feature fusion module(SAFF module).Its purpose is to enhance the semantic information of shallow features and resolution of deep features,which is beneficial to small object detection after feature fusion.2)to add a new stage called D-block to alleviate the disadvantages of decreasing spatial resolution when the pooling layer increases the receptive field.The method proposed in the new stage replaces the original method of obtaining the P6 feature map and uses the result as the input of the regional proposal network(RPN).In the experimental phase,we use the new strategy to extract features.The experiment takes the public dataset of Microsoft Common Objects in Context(MS COCO)object detection and the dataset of Corona Virus Disease 2019(COVID-19)image classification as the experimental object respectively.The results show that the average recognition accuracy of COVID-19 in the classification dataset is improved to 98.163%,and small object detection in object detection tasks is improved by 4.0%. 展开更多
关键词 small object detection classification RPN MS COCO COVID-19
下载PDF
Interactive Transformer for Small Object Detection
3
作者 Jian Wei Qinzhao Wang Zixu Zhao 《Computers, Materials & Continua》 SCIE EI 2023年第11期1699-1717,共19页
The detection of large-scale objects has achieved high accuracy,but due to the low peak signal to noise ratio(PSNR),fewer distinguishing features,and ease of being occluded by the surroundings,the detection of small o... The detection of large-scale objects has achieved high accuracy,but due to the low peak signal to noise ratio(PSNR),fewer distinguishing features,and ease of being occluded by the surroundings,the detection of small objects,however,does not enjoy similar success.Endeavor to solve the problem,this paper proposes an attention mechanism based on cross-Key values.Based on the traditional transformer,this paper first improves the feature processing with the convolution module,effectively maintaining the local semantic context in the middle layer,and significantly reducing the number of parameters of the model.Then,to enhance the effectiveness of the attention mask,two Key values are calculated simultaneously along Query and Value by using the method of dual-branch parallel processing,which is used to strengthen the attention acquisition mode and improve the coupling of key information.Finally,focusing on the feature maps of different channels,the multi-head attention mechanism is applied to the channel attention mask to improve the feature utilization effect of the middle layer.By comparing three small object datasets,the plug-and-play interactive transformer(IT-transformer)module designed by us effectively improves the detection results of the baseline. 展开更多
关键词 small object detection ATTENTION TRANSFORMER plug-and-play
下载PDF
Small Object Detection via Precise Region-Based Fully Convolutional Networks
4
作者 Dengyong Zhang Jiawei Hu +3 位作者 Feng Li Xiangling Ding Arun Kumar Sangaiah Victor SSheng 《Computers, Materials & Continua》 SCIE EI 2021年第11期1503-1517,共15页
In the past several years,remarkable achievements have been made in the field of object detection.Although performance is generally improving,the accuracy of small object detection remains low compared with that of la... In the past several years,remarkable achievements have been made in the field of object detection.Although performance is generally improving,the accuracy of small object detection remains low compared with that of large object detection.In addition,localization misalignment issues are common for small objects,as seen in GoogLeNets and residual networks(ResNets).To address this problem,we propose an improved region-based fully convolutional network(R-FCN).The presented technique improves detection accuracy and eliminates localization misalignment by replacing positionsensitive region of interest(PS-RoI)pooling with position-sensitive precise region of interest(PS-Pr-RoI)pooling,which avoids coordinate quantization and directly calculates two-order integrals for position-sensitive score maps,thus preventing a loss of spatial precision.A validation experiment was conducted in which the Microsoft common objects in context(MS COCO)training dataset was oversampled.Results showed an accuracy improvement of 3.7%for object detection tasks and an increase of 6.0%for small objects. 展开更多
关键词 small object detection precise R-FCN PS-Pr-RoI pooling two-stage detector
下载PDF
MSC-YOLO:Improved YOLOv7 Based onMulti-Scale Spatial Context for Small Object Detection in UAV-View
5
作者 Xiangyan Tang Chengchun Ruan +2 位作者 Xiulai Li Binbin Li Cebin Fu 《Computers, Materials & Continua》 SCIE EI 2024年第4期983-1003,共21页
Accurately identifying small objects in high-resolution aerial images presents a complex and crucial task in thefield of small object detection on unmanned aerial vehicles(UAVs).This task is challenging due to variati... Accurately identifying small objects in high-resolution aerial images presents a complex and crucial task in thefield of small object detection on unmanned aerial vehicles(UAVs).This task is challenging due to variations inUAV flight altitude,differences in object scales,as well as factors like flight speed and motion blur.To enhancethe detection efficacy of small targets in drone aerial imagery,we propose an enhanced You Only Look Onceversion 7(YOLOv7)algorithm based on multi-scale spatial context.We build the MSC-YOLO model,whichincorporates an additional prediction head,denoted as P2,to improve adaptability for small objects.We replaceconventional downsampling with a Spatial-to-Depth Convolutional Combination(CSPDC)module to mitigatethe loss of intricate feature details related to small objects.Furthermore,we propose a Spatial Context Pyramidwith Multi-Scale Attention(SCPMA)module,which captures spatial and channel-dependent features of smalltargets acrossmultiple scales.This module enhances the perception of spatial contextual features and the utilizationof multiscale feature information.On the Visdrone2023 and UAVDT datasets,MSC-YOLO achieves remarkableresults,outperforming the baseline method YOLOv7 by 3.0%in terms ofmean average precision(mAP).The MSCYOLOalgorithm proposed in this paper has demonstrated satisfactory performance in detecting small targets inUAV aerial photography,providing strong support for practical applications. 展开更多
关键词 small object detection YOLOv7 multi-scale attention spatial context
下载PDF
Enhancing Dense Small Object Detection in UAV Images Based on Hybrid Transformer
6
作者 Changfeng Feng Chunping Wang +2 位作者 Dongdong Zhang Renke Kou Qiang Fu 《Computers, Materials & Continua》 SCIE EI 2024年第3期3993-4013,共21页
Transformer-based models have facilitated significant advances in object detection.However,their extensive computational consumption and suboptimal detection of dense small objects curtail their applicability in unman... Transformer-based models have facilitated significant advances in object detection.However,their extensive computational consumption and suboptimal detection of dense small objects curtail their applicability in unmanned aerial vehicle(UAV)imagery.Addressing these limitations,we propose a hybrid transformer-based detector,H-DETR,and enhance it for dense small objects,leading to an accurate and efficient model.Firstly,we introduce a hybrid transformer encoder,which integrates a convolutional neural network-based cross-scale fusion module with the original encoder to handle multi-scale feature sequences more efficiently.Furthermore,we propose two novel strategies to enhance detection performance without incurring additional inference computation.Query filter is designed to cope with the dense clustering inherent in drone-captured images by counteracting similar queries with a training-aware non-maximum suppression.Adversarial denoising learning is a novel enhancement method inspired by adversarial learning,which improves the detection of numerous small targets by counteracting the effects of artificial spatial and semantic noise.Extensive experiments on the VisDrone and UAVDT datasets substantiate the effectiveness of our approach,achieving a significant improvement in accuracy with a reduction in computational complexity.Our method achieves 31.9%and 21.1%AP on the VisDrone and UAVDT datasets,respectively,and has a faster inference speed,making it a competitive model in UAV image object detection. 展开更多
关键词 UAV images transformer dense small object detection
下载PDF
Density Map Guided Region Localization for End-to-End Small Object Detection
7
作者 Bo LI Kai HUANG +1 位作者 Junhui LI Yufu LIAO 《Journal of Systems Science and Information》 CSCD 2023年第6期776-794,共19页
With the advancement of society and science and technology, the demand for detecting small objects in practical scenarios becomes stronger. Such objects are only represented by relatively small coverage of pixels, and... With the advancement of society and science and technology, the demand for detecting small objects in practical scenarios becomes stronger. Such objects are only represented by relatively small coverage of pixels, and the features are degraded severely after being extracted by a deep convolutional neural network, which is detrimental to the detection performance for small objects. Therefore, an intuitive solution is to increase the resolution of small objects by cropping the original image. In this paper, we propose a simple but effective object density map guided region localization module (DMGRL) to locate and crop the regions of interest where small objects may exist. Firstly, the density map of the objects is estimated by object density map estimation network, and then the coordinates of the small object regions are calculated;Secondly, the continuous differentiable affine transformation is utilized to crop these regions so that the detector with DMGRL can be trained end-to-end instead of two-stage training. Finally, the all prediction results of input image and cropped region images are merged together to output the final detection results by non maximum suppression (NMS). Extensive experiments demonstrate the superior performance of the detector incorporated DMGRL. 展开更多
关键词 small object detection density map estimation end-to-end training affine transformation
原文传递
Feature-Enhanced RefineDet: Fast Detection of Small Objects 被引量:1
8
作者 Lei Zhao Ming Zhao 《Journal of Information Hiding and Privacy Protection》 2021年第1期1-8,共8页
Object detection has been studied for many years.The convolutional neural network has made great progress in the accuracy and speed of object detection.However,due to the low resolution of small objects and the repres... Object detection has been studied for many years.The convolutional neural network has made great progress in the accuracy and speed of object detection.However,due to the low resolution of small objects and the representation of fuzzy features,one of the challenges now is how to effectively detect small objects in images.Existing target detectors for small objects:one is to use high-resolution images as input,the other is to increase the depth of the CNN network,but these two methods will undoubtedly increase the cost of calculation and time-consuming.In this paper,based on the RefineDet network framework,we propose our network structure RF2Det by introducing Receptive Field Block to solve the problem of small object detection,so as to achieve the balance of speed and accuracy.At the same time,we propose a Medium-level Feature Pyramid Networks,which combines appropriate high-level context features with low-level features,so that the network can use the features of both the low-level and the high-level for multi-scale target detection,and the accuracy of the small target detection task based on the low-level features is improved.Extensive experiments on the MS COCO dataset demonstrate that compared to other most advanced methods,our proposed method shows significant performance improvement in the detection of small objects. 展开更多
关键词 small object detection feature fusion receptive field block
下载PDF
MPFracNet:A Deep Learning Algorithm for Metacarpophalangeal Fracture Detection with Varied Difficulties
9
作者 Geng Qin Ping Luo +5 位作者 Kaiyuan Li Yufeng Sun Shiwei Wang Xiaoting Li Shuang Liu Linyan Xue 《Computers, Materials & Continua》 SCIE EI 2023年第4期999-1015,共17页
Due to small size and high occult,metacarpophalangeal fracturediagnosis displays a low accuracy in terms of fracture detection and locationin X-ray images.To efficiently detect metacarpophalangeal fractures on Xrayima... Due to small size and high occult,metacarpophalangeal fracturediagnosis displays a low accuracy in terms of fracture detection and locationin X-ray images.To efficiently detect metacarpophalangeal fractures on Xrayimages as the second opinion for radiologists,we proposed a novel onestageneural network namedMPFracNet based onRetinaNet.InMPFracNet,a deformable bottleneck block(DBB)was integrated into the bottleneckto better adapt to the geometric variation of the fractures.Furthermore,an integrated feature fusion module(IFFM)was employed to obtain morein-depth semantic and shallow detail features.Specifically,Focal Loss andBalanced L1 Loss were introduced to respectively attenuate the imbalancebetween positive and negative classes and the imbalance between detectionand location tasks.We assessed the proposed model on the test set andachieved an AP of 80.4%for the metacarpophalangeal fracture detection.To estimate the detection performance for fractures with different difficulties,the proposed model was tested on the subsets of metacarpal,phalangeal andtiny fracture test sets and achieved APs of 82.7%,78.5%and 74.9%,respectively.Our proposed framework has state-of-the-art performance for detectingmetacarpophalangeal fractures,which has a strong potential application valuein practical clinical environments. 展开更多
关键词 Deep learning small object detection metacarpophalangeal fractures computer-aided diagnosis(CAD)
下载PDF
Research on Traffic Sign Detection Based on Improved YOLOv8
10
作者 Zhongjie Huang Lintao Li +1 位作者 Gerd Christian Krizek Linhao Sun 《Journal of Computer and Communications》 2023年第7期226-232,共7页
Aiming at solving the problem of missed detection and low accuracy in detecting traffic signs in the wild, an improved method of YOLOv8 is proposed. Firstly, combined with the characteristics of small target objects i... Aiming at solving the problem of missed detection and low accuracy in detecting traffic signs in the wild, an improved method of YOLOv8 is proposed. Firstly, combined with the characteristics of small target objects in the actual scene, this paper further adds blur and noise operation. Then, the asymptotic feature pyramid network (AFPN) is introduced to highlight the influence of key layer features after feature fusion, and simultaneously solve the direct interaction of non-adjacent layers. Experimental results on the TT100K dataset show that compared with the YOLOv8, the detection accuracy and recall are higher. . 展开更多
关键词 Traffic Sign detection small object detection YOLOv8 Feature Fusion
下载PDF
Small Target HelmetWearing Detection Algorithm Based on Improved YOLO V5
11
作者 Jiajing Hu Junqiu Li Qinghui Zhang 《国际计算机前沿大会会议论文集》 EI 2023年第1期60-77,共18页
To solve problems such as the low detection accuracy of helmet wear-ing,missing detection and poor real-time performance of embedded equipment in the scene of remote and small targets at the construction site,the text... To solve problems such as the low detection accuracy of helmet wear-ing,missing detection and poor real-time performance of embedded equipment in the scene of remote and small targets at the construction site,the text proposes an improved YOLO v5 for small target helmet wearing detection.Based on YOLO v5,the self-attention transformer mechanism and swin transformer module are introduced in the feature fusion step to increase the receptivefield of the con-volution kernel and globally model the high-level semantic feature information extracted from the backbone network to make the model more focused on hel-met feature learning.Replace some convolution operators with lighter and more efficient Involution operators to reduce the number of parameters.The connection mode of the Concat is improved,and 1×1 convolution is added.The experimental results compared with YOLO v5 show that the size of the improved helmet detec-tion model is reduced by 17.8%occupying only 33.2 MB,FPS increased by 5%,and mAP@0.5 reached 94.9%.This approach effectively improves the accuracy of small target helmet wear detection,and meets the deployment requirements for low computational power embedded devices. 展开更多
关键词 Helmet wearing detection YOLO V5 small object detection TRANSFORMER Swin Transformer INVOLUTION
原文传递
DLF-YOLOF:an improved YOLOF-based surface defect detection for steel plate
12
作者 Guang-hu Liu Mao-xiang Chu +1 位作者 Rong-fen Gong Ze-hao Zheng 《Journal of Iron and Steel Research(International)》 SCIE EI CAS CSCD 2024年第2期442-451,共10页
Surface defects can affect the quality of steel plate.Many methods based on computer vision are currently applied to surface defect detection of steel plate.However,their real-time performance and object detection of ... Surface defects can affect the quality of steel plate.Many methods based on computer vision are currently applied to surface defect detection of steel plate.However,their real-time performance and object detection of small defect are still unsatisfactory.An improved object detection network based on You Only Look One-level Feature(YOLOF)is proposed to show excellent performance in surface defect detection of steel plate,called DLF-YOLOF.First,the anchor-free detector is used to reduce the network hyperparameters.Secondly,deformable convolution network and local spatial attention module are introduced into the feature extraction network to increase the contextual information in the feature maps.Also,the soft non-maximum suppression is used to improve detection accuracy significantly.Finally,data augmentation is performed for small defect objects during training to improve detection accuracy.Experiments show the average precision and average precision for small objects are 42.7%and 33.5%at a detection speed of 62 frames per second on a single GPU,respectively.This shows that DLF-YOLOF has excellent performance to meet the needs of industrial real-time detection. 展开更多
关键词 Steel surface defects detection YOLOF Anchor-free detector small object detection Real-time detection
原文传递
A Study on Enhancing Chip Detection Efficiency Using the Lightweight Van-YOLOv8 Network
13
作者 Meng Huang Honglei Wei Xianyi Zhai 《Computers, Materials & Continua》 SCIE EI 2024年第4期531-547,共17页
In pursuit of cost-effective manufacturing,enterprises are increasingly adopting the practice of utilizing recycled semiconductor chips.To ensure consistent chip orientation during packaging,a circular marker on the f... In pursuit of cost-effective manufacturing,enterprises are increasingly adopting the practice of utilizing recycled semiconductor chips.To ensure consistent chip orientation during packaging,a circular marker on the front side is employed for pin alignment following successful functional testing.However,recycled chips often exhibit substantial surface wear,and the identification of the relatively small marker proves challenging.Moreover,the complexity of generic target detection algorithms hampers seamless deployment.Addressing these issues,this paper introduces a lightweight YOLOv8s-based network tailored for detecting markings on recycled chips,termed Van-YOLOv8.Initially,to alleviate the influence of diminutive,low-resolution markings on the precision of deep learning models,we utilize an upscaling approach for enhanced resolution.This technique relies on the Super-Resolution Generative Adversarial Network with Extended Training(SRGANext)network,facilitating the reconstruction of high-fidelity images that align with input specifications.Subsequently,we replace the original YOLOv8smodel’s backbone feature extraction network with the lightweight VanillaNetwork(VanillaNet),simplifying the branch structure to reduce network parameters.Finally,a Hybrid Attention Mechanism(HAM)is implemented to capture essential details from input images,improving feature representation while concurrently expediting model inference speed.Experimental results demonstrate that the Van-YOLOv8 network outperforms the original YOLOv8s on a recycled chip dataset in various aspects.Significantly,it demonstrates superiority in parameter count,computational intricacy,precision in identifying targets,and speed when compared to certain prevalent algorithms in the current landscape.The proposed approach proves promising for real-time detection of recycled chips in practical factory settings. 展开更多
关键词 Lightweight neural networks attention mechanisms image super-resolution enhancement feature extraction small object detection
下载PDF
Adaptive cropping shallow attention network for defect detection of bridge girder steel using unmanned aerial vehicle images 被引量:1
14
作者 Zonghan MU Yong QIN +4 位作者 Chongchong YU Yunpeng WU Zhipeng WANG Huaizhi YANG Yonghui HUANG 《Journal of Zhejiang University-Science A(Applied Physics & Engineering)》 SCIE EI CAS CSCD 2023年第3期243-256,共14页
Bridges are an important part of railway infrastructure and need regular inspection and maintenance.Using unmanned aerial vehicle(UAV)technology to inspect railway infrastructure is an active research issue.However,du... Bridges are an important part of railway infrastructure and need regular inspection and maintenance.Using unmanned aerial vehicle(UAV)technology to inspect railway infrastructure is an active research issue.However,due to the large size of UAV images,flight distance,and height changes,the object scale changes dramatically.At the same time,the elements of interest in railway bridges,such as bolts and corrosion,are small and dense objects,and the sample data set is seriously unbalanced,posing great challenges to the accurate detection of defects.In this paper,an adaptive cropping shallow attention network(ACSANet)is proposed,which includes an adaptive cropping strategy for large UAV images and a shallow attention network for small object detection in limited samples.To enhance the accuracy and generalization of the model,the shallow attention network model integrates a coordinate attention(CA)mechanism module and an alpha intersection over union(α-IOU)loss function,and then carries out defect detection on the bolts,steel surfaces,and railings of railway bridges.The test results show that the ACSANet model outperforms the YOLOv5s model using adaptive cropping strategy in terms of the total mAP(an evaluation index)and missing bolt mAP by 5%and 30%,respectively.Also,compared with the YOLOv5s model that adopts the common cropping strategy,the total mAP and missing bolt mAP are improved by 10%and 60%,respectively.Compared with the YOLOv5s model without any cropping strategy,the total mAP and missing bolt mAP are improved by 40%and 67%,respectively. 展开更多
关键词 RAILWAY BRIDGE Unmanned aerial vehicle(UAV)image small object detection Defect detection
原文传递
This work is licensed under a Creative Commons Attribution 4.0 International License,which permits unrestricted use,distribution,and reproduction in any medium,provided the original work is properly cited.A Real Time Vision-Based Smoking Detection Framework on Edge
15
作者 Ruilong Chen Guangfu Zeng +2 位作者 Ke Wang Lei Luo Zhiping Cai 《Journal on Internet of Things》 2020年第2期55-64,共10页
Smoking is the main reason for fire disaster and pollution in petrol station,construction site and warehouse.Existing solutions based on wearable devices and smoking sensors were costly and hard to obtain evidence of ... Smoking is the main reason for fire disaster and pollution in petrol station,construction site and warehouse.Existing solutions based on wearable devices and smoking sensors were costly and hard to obtain evidence of smoking in unmanned scenarios.With the developments of closed circuit television(CCTV)system,vision-based methods for object detection,mostly driven by deep learning techniques,were introduced recently.However,the massive GPU computing hardware required by the deep learning algorithm made these methods hard to be deployed.This paper aims at solving the smoking detection problem on edge and proposes the solution that has fast detection speed,high accuracy on micro-objects and low computing budget,i.e.,it could be deployed on the edge device such as NVIDIA JETSON TX2.We designed a new framework named RTVBS based on yolov3 and made a smoking dataset to train our model.We raised several methods to improve detection accuracy during the training step.The validation results show our model has excellent performance in smoking detection. 展开更多
关键词 Smoking detection small object detection real time CNN image processing
下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部