To satisfy the practical requirements of high real-time accuracy and low computational complexity in small ship target detection in synthetic aperture radar (SAR) images, this paper proposes a small ship target detection method based on an improved You Only Look Once version 3 (YOLOv3). The main contributions of this study are threefold. First, the feature extraction network of the original YOLOv3 algorithm is replaced with the convolution layers of the VGG16 network. Second, standard convolution is replaced with depthwise separable convolution, thereby reducing the computational cost of the algorithm. Third, a residual network structure is introduced into the feature extraction network to reuse shallow target feature information, which enhances the detailed features of the target and improves small target detection accuracy. To evaluate the proposed method, experiments are conducted on public SAR image datasets, and its effectiveness is verified for ship targets with complex backgrounds and for small ship targets. Results show that accuracy and recall improve by 5.31% and 2.77%, respectively, compared with the original YOLOv3. The proposed model thus significantly reduces the computational effort while also improving the detection accuracy of small ship targets.
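The cost saving from depthwise separable convolution mentioned above can be illustrated with a small parameter count. This is a minimal sketch, not the paper's network: the layer sizes (3×3 kernel, 256 input channels, 512 output channels, 52×52 output map) are assumed values chosen only for illustration, and biases are ignored.

```python
def standard_conv_cost(k, c_in, c_out, h_out, w_out):
    """Parameters and multiply-accumulates (MACs) of a k x k standard convolution."""
    params = k * k * c_in * c_out
    macs = params * h_out * w_out
    return params, macs


def depthwise_separable_cost(k, c_in, c_out, h_out, w_out):
    """Cost of a k x k depthwise convolution followed by a 1 x 1 pointwise convolution."""
    depthwise_params = k * k * c_in      # one k x k filter per input channel
    pointwise_params = c_in * c_out      # 1 x 1 convolution that mixes channels
    params = depthwise_params + pointwise_params
    macs = params * h_out * w_out
    return params, macs


# Hypothetical layer sizes, chosen only to make the ratio concrete.
std_params, std_macs = standard_conv_cost(3, 256, 512, 52, 52)
sep_params, sep_macs = depthwise_separable_cost(3, 256, 512, 52, 52)
ratio = sep_params / std_params  # equals 1 / c_out + 1 / k**2

print(std_params, sep_params, round(ratio, 4))
```

For a 3×3 kernel the ratio is dominated by the 1/k² term, so the separable layer needs roughly one ninth of the parameters and MACs of the standard layer.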
Infrared small target detection technology plays a pivotal role in critical military applications, including early warning systems and precision guidance for missiles and other defense mechanisms. Nevertheless, existing traditional methods face several significant challenges, including low background suppression ability, low detection rates, and high false alarm rates when identifying infrared small targets in complex environments. This paper proposes a novel infrared small target detection method based on a transformed Gaussian filter kernel and a clustering approach. The method provides improved background suppression and detection accuracy compared to traditional techniques while maintaining simplicity and lower computational costs. In the first step, the infrared image is filtered by a new filter kernel and the filtering results are normalized. In the second step, an adaptive thresholding method is used to determine the pixels belonging to small targets. In the final step, a fuzzy C-means clustering algorithm is employed to group the pixels of the same target, thus yielding the detection results. The results obtained from various real infrared image datasets demonstrate the superiority of the proposed method over traditional approaches. Compared with state-of-the-art traditional detection methods, the detection accuracy on the four sequences is increased by 2.06%, 0.95%, 1.03%, and 1.01%, respectively, and the false alarm rate is reduced, thus providing a more effective and robust solution.
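The abstract does not say which adaptive threshold is used in the second step. The sketch below assumes the common mean-plus-k-standard-deviations rule on the normalized filter response; the function name `adaptive_threshold` and the factor `k` are hypothetical and are not taken from the paper.

```python
import statistics


def adaptive_threshold(response, k=3.0):
    """Mark pixels whose response exceeds mean + k * std (an assumed rule)."""
    values = [v for row in response for v in row]
    mu = statistics.mean(values)
    sigma = statistics.pstdev(values)   # population standard deviation
    threshold = mu + k * sigma
    mask = [[1 if v > threshold else 0 for v in row] for row in response]
    return threshold, mask


# Toy 5 x 5 normalized response with a single bright pixel in the center.
response = [[0.0] * 5 for _ in range(5)]
response[2][2] = 1.0
threshold, mask = adaptive_threshold(response)
print(round(threshold, 4), sum(map(sum, mask)))
```

The pixels marked in `mask` would then be passed to the clustering step, which groups them into individual targets.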
Timely acquisition of rescue target information is critical for emergency response after a flood disaster. Unmanned aerial vehicles (UAVs) equipped with remote sensing capabilities offer distinct advantages, including high-resolution imagery and exceptional mobility, making them well suited for monitoring flood extent and identifying rescue targets during floods. However, interpreting rescue information in real time from flood images captured by UAVs is challenging because of the complexity of UAV image scenes, the lack of flood rescue target detection datasets, and the limited real-time processing capability of the airborne on-board platform. Thus, we propose a real-time rescue target detection method for UAVs that efficiently delineates flood extent and identifies rescue targets (i.e., pedestrians and vehicles trapped by floods). The proposed method achieves real-time rescue information extraction on UAV platforms through lightweight processing and fusion of a flood extent extraction model and a target detection model. It extracts the flood inundation extent in real time and, based on this layer, detects targets such as people and vehicles to be rescued. Our experimental results demonstrate that the Intersection over Union (IoU) for flood water extraction reaches 80%, and the IoU for real-time flood water extraction is 76.4%. The information on flood-stricken targets extracted by this method in real time can be used for flood emergency rescue.
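The IoU figures quoted above measure the overlap between a predicted water mask and the reference mask. A minimal sketch of that metric on binary masks follows; the toy masks are made up for illustration and the function name `mask_iou` is hypothetical.

```python
def mask_iou(pred, truth):
    """Intersection over Union of two equally sized binary masks (lists of rows)."""
    intersection = 0
    union = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                intersection += 1
            if p or t:
                union += 1
    # Two empty masks agree perfectly, so define their IoU as 1.0.
    return intersection / union if union else 1.0


pred = [[1, 1, 0],
        [0, 1, 0]]
truth = [[1, 0, 0],
         [0, 1, 1]]
print(mask_iou(pred, truth))  # 2 shared pixels out of 4 in the union -> 0.5
```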
Addressing the challenges in detecting surface floating litter in artificial lakes, including complex environments, uneven illumination, and susceptibility to noise and weather, this paper proposes an efficient and lightweight Ghost-YOLO (You Only Look Once) v8 algorithm. The algorithm integrates advanced attention mechanisms and a small-target detection head to significantly enhance detection performance and efficiency. Firstly, an SE (Squeeze-and-Excitation) mechanism is incorporated into the backbone network to strengthen the extraction of robust features and precise target localization. This mechanism models feature channel dependencies, enabling adaptive adjustment of channel importance and thereby improving recognition of floating litter targets. Secondly, a 160×160 small-target detection layer is designed in the feature fusion neck to mitigate semantic information loss due to varying target scales. This design enhances the fusion of deep and shallow semantic information, improving small target feature representation and enabling better capture and identification of tiny floating litter. Thirdly, to balance performance and efficiency, the GhostConv module replaces part of the conventional convolutions in the feature fusion neck. Additionally, a novel C2fGhost (CSPDarknet53 to 2-Stage Feature Pyramid Networks Ghost) module is introduced to further reduce network parameters. Lastly, to address the challenge of occlusion, a new loss function, WIoU (Wise Intersection over Union) v3, which incorporates a flexible and non-monotonic concentration approach, is adopted to improve detection rates for surface floating litter. The experimental results demonstrate that the proposed Ghost-YOLO v8 model performs well on the Marine dataset: compared with the base model, precision and recall improve by 3.3 and 7.6 percentage points, respectively, mAP@0.5 and mAP@0.5:0.95 improve by 5.3 and 4.4 percentage points, the computational volume is reduced by 1.88 MB, and the FPS value hardly decreases. Efficient real-time identification of floating debris on the water's surface can therefore be achieved cost-effectively.
Under the influence of air humidity, dust, aerosols, etc., haze in real scenes is unevenly distributed, which reduces image quality and contrast. In this case, it is difficult for a general-purpose detection network to detect targets in the image. Thus, a dual subnet based on multi-task collaborative training (DSMCT) is proposed in this paper. Firstly, in the training phase, the Gated Context Aggregation Network (GCANet) is used as the supervisory network of YOLOX to promote the extraction of clean information in foggy scenes. In the test phase, only the YOLOX branch needs to be activated, which preserves the detection speed of the model. Secondly, a deformable convolution module is used to improve GCANet and enhance the model's ability to capture details of non-homogeneous fog. Finally, the Coordinate Attention mechanism is introduced into the Vision Transformer and the backbone network of YOLOX is redesigned, which enhances the network's ability to extract deep-level features. The experimental results on the artificial fog dataset FOG_VOC and the real fog dataset RTTS show that the mAP of DSMCT reaches 86.56% and 62.39%, respectively, which is 2.27% and 4.41% higher than the current most advanced detection model. The DSMCT network is practical and effective for target detection in real foggy scenes.
This paper presents a novel, highly discriminative target detection method tailored for environments with very low luminance. Conventional methods struggle with luminance fluctuations, especially in low-light settings, and the problem is exacerbated by suboptimal imaging equipment. The proposed approach departs from the conventional YOLOX model, which handles these challenges inadequately. To improve performance in low-light conditions, the dehazing algorithm is refined to regulate the transmission rate at the pixel level, reducing it to values below 0.5 and thereby increasing image contrast. Subsequently, the coiflet wavelet transform is used to isolate highly discriminative features by decomposing low-frequency image components and extracting high-frequency components along different directions. CycleGAN is used to enhance the features of low-light images across a range of styles. Features from images of different styles are then fused, which increases the model's learning potential. Empirical validation on the PASCAL VOC and MS COCO 2017 datasets shows pronounced improvements. The refined low-light enhancement algorithm yields a 5.9% increase in the target detection evaluation index compared with the original images. Mean Average Precision (mAP) improves by 9.45% and 0.052% on low-light images relative to conventional YOLOX results. The proposed approach offers a number of advantages over prevailing benchmark methods for target detection in very low-light environments.
To address missed detections in water surface target detection when unmanned surface vehicle (USV) perception relies solely on visual algorithms, this paper proposes a water surface target detection method based on the fusion of vision and LiDAR point-cloud projection. Firstly, the visual recognition component employs an improved YOLOv7 algorithm, trained on a self-built dataset, for the detection of water surface targets. This algorithm modifies the original YOLOv7 architecture to a Slim-Neck structure, addressing the problem of excessive redundant information during feature extraction in the original YOLOv7 network model. At the same time, this modification reduces the computational burden of the detector and the inference time while maintaining accuracy. Secondly, to tackle sample imbalance in the self-built dataset, the slide loss function is introduced. Finally, this paper replaces the original Complete Intersection over Union (CIoU) loss function in the YOLOv7 algorithm with the Minimum Point Distance Intersection over Union (MPDIoU) loss function, which accelerates model learning and enhances robustness. To mitigate missed recognitions caused by complex water surface conditions in purely visual algorithms, this paper further fuses LiDAR and camera data, projecting the three-dimensional point-cloud data from LiDAR onto a two-dimensional pixel plane. This significantly reduces the rate of missed detections for water surface targets.
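The projection of LiDAR points onto the pixel plane mentioned above is usually done with a pinhole camera model. The sketch below assumes that model and ignores lens distortion; the extrinsics (`rotation`, `translation`) and intrinsics (`fx`, `fy`, `cx`, `cy`) are hypothetical calibration values, not those of the paper's platform.

```python
def project_point(point_lidar, rotation, translation, fx, fy, cx, cy):
    """Project one 3D LiDAR point to pixel coordinates (u, v), or None if behind the camera."""
    x, y, z = point_lidar
    # Rigid transform from the LiDAR frame to the camera frame: p_cam = R * p + t.
    xc = rotation[0][0] * x + rotation[0][1] * y + rotation[0][2] * z + translation[0]
    yc = rotation[1][0] * x + rotation[1][1] * y + rotation[1][2] * z + translation[1]
    zc = rotation[2][0] * x + rotation[2][1] * y + rotation[2][2] * z + translation[2]
    if zc <= 0:
        return None  # the point lies behind the image plane
    # Perspective division followed by the intrinsic mapping to pixels.
    u = fx * xc / zc + cx
    v = fy * yc / zc + cy
    return u, v


# Assumed calibration: LiDAR and camera frames coincide, 1280 x 720 image.
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
pixel = project_point((1.0, 0.5, 10.0), identity, (0.0, 0.0, 0.0),
                      fx=1000.0, fy=1000.0, cx=640.0, cy=360.0)
print(pixel)
```

Projected points that fall inside a camera detection box can then be associated with that detection, and clusters of points with no matching box can flag targets the visual detector missed.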
Substations have a complex operating environment with different safety distances for different live equipment, making them typical high-risk working areas, so it is crucial to accurately identify the type of live equipment during automated operations. This paper investigates the detection of live equipment under complex backgrounds and noise disturbances, designs a lightweight method for expanding disturbance data by fitting Gaussian-stretched positional information with recurrent neural networks and iterative optimization, and proposes MD-Yolov7, an intelligent detection method for substation environmental targets based on multilayer feature fusion (MLFF) and detection transformer (DETR). To verify the performance of the proposed method, an experimental test platform was built and validation experiments were carried out. The results show that the proposed method significantly improves the detection accuracy of live equipment compared with the comparison algorithms, reaching a mean average precision (mAP) of 99.2%, which verifies the feasibility and accuracy of the proposed method and indicates high application value.
Autonomous driving technology has entered a period of rapid development, and traffic sign detection is one of its important tasks. Existing target detection networks adapt poorly to scenarios where target sizes are seriously imbalanced, and traffic sign targets are small and have unclear features, which makes detection more difficult. Therefore, we propose a Hybrid Feature Fusion Traffic Sign detection algorithm based on YOLOv7 (HFFT-YOLO). First, a self-attention mechanism is incorporated at the end of the backbone network to calculate feature interactions within scales. Second, the cross-scale fusion part of the neck introduces a bottom-up multi-path fusion method, and reuse paths are designed at the end of the neck, paying particular attention to cross-scale fusion of high-level features. In addition, we determined an appropriate channel width through extensive experiments and removed superfluous parameters. In terms of training, a new regression loss, CMPDIoU, is proposed, which not only addresses loss degradation when the aspect ratio is the same but the width and height differ, but also enables the penalty term to change dynamically at different scales. Finally, the proposed method shows excellent results on the TT100K dataset. Compared with the baseline model, without increasing the number of parameters or the computational complexity, AP0.5 and AP increase by 2.2% and 2.7%, respectively, reaching 92.9% and 58.1%.
The detection of hypersonic targets usually confronts the range migration (RM) issue before coherent integration (CI). Traditional methods that correct RM to achieve CI mainly consider the narrow-band radar condition. However, with the increasing requirement for far-range detection, the time-bandwidth product, which corresponds to the radar's mean power, must be increased in practical applications. As a result, the echo signal exhibits the scale effect (SE) at large time-bandwidth products, which degrades intra-pulse and inter-pulse integration performance. To eliminate SE and correct RM, this paper proposes an effective algorithm, the scaled location rotation transform (ScLRT). ScLRT removes SE to obtain matched pulse compression (PC) and corrects RM to complete CI via the location rotation transform, which is implemented by searching for the actual rotation angle. Compared with traditional coherent detection algorithms, ScLRT addresses the SE problem and achieves better detection and estimation capabilities. Finally, this paper presents several simulations to assess the viability of ScLRT.
To address the high false alarm rate and low detection probability of infrared small target detection in complex low-altitude backgrounds, this paper proposes an infrared small target detection method based on improved weighted local contrast. First, the ratio information between the target and the local background is used as an enhancement factor, and the local contrast is calculated by incorporating the heterogeneity between the target and the local background. Then, a local product weighting method is designed based on the spatial dissimilarity between target and background to further enhance the target while suppressing the background. Finally, the location of the target is obtained by adaptive threshold segmentation. Experimental results demonstrate that the method outperforms six existing algorithms on several evaluation metrics across different datasets containing targets such as unmanned aerial vehicles (UAVs).
To solve the problems that current synthetic aperture radar (SAR) image target detection methods cannot adapt to targets of different sizes and that complex image backgrounds lead to low detection accuracy, this study proposes an improved SAR image small target detection method based on YOLOv7. The proposed method improves the feature extraction network by using Switchable Atrous Convolution (SAConv) in the backbone network to help the model capture target information at different scales, thus improving the feature extraction ability for small targets. The attention-based DyHead module is embedded in the target detection head to reduce the impact of complex backgrounds and to focus better on small targets. In addition, the NWD loss function is introduced and combined with the CIoU loss. Compared with the CIoU loss function typically used in YOLOv7, the NWD loss function pays more attention to small targets, further improving small target detection. The experimental results on the HRSID dataset indicate that the proposed method achieves mAP@0.5 and mAP@0.95 scores of 93.5% and 71.5%, respectively. Compared with the baseline model, this represents an increase of 7.2% and 7.6%, respectively. The proposed method can effectively complete the task of SAR image small target detection.
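The NWD (Normalized Wasserstein Distance) measure mentioned above models each box as a 2D Gaussian and compares the two Gaussians, which keeps the similarity smooth even when tiny boxes do not overlap. The sketch below follows the commonly published form of NWD; the constant `c` is dataset dependent and the value used here is only an illustrative assumption, as is the way the abstract's method weights NWD against CIoU, which is not shown.

```python
import math


def nwd(box_a, box_b, c=12.8):
    """Normalized Wasserstein Distance between two boxes given as (cx, cy, w, h)."""
    cxa, cya, wa, ha = box_a
    cxb, cyb, wb, hb = box_b
    # Squared 2-Wasserstein distance between Gaussians with means (cx, cy)
    # and standard deviations (w / 2, h / 2).
    w2_squared = ((cxa - cxb) ** 2 + (cya - cyb) ** 2
                  + ((wa - wb) / 2) ** 2 + ((ha - hb) / 2) ** 2)
    return math.exp(-math.sqrt(w2_squared) / c)


def nwd_loss(box_a, box_b, c=12.8):
    """Loss form: 0 for identical boxes, approaching 1 as the boxes move apart."""
    return 1.0 - nwd(box_a, box_b, c)


# Two 4 x 4 boxes that do not overlap at all (IoU would be exactly 0).
similarity = nwd((10.0, 10.0, 4.0, 4.0), (20.0, 10.0, 4.0, 4.0))
print(round(similarity, 4), round(nwd_loss((10.0, 10.0, 4.0, 4.0), (20.0, 10.0, 4.0, 4.0)), 4))
```

Unlike IoU, the similarity for the two disjoint boxes is nonzero, so the loss still provides a gradient signal for small targets.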
In oriented bounding box (OBB) object detection in high-resolution remote sensing images, small targets are often missed or wrongly detected because they are too small and have different orientations. Existing OBB object detection methods for remote sensing images, although making good progress, mainly focus on orientation modeling and give less consideration to object size and missed detections. In this study, a method based on improved YOLOv8 is proposed for detecting oriented objects in remote sensing images, which improves the detection precision of oriented objects. Firstly, the ResCBAMG module was designed to better extract channel and spatial correlation information. Secondly, a top-down feature fusion layer network structure was proposed in conjunction with the Efficient Channel Attention (ECA) module, which helps to capture local cross-channel interaction information. Finally, we introduced the ResCBAMG module between the different C2f modules and detection heads of the bottom-up feature fusion layer. This structure helps the model focus on the target area and also improves the precision and robustness of oriented target detection. Experimental results on the DOTA-v1.5 dataset show that the Precision, mAP@0.5, and mAP@0.5:0.95 of the improved model are better than those of the original model. The improvement is effective for detecting small targets and for complex scenes.
This paper presents a method for detecting small infrared targets under complex backgrounds. An algorithm named local mutation weighted information entropy (LMWIE) is proposed to suppress the background. Then, the grey value of targets is enhanced by calculating the local energy. Image segmentation based on an adaptive threshold is used to handle the problem that the grey value of noise is enhanced along with the grey value of targets. Experimental results show that, compared with the adaptive Butterworth high-pass filter method, the proposed algorithm is more effective and faster for infrared small target detection.
Object detection plays an important role in the sorting process of mechanical fasteners. Although object detection has been studied for many years, it remains a problem in industrial settings. Edge-based model matching is only suitable for a small range of illumination changes, and its matching accuracy is low. The optical flow method and the difference method are sensitive to noise and light, and camshift tracking is less effective in complex backgrounds. In this paper, an improved target detection method based on YOLOv3-tiny is proposed. The redundant regression boxes generated by the prediction network are filtered by soft non-maximum suppression (NMS) instead of the hard-decision NMS algorithm. The method also adds a 52×52 prediction scale to the network structure, which improves the detection accuracy of small targets, and uses the MobileNetv2 basic building block in the feature extraction network, which enhances feature extraction with the additional network layers and improves network performance. The experimental results show that the improved YOLOv3-tiny target detection algorithm improves the detection of bolts, nuts, screws and gaskets. The accuracy for each individual type has improved, which shows that the network is considerably better at learning objects with slightly complex features. The detection results for objects with simple shape features improve only slightly, and their recognition accuracy remains higher than that of the other types. The average accuracy increases from 0.813 to 0.839, an increase of more than two percentage points. The recall rate increases from 0.804 to 0.821.
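The difference between hard NMS and the soft NMS used above is that overlapping boxes have their scores decayed rather than being discarded outright. The abstract does not say which decay is used; the sketch below assumes the Gaussian variant, with `sigma` and `score_threshold` as illustrative values and boxes given as `(x1, y1, x2, y2)`.

```python
import math


def iou(a, b):
    """Intersection over Union of two boxes given as (x1, y1, x2, y2)."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def soft_nms(boxes, scores, sigma=0.5, score_threshold=0.001):
    """Gaussian soft-NMS: returns (box, score) pairs in the order they are selected."""
    candidates = list(zip(boxes, scores))
    kept = []
    while candidates:
        # Select the highest-scoring remaining box.
        best = max(range(len(candidates)), key=lambda i: candidates[i][1])
        best_box, best_score = candidates.pop(best)
        kept.append((best_box, best_score))
        # Decay the scores of the remaining boxes according to their overlap.
        rescored = []
        for box, score in candidates:
            score *= math.exp(-(iou(best_box, box) ** 2) / sigma)
            if score >= score_threshold:
                rescored.append((box, score))
        candidates = rescored
    return kept


boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (50, 50, 60, 60)]
scores = [0.9, 0.8, 0.7]
for box, score in soft_nms(boxes, scores):
    print(box, round(score, 3))
```

In this toy input the second box overlaps the first heavily; hard NMS with a 0.5 IoU threshold would drop it, while soft NMS keeps it with a reduced score, which helps when fasteners lie close together.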
Rapid and precise vehicle recognition and classification are essential for intelligent transportation systems, and road target detection is one of the most difficult tasks in the field of computer vision. The challenge in real-time road target detection is to accurately locate relatively small vehicles in complicated environments. Because road targets tend to have complicated backgrounds and sparse features, it is difficult to detect and identify vehicle types quickly and reliably. To overcome this problem, we propose a new vehicle detection model called MEB-YOLO, which combines Mosaic and MixUp data augmentation, the Efficient Channel Attention (ECA) mechanism, and the Bidirectional Feature Pyramid Network (BiFPN) with the You Only Look Once (YOLO) model. The model consists of four sections: Input, Backbone, Neck, and Prediction. First, MixUp and Mosaic data augmentation are used during image preprocessing to enrich the detection dataset and strengthen the network. Second, an attention mechanism is introduced into the backbone network, Cross Stage Partial Darknet (CSPDarknet), to reduce the influence of irrelevant features in images. Third, to achieve more sophisticated feature fusion without increasing the computing cost, the BiFPN structure is used to build the Neck network of the model. The final prediction results are then obtained using a Decoupled Head. Experiments demonstrate that the proposed model outperforms several existing detection methods and delivers good detection results on the University at Albany DEtection and TRACking (UA-DETRAC) public dataset. It also enables effective vehicle detection on real traffic monitoring data. The technique is therefore efficient for detecting road targets.
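MixUp, one of the augmentations named above, blends two training images pixel by pixel. The sketch below is a minimal illustration on toy grayscale images stored as nested lists; drawing the blend factor from a Beta distribution and keeping the boxes of both images are common conventions assumed here, not details confirmed by the abstract, and `alpha` is a hypothetical value.

```python
import random


def mixup(image_a, image_b, boxes_a, boxes_b, lam=None, alpha=1.5):
    """Blend two equally sized images and merge their box labels."""
    if lam is None:
        lam = random.betavariate(alpha, alpha)  # blend factor in (0, 1)
    mixed = [
        [lam * pa + (1.0 - lam) * pb for pa, pb in zip(row_a, row_b)]
        for row_a, row_b in zip(image_a, image_b)
    ]
    # For detection, the objects of both source images remain visible,
    # so the label set is the concatenation of both box lists.
    return mixed, boxes_a + boxes_b, lam


image_a = [[0.0, 100.0], [200.0, 50.0]]
image_b = [[100.0, 100.0], [0.0, 150.0]]
mixed, boxes, lam = mixup(image_a, image_b, [("car", 0, 0, 1, 1)], [("bus", 1, 1, 2, 2)], lam=0.25)
print(mixed, boxes, lam)
```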
In this paper, an improved YOLOv7 model is proposed to tackle the challenges associated with ship detection and recognition tasks, such as the irregular shapes and varying sizes of ships. The improved model replaces the fixed anchor boxes used in conventional YOLOv7 models with a set of more suitable anchor boxes designed from the size distribution of ships in the dataset. This paper also introduces a novel multi-scale feature fusion module, which comprises Path Aggregation Network (PAN) modules, enabling the efficient capture of ship features across different scales. Furthermore, data preprocessing is enhanced through data augmentation techniques, including random rotation, scaling, and cropping, which increase data diversity and robustness. The distribution of positive and negative samples in the dataset is balanced using random sampling, ensuring a more accurate representation of real-world scenarios. Comprehensive experimental results demonstrate that the proposed method significantly outperforms existing state-of-the-art approaches in terms of both detection accuracy and robustness, highlighting the potential of the improved YOLOv7 model for practical applications in the maritime domain.
Small targets and occluded targets inevitably appear in images during shooting because of angle, distance, scene complexity, illumination intensity, and other factors. These targets have few effective pixels and few, indistinct features, which makes it difficult to extract useful features and easily leads to false, missed, and repeated detections, degrading the performance of target detection models. An improved faster region-based convolutional neural network (Faster RCNN) algorithm, CF-RCNN, which integrates the convolutional block attention module (CBAM) and feature pyramid networks (FPN), is proposed to improve the detection and recognition accuracy of small, occluded, or truncated objects in complex scenes. Firstly, the CBAM mechanism is integrated into the feature extraction network to improve the detection of occluded or truncated objects. Secondly, the FPN feature pyramid structure is introduced to obtain high-resolution, semantically strong features that enhance the detection of small objects. The experimental results show that the mean average precision of the improved algorithm on PASCAL VOC2012 reaches 76.1%, which is 13.8 percentage points higher than that of the commonly used Faster RCNN and other algorithms. Furthermore, it is better than the commonly used small sample target detection algorithm.
The scattering echo images of airborne Doppler weather radar are often contaminated by ground clutter, which reduces the accuracy and confidence of meteorological target detection. In this paper, a deep convolutional neural network (DCNN) is proposed for meteorological target detection and ground clutter suppression, with a large collection of airborne weather radar images as network input. For each weather radar image, the corresponding digital elevation model (DEM) image is extracted on the basis of the radar antenna scanning parameters and the aircraft position, and is fed to the network as a supplement for ground clutter suppression. The features of actual meteorological targets are learned in each bottleneck module of the proposed network and passed to deeper layers in the forward propagation process. The network parameters are then updated by back-propagating the training error. Experimental results on real measured images show that the proposed DCNN outperforms its counterparts in terms of six evaluation factors. Meanwhile, the network outputs are in good agreement with the expected meteorological detection results (labels). The results demonstrate that the proposed network is promising for meteorological observation applications and requires minimal changes to network variables or parameters.
The rapid advancement of intelligent transportation systems (ITS) and autonomous driving (AD) has highlighted the importance of accurate and efficient traffic sign detection. However, certain difficulties, such as balancing accuracy and real-time performance, hinder the deployment of traffic sign detection algorithms in ITS and AD domains. In this study, a novel traffic sign detection algorithm based on the bidirectional Res2Net architecture is proposed to achieve an improved balance between accuracy and speed. An enhanced backbone network module called C2Net, which uses an upgraded bidirectional Res2Net, is introduced to mitigate information loss in the feature extraction process and to achieve information complementarity. Furthermore, a squeeze-and-excitation attention mechanism is incorporated within the channel attention of the architecture to perform channel-level feature correction on the input feature map, which effectively retains valuable features while removing non-essential ones. A series of ablation experiments were conducted to validate the efficacy of the proposed method. The performance was evaluated on two datasets: the Tsinghua-Tencent 100K (TT100K) and the CSUST Chinese traffic sign detection benchmark 2021 (CCTSDB 2021). On the TT100K dataset, the method achieves precision, recall, and mAP@0.5 scores of 83.3%, 79.3%, and 84.2%, respectively. On the CCTSDB 2021 dataset, the method achieves precision, recall, and mAP@0.5 scores of 91.49%, 73.79%, and 81.03%, respectively. Experimental results show that the proposed method outperforms conventional models, including the faster region-based convolutional neural network (Faster R-CNN), the single shot multibox detector (SSD), and You Only Look Once version 5 (YOLOv5).
Abstract: To meet the practical requirements of real-time operation, high accuracy, and low computational complexity in small ship target detection in synthetic aperture radar (SAR) images, this paper proposes a small ship target detection method based on an improved You Only Look Once version 3 (YOLOv3). The main contributions of this study are threefold. First, the feature extraction network of the original YOLOv3 algorithm is replaced with the convolution layers of the VGG16 network. Second, standard convolution is replaced with depthwise separable convolution, thereby reducing the computational cost of the algorithm. Third, a residual network structure is introduced into the feature extraction network to reuse shallow target feature information, which enhances the detailed features of the target and improves small target detection accuracy. To evaluate the proposed method, experiments are conducted on public SAR image datasets, and the effectiveness of the algorithm is verified for ship targets with complex backgrounds and for small ship targets. Results show that accuracy and recall improved by 5.31% and 2.77%, respectively, compared with the original YOLOv3. Furthermore, the proposed model not only significantly reduces the computational effort but also improves the detection accuracy for small ship targets.
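The saving from depthwise separable convolution mentioned in this abstract can be illustrated by counting weights. The sketch below is not taken from the paper; the layer sizes (3×3 kernel, 64 input channels, 128 output channels) are hypothetical and biases are ignored.

```python
def standard_conv_params(k: int, c_in: int, c_out: int) -> int:
    """Weights in a standard k x k convolution (bias ignored)."""
    return k * k * c_in * c_out


def depthwise_separable_params(k: int, c_in: int, c_out: int) -> int:
    """Weights in a depthwise k x k convolution followed by a 1 x 1 pointwise convolution."""
    depthwise = k * k * c_in   # one k x k filter per input channel
    pointwise = c_in * c_out   # the 1 x 1 convolution mixes channels
    return depthwise + pointwise


# Hypothetical layer sizes, chosen only for illustration.
k, c_in, c_out = 3, 64, 128
standard = standard_conv_params(k, c_in, c_out)
separable = depthwise_separable_params(k, c_in, c_out)
ratio = separable / standard  # algebraically 1 / c_out + 1 / k**2
print(standard, separable, round(ratio, 4))
```

For these sizes the separable layer needs roughly 12% of the weights of the standard layer; the multiply-accumulate count per output position shrinks by the same factor.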
Funding: Supported by the Funding of Jiangsu University of Science and Technology under grant number 1132921208.
Abstract: Infrared small target detection technology plays a pivotal role in critical military applications, including early warning systems and precision guidance for missiles and other defense mechanisms. Nevertheless, existing traditional methods face several significant challenges, including weak background suppression, low detection rates, and high false alarm rates when identifying infrared small targets in complex environments. This paper proposes a novel infrared small target detection method based on a transformed Gaussian filter kernel and a clustering approach. The method provides improved background suppression and detection accuracy compared with traditional techniques while remaining simple and computationally inexpensive. In the first step, the infrared image is filtered with a new filter kernel and the filtering results are normalized. In the second step, an adaptive thresholding method is used to determine the pixels belonging to small targets. In the final step, a fuzzy C-means clustering algorithm is employed to group pixels of the same target, yielding the detection results. Results obtained on various real infrared image datasets demonstrate the superiority of the proposed method over traditional approaches. Compared with state-of-the-art traditional detection methods, the detection accuracy on the four sequences is increased by 2.06%, 0.95%, 1.03%, and 1.01%, respectively, and the false alarm rate is reduced, providing a more effective and robust solution.
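The normalization and adaptive thresholding steps described in this abstract can be sketched as follows. The abstract does not give the threshold rule, so the sketch assumes a common choice, mean plus `k` standard deviations of the normalized response; the function names, the value `k=2.0`, and the toy response map are all hypothetical.

```python
from statistics import mean, pstdev


def normalize(response):
    """Rescale a 2-D filter response (list of rows) to the range [0, 1]."""
    flat = [v for row in response for v in row]
    lo, hi = min(flat), max(flat)
    span = (hi - lo) or 1.0  # avoid division by zero on a flat map
    return [[(v - lo) / span for v in row] for row in response]


def adaptive_threshold(norm_map, k=2.0):
    """Return (row, col) of pixels above mean + k * std of the map (assumed rule)."""
    flat = [v for row in norm_map for v in row]
    threshold = mean(flat) + k * pstdev(flat)
    return [(r, c)
            for r, row in enumerate(norm_map)
            for c, v in enumerate(row)
            if v > threshold]


# Toy filter response with a single bright pixel in the centre.
response = [[0.1, 0.1, 0.1],
            [0.1, 0.9, 0.1],
            [0.1, 0.1, 0.1]]
candidates = adaptive_threshold(normalize(response))
print(candidates)
```

The candidate pixels returned here would then be passed to the clustering step, which groups pixels that belong to the same target.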
Funding: National Natural Science Foundation of China (No. 42271416); Guangxi Science and Technology Major Project (No. AA22068072); Shennongjia National Park Resources Comprehensive Investigation Research Project (No. SNJNP2023015).
Abstract: Timely acquisition of rescue target information is critical for emergency response after a flood disaster. Unmanned aerial vehicles (UAVs) equipped with remote sensing capabilities offer distinct advantages, including high-resolution imagery and exceptional mobility, making them well suited to monitoring flood extent and identifying rescue targets during floods. However, interpreting rescue information in real time from flood images captured by UAVs is challenging because of the complexity of the scenes in UAV images, the lack of flood rescue target detection datasets, and the limited real-time processing capability of airborne on-board platforms. We therefore propose a real-time rescue target detection method for UAVs that efficiently delineates flood extent and identifies rescue targets (i.e., pedestrians and vehicles trapped by floods). The proposed method achieves real-time rescue information extraction on UAV platforms through lightweight processing and fusion of a flood extent extraction model and a target detection model. It extracts the flood inundation extent in real time and, based on this layer, detects targets to be rescued, such as people and vehicles. Our experimental results show that the Intersection over Union (IoU) for flood water extraction reaches 80%, and the IoU for real-time flood water extraction is 76.4%. The information on flood-stricken targets extracted by this method in real time can be used for flood emergency rescue.
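The IoU figure quoted for flood water extraction compares a predicted water mask with a reference mask. A minimal sketch of that metric on binary masks follows; the function name, the toy masks, and the convention of returning 1.0 when both masks are empty are assumptions made for illustration.

```python
def mask_iou(pred, truth):
    """Intersection over Union of two binary masks given as lists of rows."""
    inter = 0
    union = 0
    for p_row, t_row in zip(pred, truth):
        for p, t in zip(p_row, t_row):
            if p and t:
                inter += 1
            if p or t:
                union += 1
    # Assumed convention: two empty masks agree perfectly.
    return inter / union if union else 1.0


# Toy 2 x 2 masks: 1 marks a pixel labelled as flood water.
predicted = [[1, 1],
             [0, 0]]
reference = [[1, 0],
             [1, 0]]
print(mask_iou(predicted, reference))
```

Here one pixel is shared and three pixels are marked in at least one mask, so the IoU is one third.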
Funding: Supported by the Henan Province Science and Technology Research Project (No. 242102210213).
Abstract: To address the challenges of detecting surface floating litter in artificial lakes, including complex environments, uneven illumination, and susceptibility to noise and weather, this paper proposes an efficient and lightweight Ghost-YOLO (You Only Look Once) v8 algorithm. The algorithm integrates advanced attention mechanisms and a small-target detection head to significantly enhance detection performance and efficiency. Firstly, an SE (Squeeze-and-Excitation) mechanism is incorporated into the backbone network to strengthen the extraction of robust features and precise target localization. This mechanism models feature channel dependencies, enabling adaptive adjustment of channel importance and thereby improving recognition of floating litter targets. Secondly, a 160×160 small-target detection layer is designed in the feature fusion neck to mitigate semantic information loss due to varying target scales. This design enhances the fusion of deep and shallow semantic information, improving small target feature representation and enabling better capture and identification of tiny floating litter. Thirdly, to balance performance and efficiency, the GhostConv module replaces part of the conventional convolutions in the feature fusion neck. Additionally, a novel C2fGhost (CSPDarknet53 to 2-Stage Feature Pyramid Networks Ghost) module is introduced to further reduce network parameters. Lastly, to address the challenge of occlusion, a new loss function, WIoU (Wise Intersection over Union) v3, which incorporates a flexible and non-monotonic concentration approach, is adopted to improve detection rates for surface floating litter. The experiments show that the proposed Ghost-YOLO v8 model performs well on the Marine dataset: compared with the base model, precision and recall improve by 3.3 and 7.6 percentage points, respectively, mAP@0.5 and mAP@0.5:0.95 improve by 5.3 and 4.4 percentage points, the computational volume is reduced by 1.88 MB, and the FPS value hardly decreases, so efficient real-time identification of floating debris on the water's surface can be achieved cost-effectively.
Funding: This work was jointly supported by the Special Fund for Transformation and Upgrade of Jiangsu Industry and Information Industry, Key Core Technologies (Equipment) Key Industrialization Projects in 2022 (No. CMHI-2022-RDG-004): "Key Technology Research for Development of Intelligent Wind Power Operation and Maintenance Mothership in Deep Sea".
Abstract: In real scenes, haze is unevenly distributed under the influence of air humidity, dust, aerosols, and similar factors, which reduces image quality and contrast. In such cases, it is difficult for a general-purpose detection network to detect the targets in the image. A dual subnet based on multi-task collaborative training (DSMCT) is therefore proposed in this paper. Firstly, in the training phase, the Gated Context Aggregation Network (GCANet) is used as the supervisory network of YOLOX to promote the extraction of clean information in foggy scenes. In the test phase, only the YOLOX branch needs to be activated, which preserves the detection speed of the model. Secondly, a deformable convolution module is used to improve GCANet and enhance the model's ability to capture details of non-homogeneous fog. Finally, the Coordinate Attention mechanism is introduced into the Vision Transformer, and the backbone network of YOLOX is redesigned. In this way, the network's ability to extract deep-level feature information is enhanced. Experimental results on the artificial fog dataset FOG_VOC and the real fog dataset RTTS show that the mAP of DSMCT reached 86.56% and 62.39%, respectively, which is 2.27% and 4.41% higher than the current most advanced detection model. The DSMCT network is practical and effective for target detection in real foggy scenes.
Funding: Supported by National Sciences Foundation of China Grants (No. 61902158).
Abstract: This paper presents a novel target detection method with high discriminative power, tailored to environments with very low luminance. Conventional methods struggle with luminosity fluctuations, especially in dim settings, and the problem is exacerbated by suboptimal imaging equipment. The proposed approach departs from the conventional YOLOX model, which handles these challenges inadequately. To improve performance in low-light conditions, the dehazing algorithm is refined to regulate the transmission rate at the pixel level, reducing it to values below 0.5 and thereby increasing image contrast. Subsequently, the coiflet wavelet transform is employed to isolate highly discriminative attributes by decomposing low-frequency image attributes and extracting high-frequency attributes along different axes. CycleGAN is used to enhance the features of low-light imagery across a range of styles. Advanced computational methods are then employed to fuse intricate attributes from images of distinct styles, thereby increasing the model's learning potential. Empirical validation on the PASCAL VOC and MS COCO 2017 datasets shows pronounced improvements. The refined low-light enhancement algorithm yields a 5.9% increase in the target detection evaluation index compared with the original imagery. Mean Average Precision (mAP) improves by 9.45% and 0.052% on low-light images relative to conventional YOLOX results. The proposed approach offers many advantages over prevailing benchmark methods for target detection in environments with very little light.
Funding: Supported by the National Natural Science Foundation of China (No. 51876114) and the Shanghai Engineering Research Center of Marine Renewable Energy (Grant No. 19DZ2254800).
Abstract: To address missed detections in water surface target detection when unmanned surface vehicle (USV) perception relies solely on visual algorithms, this paper proposes a water surface target detection method based on the fusion of vision and LiDAR point-cloud projection. Firstly, the visual recognition component employs an improved YOLOv7 algorithm, trained on a self-built dataset, to detect water surface targets. This algorithm modifies the original YOLOv7 architecture to a Slim-Neck structure, addressing the problem of excessive redundant information during feature extraction in the original YOLOv7 network model. At the same time, this modification reduces the computational burden of the detector, shortens inference time, and maintains accuracy. Secondly, to tackle sample imbalance in the self-built dataset, the slide loss function is introduced. Finally, this paper replaces the original Complete Intersection over Union (CIoU) loss function in the YOLOv7 algorithm with the Minimum Point Distance Intersection over Union (MPDIoU) loss function, which accelerates model learning and enhances robustness. To mitigate missed recognitions caused by complex water surface conditions in purely visual algorithms, this paper further fuses LiDAR and camera data by projecting the three-dimensional point-cloud data from the LiDAR onto a two-dimensional pixel plane. This significantly reduces the rate of missed detections for water surface targets.
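Projecting LiDAR points onto the pixel plane, as described in this abstract, is commonly done with a rigid transform into the camera frame followed by a pinhole projection. The sketch below assumes that model and ignores lens distortion; the function name, the identity extrinsics, and the intrinsic values (`fx`, `fy`, `cx`, `cy`) are hypothetical and not taken from the paper.

```python
def project_points(points, rotation, translation, fx, fy, cx, cy):
    """Project 3-D LiDAR points to pixel coordinates with a pinhole model.

    rotation is a 3 x 3 matrix (list of rows) and translation a 3-vector;
    together they map LiDAR coordinates into the camera frame.
    """
    pixels = []
    for point in points:
        # LiDAR frame -> camera frame: p_cam = R * p_lidar + t
        cam = [sum(rotation[i][j] * point[j] for j in range(3)) + translation[i]
               for i in range(3)]
        x_c, y_c, z_c = cam
        if z_c <= 0:
            continue  # point lies behind the camera and cannot be imaged
        u = fx * x_c / z_c + cx
        v = fy * y_c / z_c + cy
        pixels.append((u, v))
    return pixels


# Hypothetical calibration: LiDAR and camera frames coincide.
identity = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
no_shift = [0, 0, 0]
cloud = [(1.0, 2.0, 10.0), (0.0, 0.0, -5.0)]
print(project_points(cloud, identity, no_shift, fx=500, fy=500, cx=320, cy=240))
```

Once points are in pixel coordinates, those falling inside a detected bounding box can be associated with that target, which is one way the two sensors can confirm each other.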
Abstract: Substations have a complex operating environment in which different items of live equipment require different safety distances, making them typical high-risk working areas; accurately identifying the type of live equipment is therefore crucial during automated operations. This paper investigates the detection of live equipment under complex backgrounds and noise disturbances, designs a method for expanding lightweight disturbance data by fitting Gaussian stretched positional information with recurrent neural networks and iterative optimization, and proposes MD-Yolov7, an intelligent detection method for substation environmental targets based on multilayer feature fusion (MLFF) and the detection transformer (DETR). To verify the performance of the proposed method, an experimental test platform was built and validation experiments were carried out. The results show that the proposed method significantly improves the detection accuracy for live equipment compared with the comparison algorithms, reaching a mean average precision (mAP) of 99.2%, which verifies the feasibility and accuracy of the proposed method and indicates high application value.
Funding: Funded by the National Natural Science Foundation of China (Grant No. U2004163).
Abstract: Autonomous driving technology has entered a period of rapid development, and traffic sign detection is one of its important tasks. Existing target detection networks adapt poorly to scenarios in which target sizes are seriously imbalanced, and traffic sign targets are small with unclear features, which makes detection more difficult. Therefore, we propose a Hybrid Feature Fusion Traffic Sign detection algorithm based on YOLOv7 (HFFTYOLO). First, a self-attention mechanism is incorporated at the end of the backbone network to compute feature interactions within scales. Second, the cross-scale fusion part of the neck introduces a bottom-up multi-path fusion method, and reuse paths are designed at the end of the neck, paying particular attention to cross-scale fusion of high-level features. In addition, we found an appropriate channel width through extensive experiments and removed superfluous parameters. For training, a new regression loss, CMPDIoU, is proposed, which not only accounts for loss degradation when the aspect ratio is the same but the width and height differ, but also allows the penalty term to change dynamically at different scales. Finally, the proposed method shows excellent results on the TT100K dataset. Compared with the baseline model, and without increasing the number of parameters or the computational complexity, AP0.5 and AP increased by 2.2% and 2.7%, respectively, reaching 92.9% and 58.1%.
Funding: Supported by the National Natural Science Foundation of China (62101099); the Chinese Postdoctoral Science Foundation (2021M690558, 2022T150100, 2018M633352, 2019T120825); the Young Elite Scientist Sponsorship Program (YESS20200082); the Aeronautical Science Foundation of China (2022Z017080001); the Open Foundation of Science and Technology on Electronic Information Control Laboratory; and the Natural Science Foundation of Sichuan Province (2023NSFSC1386).
Abstract: The detection of hypersonic targets usually faces the range migration (RM) issue before coherent integration (CI). Traditional methods that correct RM to achieve CI mainly consider the narrow-band radar condition. However, with the increasing requirement for far-range detection, the time-bandwidth product, which corresponds to the radar's mean power, should be increased in practical applications. As a result, the echo signal exhibits the scale effect (SE) when the time-bandwidth product is large, which degrades intra-pulse and inter-pulse integration performance. To eliminate SE and correct RM, this paper proposes an effective algorithm, the scaled location rotation transform (ScLRT). ScLRT removes SE to obtain matched pulse compression (PC) and corrects RM to complete CI via the location rotation transform, which is implemented by searching for the actual rotation angle. Compared with traditional coherent detection algorithms, ScLRT addresses the SE problem and achieves better detection and estimation capabilities. Finally, this paper presents several simulations to assess the viability of ScLRT.
Funding: Supported by the National Natural Science Foundation of China (No. U1833203); the National Natural Science Foundation of China (No. 62301036); the Aviation Science Foundation (No. 2020Z019055001); and the China Postdoctoral Science Foundation Funded Project (No. 2022M720446).
Abstract: To address the high false alarm rate and low detection probability of infrared small target detection against complex low-altitude backgrounds, this paper proposes an infrared small target detection method based on improved weighted local contrast. First, the ratio information between the target and the local background is used as an enhancement factor, and the local contrast is calculated by incorporating the heterogeneity between the target and the local background. Then, a local product weighting method is designed based on the spatial dissimilarity between target and background to further enhance the target while suppressing the background. Finally, the location of the target is obtained by adaptive threshold segmentation. Experimental results show that the method outperforms six existing algorithms on several evaluation metrics across different datasets containing targets such as unmanned aerial vehicles (UAVs).
Abstract: Current synthetic aperture radar (SAR) image target detection methods cannot adapt to targets of different sizes, and complex image backgrounds lead to low detection accuracy. To solve these problems, this study proposes an improved SAR image small target detection method based on YOLOv7. The proposed method improves the feature extraction network by using Switchable Atrous Convolution (SAConv) in the backbone network to help the model capture target information at different scales, thus improving feature extraction for small targets. Based on the attention mechanism, the DyHead module is embedded in the target detection head to reduce the impact of complex backgrounds and focus better on small targets. In addition, the NWD loss function is introduced and combined with the CIoU loss. Compared with the CIoU loss function typically used in YOLOv7, the NWD loss function pays more attention to small targets, further improving small target detection. Experimental results on the HRSID dataset indicate that the proposed method achieved mAP@0.5 and mAP@0.95 scores of 93.5% and 71.5%, respectively, an increase of 7.2% and 7.6% over the baseline model. The proposed method can effectively complete the task of SAR image small target detection.
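The NWD loss named in this abstract is usually understood as the Normalized Wasserstein Distance, which models each box as a 2-D Gaussian and compares the Gaussians instead of the box overlap. The sketch below follows that commonly cited formulation as an assumption; the abstract does not state the formula, the constant `c` is a dataset-dependent placeholder, and the weighting used to combine NWD with CIoU is not given in the source and is therefore omitted.

```python
import math


def nwd(box_a, box_b, c=12.8):
    """Normalized Wasserstein Distance between two boxes given as (cx, cy, w, h).

    Assumed formulation: each box is a Gaussian with mean (cx, cy) and
    standard deviations (w / 2, h / 2); c is a placeholder normalizer.
    """
    cxa, cya, wa, ha = box_a
    cxb, cyb, wb, hb = box_b
    # Squared 2-Wasserstein distance between the two Gaussians.
    w2_squared = ((cxa - cxb) ** 2 + (cya - cyb) ** 2
                  + ((wa - wb) / 2) ** 2 + ((ha - hb) / 2) ** 2)
    return math.exp(-math.sqrt(w2_squared) / c)


def nwd_loss(box_a, box_b, c=12.8):
    """Loss that is 0 for identical boxes and approaches 1 as they diverge."""
    return 1.0 - nwd(box_a, box_b, c)


# Two small boxes that do not overlap: IoU would be 0, NWD still varies smoothly.
small_a = (10.0, 10.0, 4.0, 4.0)
small_b = (15.0, 10.0, 4.0, 4.0)
print(nwd(small_a, small_b), nwd_loss(small_a, small_b))
```

Because the measure depends on centre distance and size difference and not on overlap, it still provides a gradient for tiny boxes that miss each other by a few pixels, which is the usual argument for pairing it with an IoU-based loss.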
Abstract: In oriented bounding box (OBB) object detection for high-resolution remote sensing images, small targets are often missed or wrongly detected because they are very small and have different orientations. Existing OBB object detection methods for remote sensing images have made good progress but focus mainly on directional modeling and give less consideration to object size and to missed detections. In this study, a method based on an improved YOLOv8 is proposed for detecting oriented objects in remote sensing images, which improves the detection precision for such objects. Firstly, the ResCBAMG module was designed to better extract channel and spatial correlation information. Secondly, a top-down feature fusion layer network structure was proposed in conjunction with the Efficient Channel Attention (ECA) module, which helps to capture local cross-channel interaction information appropriately. Finally, a ResCBAMG module was introduced between the different C2f modules and the detection heads of the bottom-up feature fusion layer. This structure helps the model to focus better on the target area and also improves the precision and robustness of oriented target detection. Experimental results on the DOTA-v1.5 dataset show that the Precision, mAP@0.5, and mAP@0.5:0.95 of the improved model are better than those of the original model. The improvement is effective for detecting small targets and for complex scenes.
Funding: Supported by the National Natural Science Foundation of China (61171194).
Abstract: This paper presents a method for detecting small infrared targets against complex backgrounds. An algorithm named local mutation weighted information entropy (LMWIE) is proposed to suppress the background. Then, the grey value of targets is enhanced by calculating the local energy. Because the grey value of noise is also enhanced when the grey value of targets is raised, image segmentation based on an adaptive threshold is used to remove the noise. Experimental results show that, compared with the adaptive Butterworth high-pass filter method, the proposed algorithm is more effective and faster for infrared small target detection.
Funding: The authors gratefully acknowledge the support provided by the National Natural Science Foundation of China (No. U20A20265).
Abstract: Object detection plays an important role in the sorting process of mechanical fasteners. Although object detection has been studied for many years, it remains a problem in industrial settings. Edge-based model matching is only suitable for a small range of illumination changes, and its matching accuracy is low. The optical flow method and the difference method are sensitive to noise and light, and camshift tracking is less effective against complex backgrounds. In this paper, an improved target detection method based on YOLOv3-tiny is proposed. The redundant regression boxes generated by the prediction network are filtered by soft non-maximum suppression (NMS) instead of the hard-decision NMS algorithm. The method also adds a 52×52 scale to the network structure, which improves the detection accuracy for small targets, and uses the MobileNetv2 basic building block in the feature extraction network, which enhances feature extraction as the number of network layers increases and improves network performance. The experimental results show that the improved YOLOv3-tiny target detection algorithm improves the detection of bolts, nuts, screws, and gaskets. The accuracy for each single type has improved, which shows that the network greatly enhances the ability to learn objects with slightly complex features. The detection result for objects with simple shape features is slightly improved and remains higher than the recognition accuracy of other types. The average accuracy increased from 0.813 to 0.839, an increase of 2.6 percentage points. The recall rate increased from 0.804 to 0.821.
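The difference between soft NMS and hard-decision NMS mentioned in this abstract can be sketched with the Gaussian variant of soft NMS, in which the score of an overlapping box is decayed instead of the box being discarded. The abstract does not say which decay function the authors used, so the Gaussian form, the `sigma` value, the score threshold, and the toy boxes below are assumptions.

```python
import math


def box_iou(a, b):
    """IoU of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def soft_nms(boxes, scores, sigma=0.5, score_thresh=0.001):
    """Gaussian soft NMS; returns (index, final score) in selection order."""
    scores = list(scores)  # work on a copy
    remaining = list(range(len(boxes)))
    keep = []
    while remaining:
        best = max(remaining, key=lambda i: scores[i])
        if scores[best] < score_thresh:
            break  # everything left has been decayed to near zero
        remaining.remove(best)
        keep.append((best, scores[best]))
        for i in remaining:
            overlap = box_iou(boxes[best], boxes[i])
            # Decay instead of delete: heavy overlap means a strong penalty.
            scores[i] *= math.exp(-(overlap ** 2) / sigma)
    return keep


boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
scores = [0.9, 0.8, 0.7]
print(soft_nms(boxes, scores))
```

With a hard NMS threshold of 0.5, the second box (IoU of about 0.68 with the first) would be removed outright; here it survives with a reduced score, which can help when two real objects, such as adjacent fasteners, overlap heavily.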
Funding: Funded by the National Natural Science Foundation of China (NSFC) (No. 61170110) and the Zhejiang Provincial Natural Science Foundation of China (LY13F020043).
Abstract: Rapid and precise vehicle recognition and classification are essential for intelligent transportation systems, and road target detection is one of the most difficult tasks in the field of computer vision. The challenge in real-time road target detection is to localize relatively small vehicles accurately in complicated environments. Because road targets are prone to complicated backgrounds and sparse features, it is difficult to detect and identify vehicle types quickly and reliably. To overcome this problem, we propose a new vehicle detection model called MEB-YOLO, which combines Mosaic and MixUp data augmentation, the Efficient Channel Attention (ECA) mechanism, and the Bidirectional Feature Pyramid Network (BiFPN) with the You Only Look Once (YOLO) model. The model consists of four sections: Input, Backbone, Neck, and Prediction. First, to enrich the detection dataset and strengthen the network, MixUp and Mosaic data augmentation are used during the image processing step. Second, an attention mechanism is introduced into the backbone network, Cross Stage Partial Darknet (CSPDarknet), to reduce the influence of irrelevant features in images. Third, to achieve more sophisticated feature fusion without increasing computing cost, the BiFPN structure is used to build the Neck network of the model. The final prediction results are then obtained using a Decoupled Head. Experiments demonstrate that the proposed model outperforms several existing detection methods and delivers good detection results on the University at Albany DEtection and TRACking (UA-DETRAC) public dataset. It also enables effective vehicle detection on real traffic monitoring data. As a result, this technique is efficient for detecting road targets.
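The MixUp augmentation named in this abstract blends two training samples with a random weight. The sketch below shows the basic classification-style form on flattened pixel lists, assuming the weight is drawn from a Beta(alpha, alpha) distribution; the function name, the `alpha` value, and the toy data are hypothetical. Detection pipelines often keep the bounding boxes of both images instead of blending labels, and the abstract does not say which variant MEB-YOLO uses.

```python
import random


def mixup(image_a, label_a, image_b, label_b, alpha=0.2, lam=None):
    """Blend two samples: x = lam * x_a + (1 - lam) * x_b, and likewise for labels.

    Images are flat lists of pixel values; labels are one-hot lists.
    lam may be passed explicitly for reproducibility; otherwise it is
    sampled from Beta(alpha, alpha).
    """
    if lam is None:
        lam = random.betavariate(alpha, alpha)
    image = [lam * a + (1 - lam) * b for a, b in zip(image_a, image_b)]
    label = [lam * a + (1 - lam) * b for a, b in zip(label_a, label_b)]
    return image, label, lam


# Toy two-pixel "images" with one-hot labels for two vehicle classes.
car_pixels, car_label = [0.0, 4.0], [1.0, 0.0]
bus_pixels, bus_label = [8.0, 0.0], [0.0, 1.0]
mixed_image, mixed_label, used_lam = mixup(
    car_pixels, car_label, bus_pixels, bus_label, lam=0.25)
print(mixed_image, mixed_label)
```

Small values of `alpha` concentrate the sampled weight near 0 or 1, so most mixed samples stay close to one of the two originals.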
Funding: Supported by the Key R&D Project of Hainan Province (Grant Nos. ZDYF2022GXJS348, ZDYF2022SHFZ039).
Abstract: In this paper, an advanced YOLOv7 model is proposed to tackle the challenges associated with ship detection and recognition tasks, such as the irregular shapes and varying sizes of ships. The improved model replaces the fixed anchor boxes used in conventional YOLOv7 models with a set of more suitable anchor boxes designed specifically from the size distribution of ships in the dataset. This paper also introduces a novel multi-scale feature fusion module, which comprises Path Aggregation Network (PAN) modules and enables the efficient capture of ship features across different scales. Furthermore, data preprocessing is enhanced through data augmentation techniques, including random rotation, scaling, and cropping, which increase data diversity and robustness. The distribution of positive and negative samples in the dataset is balanced using random sampling, ensuring a more accurate representation of real-world scenarios. Comprehensive experimental results demonstrate that the proposed method significantly outperforms existing state-of-the-art approaches in both detection accuracy and robustness, highlighting the potential of the improved YOLOv7 model for practical applications in the maritime domain.
Funding: Sponsored by the Natural Science Research Program of Higher Education Jiangsu Province (19KJD520005); the Qing Lan Project of Jiangsu Province (Su Teacher's Letter [2021] No. 11); and the Young Teacher Development Fund of Pujiang Institute Nanjing Tech University ([2021] No. 73).
Abstract: Small targets and occluded targets inevitably appear in images during shooting because of the influence of angle, distance, scene complexity, illumination intensity, and other factors. These targets have few effective pixels and few, indistinct features, which makes it difficult to extract effective features and easily leads to false, missed, and repeated detections, degrading the performance of target detection models. An improved faster region convolutional neural network (RCNN) algorithm (CF-RCNN) that integrates the convolutional block attention module (CBAM) and feature pyramid networks (FPN) is proposed to improve the detection and recognition accuracy for small objects and for occluded or truncated objects in complex scenes. Firstly, the CBAM mechanism is integrated into the feature extraction network to improve the detection of occluded or truncated objects. Secondly, the FPN feature pyramid structure is introduced to obtain high-resolution features with strong semantic information and thereby enhance the detection of small objects. The experimental results show that the mean average precision of the improved algorithm on PASCAL VOC2012 reaches 76.1%, which is 13.8 percentage points higher than that of the commonly used Faster RCNN and other algorithms. Furthermore, it is better than the commonly used small sample target detection algorithm.
Funding: Supported by the China Ministry of Industry and Information Technology Foundation and the Aeronautical Science Foundation of China (ASFC-201920007002); the National Key Research and Development Plan (2021YFB1600603); and the Open Fund of Key Laboratory of Civil Aircraft Airworthiness Technology, Civil Aviation University of China.
Abstract: The scattering echo images of airborne Doppler weather radar are often degraded by ground clutter, which reduces the accuracy and confidence of meteorological target detection. In this paper, a deep convolutional neural network (DCNN) is proposed for meteorological target detection and ground clutter suppression, with a large collection of airborne weather radar images as network input. For each weather radar image, the corresponding digital elevation model (DEM) image is extracted on the basis of the radar antenna scanning parameters and the plane position, and is fed to the network as a supplement for ground clutter suppression. The features of actual meteorological targets are learned in each bottleneck module of the proposed network and convolved into deeper iterations in the forward propagation process. The network parameters are then updated by back-propagating the training error. Experimental results on real measured images show that the proposed DCNN outperforms its counterparts in terms of six evaluation factors. Meanwhile, the network outputs are in good agreement with the expected meteorological detection results (labels). This demonstrates that the proposed network is promising for meteorological observation applications, requiring minimal effort for changes to network variables or parameters.
Funding: Funded by the National Key R&D Program of China (Grant No. 2017YFB0802803) and the Beijing Natural Science Foundation (Grant No. 4202002).
Abstract: The rapid advancement of intelligent transportation systems (ITS) and autonomous driving (AD) has shown the importance of accurate and efficient detection of traffic signs. However, certain difficulties, such as balancing accuracy and real-time performance, hinder the deployment of traffic sign detection algorithms in the ITS and AD domains. In this study, a novel traffic sign detection algorithm based on the bidirectional Res2Net architecture was proposed to achieve an improved balance between accuracy and speed. An enhanced backbone network module called C2Net, which uses an upgraded bidirectional Res2Net, was introduced to mitigate information loss in the feature extraction process and to achieve information complementarity. Furthermore, a squeeze-and-excitation attention mechanism was incorporated within the channel attention of the architecture to perform channel-level feature correction on the input feature map, which effectively retains valuable features while removing non-essential ones. A series of ablation experiments was conducted to validate the efficacy of the proposed methodology. Performance was evaluated on two distinct datasets: the Tsinghua-Tencent 100K (TT100K) and the CSUST Chinese traffic sign detection benchmark 2021 (CCTSDB 2021). On the TT100K dataset, the method achieves precision, recall, and mAP@0.5 scores of 83.3%, 79.3%, and 84.2%, respectively. Similarly, on the CCTSDB 2021 dataset, the method achieves precision, recall, and mAP@0.5 scores of 91.49%, 73.79%, and 81.03%, respectively. Experimental results revealed that the proposed method outperformed conventional models, including the faster region-based convolutional neural network, the single shot multibox detector, and You Only Look Once version 5.