Automatic modulation recognition(AMR)of radiation source signals is a research focus in the field of cognitive radio.However,the AMR of radiation source signals at low SNRs still faces a great challenge.Therefore,the ...Automatic modulation recognition(AMR)of radiation source signals is a research focus in the field of cognitive radio.However,the AMR of radiation source signals at low SNRs still faces a great challenge.Therefore,the AMR method of radiation source signals based on two-dimensional data matrix and improved residual neural network is proposed in this paper.First,the time series of the radiation source signals are reconstructed into two-dimensional data matrix,which greatly simplifies the signal preprocessing process.Second,the depthwise convolution and large-size convolutional kernels based residual neural network(DLRNet)is proposed to improve the feature extraction capability of the AMR model.Finally,the model performs feature extraction and classification on the two-dimensional data matrix to obtain the recognition vector that represents the signal modulation type.Theoretical analysis and simulation results show that the AMR method based on two-dimensional data matrix and improved residual network can significantly improve the accuracy of the AMR method.The recognition accuracy of the proposed method maintains a high level greater than 90% even at -14 dB SNR.展开更多
The carcass layer of flexible pipe comprises a large-angle spiral structure with a complex interlocked stainless steel cross-section profile, which is mainly used to resist radial load. With the complex structure of t...The carcass layer of flexible pipe comprises a large-angle spiral structure with a complex interlocked stainless steel cross-section profile, which is mainly used to resist radial load. With the complex structure of the carcass layer, an equivalent simplified model is used to study the mechanical properties of the carcass layer. However, the current equivalent carcass model only considers the elastic deformation, and this simplification leads to huge errors in the calculation results. In this study, radial compression experiments were carried out to make the carcasses to undergo plastic deformation. Subsequently, a residual neural network based on the experimental data was established to predict the load-displacement curves of carcasses with different inner diameter in plastic states under radial compression.The established neural network model’s high precision was verified by experimental data, and the influence of the number of input variables on the accuracy of the neural network was discussed. The conclusion shows that the residual neural network model established based on the experimental data of the small-diameter carcass layer can predict the load-displacement curve of the large-diameter carcass layer in the plastic stage. With the decrease of input data, the prediction accuracy of residual network model in plasticity stage will decrease.展开更多
Bus arrival time prediction contributes to the quality improvement of public transport services.Passengers can arrange departure time effectively if they know the accurate bus arrival time in advance.We proposed a mac...Bus arrival time prediction contributes to the quality improvement of public transport services.Passengers can arrange departure time effectively if they know the accurate bus arrival time in advance.We proposed a machine⁃learning approach,RTSI⁃ResNet,to forecast the bus arrival time at target stations.The residual neural network framework was employed to model the bus route temporal⁃spatial information.It was found that the bus travel time on a segment between two stations not only had correlation with the preceding buses,but also had common change trends with nearby downstream/upstream segments.Two features about bus travel time and headway were extracted from bus route including target section in both forward and reverse directions to constitute the route temporal⁃spatial information,which reflects the road traffic conditions comprehensively.Experiments on the bus trajectory data of route No.10 in Shenzhen public transport system demonstrated that the proposed RTSI⁃ResNet outperformed other well⁃known methods(e.g.,RNN/LSTM,SVM).Specifically,the advantage was more significant when the distance between bus and the target station was farther.展开更多
Robotic grasps play an important role in the service and industrial fields,and the robotic arm can grasp the object properly depends on the accuracy of the grasping detection result.In order to predict grasping detect...Robotic grasps play an important role in the service and industrial fields,and the robotic arm can grasp the object properly depends on the accuracy of the grasping detection result.In order to predict grasping detection positions for known or unknown objects by a modular robotic system,a convolutional neural network(CNN)with the residual block is proposed,which can be used to generate accurate grasping detection for input images of the scene.The proposed model architecture was trained on the standard Cornell grasp dataset and evaluated on the test dataset.Moreover,it was evaluated on different types of household objects and cluttered multi-objects.On the Cornell grasp dataset,the accuracy of the model on image-wise splitting detection and object-wise splitting detection achieved 95.5%and 93.6%,respectively.Further,the real detection time per image was 109 ms.The experimental results show that the model can quickly detect the grasping positions of a single object or multiple objects in image pixels in real time,and it keeps good stability and robustness.展开更多
Taking the real part and the imaginary part of complex sound pressure of the sound field as features,a transfer learning model is constructed.Based on the pre-training of a large amount of underwater acoustic data in ...Taking the real part and the imaginary part of complex sound pressure of the sound field as features,a transfer learning model is constructed.Based on the pre-training of a large amount of underwater acoustic data in the preselected sea area using the convolutional neural network(CNN),the few-shot underwater acoustic data in the test sea area are retrained to study the underwater sound source ranging problem.The S5 voyage data of SWellEX-96 experiment is used to verify the proposed method,realize the range estimation for the shallow source in the experiment,and compare the range estimation performance of the underwater target sound source of four methods:matched field processing(MFP),generalized regression neural network(GRNN),traditional CNN,and transfer learning.Experimental data processing results show that the transfer learning model based on residual CNN can effectively realize range estimation in few-shot scenes,and the estimation performance is remarkably better than that of other methods.展开更多
Because behavior recognition is based on video frame sequences,this paper proposes a behavior recognition algorithm that combines 3D residual convolutional neural network(R3D)and long short-term memory(LSTM).First,the...Because behavior recognition is based on video frame sequences,this paper proposes a behavior recognition algorithm that combines 3D residual convolutional neural network(R3D)and long short-term memory(LSTM).First,the residual module is extended to three dimensions,which can extract features in the time and space domain at the same time.Second,by changing the size of the pooling layer window the integrity of the time domain features is preserved,at the same time,in order to overcome the difficulty of network training and over-fitting problems,the batch normalization(BN)layer and the dropout layer are added.After that,because the global average pooling layer(GAP)is affected by the size of the feature map,the network cannot be further deepened,so the convolution layer and maxpool layer are added to the R3D network.Finally,because LSTM has the ability to memorize information and can extract more abstract timing features,the LSTM network is introduced into the R3D network.Experimental results show that the R3D+LSTM network achieves 91%recognition rate on the UCF-101 dataset.展开更多
Traffic accidents are caused by driver fatigue or distraction in many cases.To prevent accidents,several low-cost hypovigilance(hypo-V)systems were developed in the past based on a multimodal-hybrid(physiological and ...Traffic accidents are caused by driver fatigue or distraction in many cases.To prevent accidents,several low-cost hypovigilance(hypo-V)systems were developed in the past based on a multimodal-hybrid(physiological and behavioral)feature set.Similarly in this paper,real-time driver inattention and fatigue(Hypo-Driver)detection system is proposed through multi-view cameras and biosignal sensors to extract hybrid features.The considered features are derived from non-intrusive sensors that are related to the changes in driving behavior and visual facial expressions.To get enhanced visual facial features in uncontrolled environment,three cameras are deployed on multiview points(0◦,45◦,and 90◦)of the drivers.To develop a Hypo-Driver system,the physiological signals(electroencephalography(EEG),electrocardiography(ECG),electro-myography(sEMG),and electrooculography(EOG))and behavioral information(PERCLOS70-80-90%,mouth aspect ratio(MAR),eye aspect ratio(EAR),blinking frequency(BF),head-titled ratio(HT-R))are collected and pre-processed,then followed by feature selection and fusion techniques.The driver behaviors are classified into five stages such as normal,fatigue,visual inattention,cognitive inattention,and drowsy.This improved hypo-Driver system utilized trained behavioral features by a convolutional neural network(CNNs),recurrent neural network and long short-term memory(RNN-LSTM)model is used to extract physiological features.After fusion of these features,the Hypo-Driver system is classified hypo-V into five stages based on trained layers and dropout-layer in the deep-residual neural network(DRNN)model.To test the performance of a hypo-Driver system,data from 20 drivers are acquired.The results of Hypo-Driver compared to state-of-theart methods are presented.Compared to the state-of-the-art Hypo-V system,on average,the Hypo-Driver system achieved a detection accuracy(AC)of 96.5%.The obtained results indicate that the Hypo-Driver system based on multimodal and multiview features outperforms other state-of-the-art driver Hypo-V systems by handling many anomalies.展开更多
Stochastic progressive photon mapping(SPPM)is one of the important global illumination methods in computer graphics.It can simulate caustics and specular-diffuse-specular lighting effects efficiently.However,as a bias...Stochastic progressive photon mapping(SPPM)is one of the important global illumination methods in computer graphics.It can simulate caustics and specular-diffuse-specular lighting effects efficiently.However,as a biased method,it always suffers from both bias and variance with limited iterations,and the bias and the variance bring multi-scale noises into SPPM renderings.Recent learning-based methods have shown great advantages on denoising unbiased Monte Carlo(MC)methods,but have not been leveraged for biased ones.In this paper,we present the first learning-based method specially designed for denoising-biased SPPM renderings.Firstly,to avoid conflicting denoising constraints,the radiance of final images is decomposed into two components:caustic and global.These two components are then denoised separately via a two-network framework.In each network,we employ a novel multi-residual block with two sizes of filters,which significantly improves the model’s capabilities,and makes it more suitable for multi-scale noises on both low-frequency and high-frequency areas.We also present a series of photon-related auxiliary features,to better handle noises while preserving illumination details,especially caustics.Compared with other state-of-the-art learning-based denoising methods that we apply to this problem,our method shows a higher denoising quality,which could efficiently denoise multi-scale noises while keeping sharp illuminations.展开更多
This paper presents an off-line handwritten signature verification system based on the Siamese network,where a hybrid architecture is used.The Residual neural Network(ResNet)is used to realize a powerful feature extra...This paper presents an off-line handwritten signature verification system based on the Siamese network,where a hybrid architecture is used.The Residual neural Network(ResNet)is used to realize a powerful feature extraction model such that Writer Independent(WI)features can be effectively learned.A single-layer Siamese Neural Network(NN)is used to realize a Writer Dependent(WD)classifier such that the storage space can be minimized.For the purpose of reducing the impact of the high intraclass variability of the signature and ensuring that the Siamese network can learn more effectively,we propose a method of selecting a reference signature as one of the inputs for the Siamese network.To take full advantage of the reference signature,we modify the conventional contrastive loss function to enhance the accuracy.By using the proposed techniques,the accuracy of the system can be increased by 5.9%.Based on the GPDS signature dataset,the proposed system is able to achieve an accuracy of 94.61%which is better than the accuracy achieved by the current state-of-the-art work.展开更多
Current cancer diagnosis procedure requires expert knowledge and is time-consuming,which raises the need to build an accurate diagnosis support system for lymphoma identification and classification.Many studies have s...Current cancer diagnosis procedure requires expert knowledge and is time-consuming,which raises the need to build an accurate diagnosis support system for lymphoma identification and classification.Many studies have shown promising results using Machine Learning and,recently,Deep Learning to detect malignancy in cancer cells.However,the diversity and complexity of the morphological structure of lymphoma make it a challenging classification problem.In literature,many attempts were made to classify up to four simple types of lymphoma.This paper presents an approach using a reliable model capable of diagnosing seven different categories of rare and aggressive lymphoma.These Lymphoma types are Classical Hodgkin Lymphoma,Nodular Lymphoma Predominant,Burkitt Lymphoma,Follicular Lymphoma,Mantle Lymphoma,Large B-Cell Lymphoma,and T-Cell Lymphoma.Our proposed approach uses Residual Neural Networks,ResNet50,with a Transfer Learning for lymphoma’s detection and classification.The model used results are validated according to the performance evaluation metrics:Accuracy,precision,recall,F-score,and kappa score for the seven multi-classes.Our algorithms are tested,and the results are validated on 323 images of 224×224 pixels resolution.The results are promising and show that our used model can classify and predict the correct lymphoma subtype with an accuracy of 91.6%.展开更多
Glaucoma is a prevalent cause of blindness worldwide.If not treated promptly,it can cause vision and quality of life to deteriorate.According to statistics,glaucoma affects approximately 65 million individuals globall...Glaucoma is a prevalent cause of blindness worldwide.If not treated promptly,it can cause vision and quality of life to deteriorate.According to statistics,glaucoma affects approximately 65 million individuals globally.Fundus image segmentation depends on the optic disc(OD)and optic cup(OC).This paper proposes a computational model to segment and classify retinal fundus images for glaucoma detection.Different data augmentation techniques were applied to prevent overfitting while employing several data pre-processing approaches to improve the image quality and achieve high accuracy.The segmentation models are based on an attention U-Net with three separate convolutional neural networks(CNNs)backbones:Inception-v3,visual geometry group 19(VGG19),and residual neural network 50(ResNet50).The classification models also employ a modified version of the above three CNN architectures.Using the RIM-ONE dataset,the attention U-Net with the ResNet50 model as the encoder backbone,achieved the best accuracy of 99.58%in segmenting OD.The Inception-v3 model had the highest accuracy of 98.79%for glaucoma classification among the evaluated segmentation,followed by the modified classification architectures.展开更多
This paper develops a fully data-driven,missingdata tolerant method for post-fault short-term voltage stability(STVS)assessment of power systems against the incomplete PMU measurements.The super-resolution perception(...This paper develops a fully data-driven,missingdata tolerant method for post-fault short-term voltage stability(STVS)assessment of power systems against the incomplete PMU measurements.The super-resolution perception(SRP),based on a deep residual learning convolutional neural network,is employed to cope with the missing PMU measurements.The incremental broad learning(BL)is used to rapidly update the model to maintain and enhance the online application performance.Being different from the state-of-the-art methods,the proposed method is fully data-driven and can fill up missing data under any PMU placement information loss and network topology change scenario.Simulation results demonstrate that the proposed method has the best performance in terms of STVS assessment accuracy and missing-data tolerance among the existing methods on the benchmark testing system.展开更多
Complex nature of underwater environment poses biggest challenge towards image acquisition and transmission of underwater images.This paper proposes an integrated approach which consists of a non-learning enhancement ...Complex nature of underwater environment poses biggest challenge towards image acquisition and transmission of underwater images.This paper proposes an integrated approach which consists of a non-learning enhancement method with deep Convolutional Neural Networks(CNN)for compression and reconstruction of the image.The proposed method does color and contrast correction for image enhancement.The enhanced images are down-sampled using 9-layer CNN followed by Discrete Wavelet Transform(DWT).The decompression is done by using Inverse DWT.Further,the sub-pixel up-sampled image is de-blurred using a three-layer CNN.Residual Dense CNN(RD-CNN)is used to improve the quality of the reconstructed image after deblurring.The quality of the reconstructed images is measured using Peak Signal to Noise Ratio(PSNR)and Structural Similarity Index Metric(SSIM).The proposed model provides better image enhancement,compression,and reconstruction quality than the existing state-of-the-art methods and Super Resolution CNN(SRCNN)respectively.展开更多
基金National Natural Science Foundation of China under Grant No.61973037China Postdoctoral Science Foundation under Grant No.2022M720419。
文摘Automatic modulation recognition(AMR)of radiation source signals is a research focus in the field of cognitive radio.However,the AMR of radiation source signals at low SNRs still faces a great challenge.Therefore,the AMR method of radiation source signals based on two-dimensional data matrix and improved residual neural network is proposed in this paper.First,the time series of the radiation source signals are reconstructed into two-dimensional data matrix,which greatly simplifies the signal preprocessing process.Second,the depthwise convolution and large-size convolutional kernels based residual neural network(DLRNet)is proposed to improve the feature extraction capability of the AMR model.Finally,the model performs feature extraction and classification on the two-dimensional data matrix to obtain the recognition vector that represents the signal modulation type.Theoretical analysis and simulation results show that the AMR method based on two-dimensional data matrix and improved residual network can significantly improve the accuracy of the AMR method.The recognition accuracy of the proposed method maintains a high level greater than 90% even at -14 dB SNR.
基金financially supported by the National Key R&D Program of China (2021YFA1003501)the National Natural Science Foundation of China (No.U1906233,11732004)the Fundamental Research Funds for the Central Universities (DUT20ZD213,DUT20LAB308)。
文摘The carcass layer of flexible pipe comprises a large-angle spiral structure with a complex interlocked stainless steel cross-section profile, which is mainly used to resist radial load. With the complex structure of the carcass layer, an equivalent simplified model is used to study the mechanical properties of the carcass layer. However, the current equivalent carcass model only considers the elastic deformation, and this simplification leads to huge errors in the calculation results. In this study, radial compression experiments were carried out to make the carcasses to undergo plastic deformation. Subsequently, a residual neural network based on the experimental data was established to predict the load-displacement curves of carcasses with different inner diameter in plastic states under radial compression.The established neural network model’s high precision was verified by experimental data, and the influence of the number of input variables on the accuracy of the neural network was discussed. The conclusion shows that the residual neural network model established based on the experimental data of the small-diameter carcass layer can predict the load-displacement curve of the large-diameter carcass layer in the plastic stage. With the decrease of input data, the prediction accuracy of residual network model in plasticity stage will decrease.
基金Sponsored by the Transportation Science and Technology Planning Project of Henan Province,China(Grant No.2019G-2-2).
文摘Bus arrival time prediction contributes to the quality improvement of public transport services.Passengers can arrange departure time effectively if they know the accurate bus arrival time in advance.We proposed a machine⁃learning approach,RTSI⁃ResNet,to forecast the bus arrival time at target stations.The residual neural network framework was employed to model the bus route temporal⁃spatial information.It was found that the bus travel time on a segment between two stations not only had correlation with the preceding buses,but also had common change trends with nearby downstream/upstream segments.Two features about bus travel time and headway were extracted from bus route including target section in both forward and reverse directions to constitute the route temporal⁃spatial information,which reflects the road traffic conditions comprehensively.Experiments on the bus trajectory data of route No.10 in Shenzhen public transport system demonstrated that the proposed RTSI⁃ResNet outperformed other well⁃known methods(e.g.,RNN/LSTM,SVM).Specifically,the advantage was more significant when the distance between bus and the target station was farther.
基金National Natural Science Foundation of China(No.52101346)Fundamental Research Funds for the Central Universities,China(No.2232019D3-61)Initial Research Fund for the Young Teachers of Donghua University,China。
文摘Robotic grasps play an important role in the service and industrial fields,and the robotic arm can grasp the object properly depends on the accuracy of the grasping detection result.In order to predict grasping detection positions for known or unknown objects by a modular robotic system,a convolutional neural network(CNN)with the residual block is proposed,which can be used to generate accurate grasping detection for input images of the scene.The proposed model architecture was trained on the standard Cornell grasp dataset and evaluated on the test dataset.Moreover,it was evaluated on different types of household objects and cluttered multi-objects.On the Cornell grasp dataset,the accuracy of the model on image-wise splitting detection and object-wise splitting detection achieved 95.5%and 93.6%,respectively.Further,the real detection time per image was 109 ms.The experimental results show that the model can quickly detect the grasping positions of a single object or multiple objects in image pixels in real time,and it keeps good stability and robustness.
基金supported by the National Natural Science Foundation of China(1197428611904274)+1 种基金the Shaanxi Young Science and Technology Star Program(2021KJXX-07)the fundamental research funding for characteristic disciplines(G2022WD0235)。
文摘Taking the real part and the imaginary part of complex sound pressure of the sound field as features,a transfer learning model is constructed.Based on the pre-training of a large amount of underwater acoustic data in the preselected sea area using the convolutional neural network(CNN),the few-shot underwater acoustic data in the test sea area are retrained to study the underwater sound source ranging problem.The S5 voyage data of SWellEX-96 experiment is used to verify the proposed method,realize the range estimation for the shallow source in the experiment,and compare the range estimation performance of the underwater target sound source of four methods:matched field processing(MFP),generalized regression neural network(GRNN),traditional CNN,and transfer learning.Experimental data processing results show that the transfer learning model based on residual CNN can effectively realize range estimation in few-shot scenes,and the estimation performance is remarkably better than that of other methods.
基金Supported by the Shaanxi Province Key Research and Development Project (No. 2021GY-280)Shaanxi Province Natural Science Basic Research Program (No. 2021JM-459)the National Natural Science Foundation of China (No. 61772417)
文摘Because behavior recognition is based on video frame sequences,this paper proposes a behavior recognition algorithm that combines 3D residual convolutional neural network(R3D)and long short-term memory(LSTM).First,the residual module is extended to three dimensions,which can extract features in the time and space domain at the same time.Second,by changing the size of the pooling layer window the integrity of the time domain features is preserved,at the same time,in order to overcome the difficulty of network training and over-fitting problems,the batch normalization(BN)layer and the dropout layer are added.After that,because the global average pooling layer(GAP)is affected by the size of the feature map,the network cannot be further deepened,so the convolution layer and maxpool layer are added to the R3D network.Finally,because LSTM has the ability to memorize information and can extract more abstract timing features,the LSTM network is introduced into the R3D network.Experimental results show that the R3D+LSTM network achieves 91%recognition rate on the UCF-101 dataset.
基金The authors extend their appreciation to the Deanship of Scientific Research at Imam Mohammad Ibn Saud Islamic University for funding this work through Research Group no.RG-21-07-01.
文摘Traffic accidents are caused by driver fatigue or distraction in many cases.To prevent accidents,several low-cost hypovigilance(hypo-V)systems were developed in the past based on a multimodal-hybrid(physiological and behavioral)feature set.Similarly in this paper,real-time driver inattention and fatigue(Hypo-Driver)detection system is proposed through multi-view cameras and biosignal sensors to extract hybrid features.The considered features are derived from non-intrusive sensors that are related to the changes in driving behavior and visual facial expressions.To get enhanced visual facial features in uncontrolled environment,three cameras are deployed on multiview points(0◦,45◦,and 90◦)of the drivers.To develop a Hypo-Driver system,the physiological signals(electroencephalography(EEG),electrocardiography(ECG),electro-myography(sEMG),and electrooculography(EOG))and behavioral information(PERCLOS70-80-90%,mouth aspect ratio(MAR),eye aspect ratio(EAR),blinking frequency(BF),head-titled ratio(HT-R))are collected and pre-processed,then followed by feature selection and fusion techniques.The driver behaviors are classified into five stages such as normal,fatigue,visual inattention,cognitive inattention,and drowsy.This improved hypo-Driver system utilized trained behavioral features by a convolutional neural network(CNNs),recurrent neural network and long short-term memory(RNN-LSTM)model is used to extract physiological features.After fusion of these features,the Hypo-Driver system is classified hypo-V into five stages based on trained layers and dropout-layer in the deep-residual neural network(DRNN)model.To test the performance of a hypo-Driver system,data from 20 drivers are acquired.The results of Hypo-Driver compared to state-of-theart methods are presented.Compared to the state-of-the-art Hypo-V system,on average,the Hypo-Driver system achieved a detection accuracy(AC)of 96.5%.The obtained results indicate that the Hypo-Driver system based on multimodal and multiview features outperforms other state-of-the-art driver Hypo-V systems by handling many anomalies.
基金This work was partially supported by the National Key Research and Development Program of China under Grant No.2017YFB0203000the National Natural Science Foundation of China under Grant Nos.61802187,61872223,and 61702311the Natural Science Foundation of Jiangsu Province of China under Grant No.BK20170857.
文摘Stochastic progressive photon mapping(SPPM)is one of the important global illumination methods in computer graphics.It can simulate caustics and specular-diffuse-specular lighting effects efficiently.However,as a biased method,it always suffers from both bias and variance with limited iterations,and the bias and the variance bring multi-scale noises into SPPM renderings.Recent learning-based methods have shown great advantages on denoising unbiased Monte Carlo(MC)methods,but have not been leveraged for biased ones.In this paper,we present the first learning-based method specially designed for denoising-biased SPPM renderings.Firstly,to avoid conflicting denoising constraints,the radiance of final images is decomposed into two components:caustic and global.These two components are then denoised separately via a two-network framework.In each network,we employ a novel multi-residual block with two sizes of filters,which significantly improves the model’s capabilities,and makes it more suitable for multi-scale noises on both low-frequency and high-frequency areas.We also present a series of photon-related auxiliary features,to better handle noises while preserving illumination details,especially caustics.Compared with other state-of-the-art learning-based denoising methods that we apply to this problem,our method shows a higher denoising quality,which could efficiently denoise multi-scale noises while keeping sharp illuminations.
文摘This paper presents an off-line handwritten signature verification system based on the Siamese network,where a hybrid architecture is used.The Residual neural Network(ResNet)is used to realize a powerful feature extraction model such that Writer Independent(WI)features can be effectively learned.A single-layer Siamese Neural Network(NN)is used to realize a Writer Dependent(WD)classifier such that the storage space can be minimized.For the purpose of reducing the impact of the high intraclass variability of the signature and ensuring that the Siamese network can learn more effectively,we propose a method of selecting a reference signature as one of the inputs for the Siamese network.To take full advantage of the reference signature,we modify the conventional contrastive loss function to enhance the accuracy.By using the proposed techniques,the accuracy of the system can be increased by 5.9%.Based on the GPDS signature dataset,the proposed system is able to achieve an accuracy of 94.61%which is better than the accuracy achieved by the current state-of-the-art work.
文摘Current cancer diagnosis procedure requires expert knowledge and is time-consuming,which raises the need to build an accurate diagnosis support system for lymphoma identification and classification.Many studies have shown promising results using Machine Learning and,recently,Deep Learning to detect malignancy in cancer cells.However,the diversity and complexity of the morphological structure of lymphoma make it a challenging classification problem.In literature,many attempts were made to classify up to four simple types of lymphoma.This paper presents an approach using a reliable model capable of diagnosing seven different categories of rare and aggressive lymphoma.These Lymphoma types are Classical Hodgkin Lymphoma,Nodular Lymphoma Predominant,Burkitt Lymphoma,Follicular Lymphoma,Mantle Lymphoma,Large B-Cell Lymphoma,and T-Cell Lymphoma.Our proposed approach uses Residual Neural Networks,ResNet50,with a Transfer Learning for lymphoma’s detection and classification.The model used results are validated according to the performance evaluation metrics:Accuracy,precision,recall,F-score,and kappa score for the seven multi-classes.Our algorithms are tested,and the results are validated on 323 images of 224×224 pixels resolution.The results are promising and show that our used model can classify and predict the correct lymphoma subtype with an accuracy of 91.6%.
文摘Glaucoma is a prevalent cause of blindness worldwide.If not treated promptly,it can cause vision and quality of life to deteriorate.According to statistics,glaucoma affects approximately 65 million individuals globally.Fundus image segmentation depends on the optic disc(OD)and optic cup(OC).This paper proposes a computational model to segment and classify retinal fundus images for glaucoma detection.Different data augmentation techniques were applied to prevent overfitting while employing several data pre-processing approaches to improve the image quality and achieve high accuracy.The segmentation models are based on an attention U-Net with three separate convolutional neural networks(CNNs)backbones:Inception-v3,visual geometry group 19(VGG19),and residual neural network 50(ResNet50).The classification models also employ a modified version of the above three CNN architectures.Using the RIM-ONE dataset,the attention U-Net with the ResNet50 model as the encoder backbone,achieved the best accuracy of 99.58%in segmenting OD.The Inception-v3 model had the highest accuracy of 98.79%for glaucoma classification among the evaluated segmentation,followed by the modified classification architectures.
基金The work was supported in part by National Natural Science Foundation of China(51807009,71931003,72061147004).
文摘This paper develops a fully data-driven,missingdata tolerant method for post-fault short-term voltage stability(STVS)assessment of power systems against the incomplete PMU measurements.The super-resolution perception(SRP),based on a deep residual learning convolutional neural network,is employed to cope with the missing PMU measurements.The incremental broad learning(BL)is used to rapidly update the model to maintain and enhance the online application performance.Being different from the state-of-the-art methods,the proposed method is fully data-driven and can fill up missing data under any PMU placement information loss and network topology change scenario.Simulation results demonstrate that the proposed method has the best performance in terms of STVS assessment accuracy and missing-data tolerance among the existing methods on the benchmark testing system.
文摘Complex nature of underwater environment poses biggest challenge towards image acquisition and transmission of underwater images.This paper proposes an integrated approach which consists of a non-learning enhancement method with deep Convolutional Neural Networks(CNN)for compression and reconstruction of the image.The proposed method does color and contrast correction for image enhancement.The enhanced images are down-sampled using 9-layer CNN followed by Discrete Wavelet Transform(DWT).The decompression is done by using Inverse DWT.Further,the sub-pixel up-sampled image is de-blurred using a three-layer CNN.Residual Dense CNN(RD-CNN)is used to improve the quality of the reconstructed image after deblurring.The quality of the reconstructed images is measured using Peak Signal to Noise Ratio(PSNR)and Structural Similarity Index Metric(SSIM).The proposed model provides better image enhancement,compression,and reconstruction quality than the existing state-of-the-art methods and Super Resolution CNN(SRCNN)respectively.