The Ki67 index (KI) is a standard clinical marker for tumor proliferation;however, its application is hindered by intratumoral heterogeneity. In this study, we used digital image analysis to comprehensively analyze Ki...The Ki67 index (KI) is a standard clinical marker for tumor proliferation;however, its application is hindered by intratumoral heterogeneity. In this study, we used digital image analysis to comprehensively analyze Ki67 heterogeneity and distribution patterns in breast carcinoma. Using Smart Pathology software, we digitized and analyzed 42 excised breast carcinoma Ki67 slides. Boxplots, histograms, and heat maps were generated to illustrate the KI distribution. We found that 30% of cases (13/42) exhibited discrepancies between global and hotspot KI when using a 14% KI threshold for classification. Patients with higher global or hotspot KI values displayed greater heterogenicity. Ki67 distribution patterns were categorized as randomly distributed (52%, 22/42), peripheral (43%, 18/42), and centered (5%, 2/42). Our sampling simulator indicated analyzing more than 10 high-power fields was typically required to accurately estimate global KI, with sampling size being correlated with heterogeneity. In conclusion, using digital image analysis in whole-slide images allows for comprehensive Ki67 profile assessment, shedding light on heterogeneity and distribution patterns. This spatial information can facilitate KI surveys of breast cancer and other malignancies.展开更多
The continuous growth in the scale of unmanned aerial vehicle (UAV) applications in transmission line inspection has resulted in a corresponding increase in the demand for UAV inspection image processing. Owing to its...The continuous growth in the scale of unmanned aerial vehicle (UAV) applications in transmission line inspection has resulted in a corresponding increase in the demand for UAV inspection image processing. Owing to its excellent performance in computer vision, deep learning has been applied to UAV inspection image processing tasks such as power line identification and insulator defect detection. Despite their excellent performance, electric power UAV inspection image processing models based on deep learning face several problems such as a small application scope, the need for constant retraining and optimization, and high R&D monetary and time costs due to the black-box and scene data-driven characteristics of deep learning. In this study, an automated deep learning system for electric power UAV inspection image analysis and processing is proposed as a solution to the aforementioned problems. This system design is based on the three critical design principles of generalizability, extensibility, and automation. Pre-trained models, fine-tuning (downstream task adaptation), and automated machine learning, which are closely related to these design principles, are reviewed. In addition, an automated deep learning system architecture for electric power UAV inspection image analysis and processing is presented. A prototype system was constructed and experiments were conducted on the two electric power UAV inspection image analysis and processing tasks of insulator self-detonation and bird nest recognition. The models constructed using the prototype system achieved 91.36% and 86.13% mAP for insulator self-detonation and bird nest recognition, respectively. This demonstrates that the system design concept is reasonable and the system architecture feasible .展开更多
Medical image analysis is an active research topic,with thousands of studies published in the past few years.Transfer learning(TL)including convolutional neural networks(CNNs)focused to enhance efficiency on an innova...Medical image analysis is an active research topic,with thousands of studies published in the past few years.Transfer learning(TL)including convolutional neural networks(CNNs)focused to enhance efficiency on an innovative task using the knowledge of the same tasks learnt in advance.It has played a major role in medical image analysis since it solves the data scarcity issue along with that it saves hardware resources and time.This study develops an EnhancedTunicate SwarmOptimization withTransfer Learning EnabledMedical Image Analysis System(ETSOTL-MIAS).The goal of the ETSOTL-MIAS technique lies in the identification and classification of diseases through medical imaging.The ETSOTL-MIAS technique involves the Chan Vese segmentation technique to identify the affected regions in the medical image.For feature extraction purposes,the ETSOTL-MIAS technique designs a modified DarkNet-53 model.To avoid the manual hyperparameter adjustment process,the ETSOTLMIAS technique exploits the ETSO algorithm,showing the novelty of the work.Finally,the classification of medical images takes place by random forest(RF)classifier.The performance validation of the ETSOTL-MIAS technique is tested on a benchmark medical image database.The extensive experimental analysis showed the promising performance of the ETSOTL-MIAS technique under different measures.展开更多
Biomedical image processing is widely utilized for disease detection and classification of biomedical images.Tongue color image analysis is an effective and non-invasive tool for carrying out secondary detection at an...Biomedical image processing is widely utilized for disease detection and classification of biomedical images.Tongue color image analysis is an effective and non-invasive tool for carrying out secondary detection at anytime and anywhere.For removing the qualitative aspect,tongue images are quantitatively inspected,proposing a novel disease classification model in an automated way is preferable.This article introduces a novel political optimizer with deep learning enabled tongue color image analysis(PODL-TCIA)technique.The presented PODL-TCIA model purposes to detect the occurrence of the disease by examining the color of the tongue.To attain this,the PODL-TCIA model initially performs image pre-processing to enhance medical image quality.Followed by,Inception with ResNet-v2 model is employed for feature extraction.Besides,political optimizer(PO)with twin support vector machine(TSVM)model is exploited for image classification process,shows the novelty of the work.The design of PO algorithm assists in the optimal parameter selection of the TSVM model.For ensuring the enhanced outcomes of the PODL-TCIA model,a wide-ranging experimental analysis was applied and the outcomes reported the betterment of the PODL-TCIA model over the recent approaches.展开更多
Carbon anodes are an essential part of the primary aluminum production. They are made of coal tar pitch, calcined petroleum coke, recycled anodes and butts. As pitch acts as a binder for the anode, its proper distribu...Carbon anodes are an essential part of the primary aluminum production. They are made of coal tar pitch, calcined petroleum coke, recycled anodes and butts. As pitch acts as a binder for the anode, its proper distribution in a green anode has a great impact on the properties of the baked anode. Information on cracks in anodes is important for the quality of the baked anode. There is presently no reliable method available to analyze and quantify the amount of coke, pitch and pores/cracks in a green anode sample. In this article, an image analysis technique has been described, that can analyze as well as quantify the area percentage of pores/cracks and weight percentages of pitch and coke. The novelty of the method is its capacity to differentiate the different components of anode.展开更多
Malaria is an important and worldwide fatal disease that has been widely reported by the World Health Organization(WHO),and it has about 219 million cases worldwide,with 435,000 of those mortal.The common malaria diag...Malaria is an important and worldwide fatal disease that has been widely reported by the World Health Organization(WHO),and it has about 219 million cases worldwide,with 435,000 of those mortal.The common malaria diagnosis approach is heavily reliant on highly trained experts,who use a microscope to examine the samples.Therefore,there is a need to create an automated solution for the diagnosis of malaria.One of the main objectives of this work is to create a design tool that could be used to diagnose malaria from the image of a blood sample.In this paper,we firstly developed a graphical user interface that could be used to help segment red blood cells and infected cells and allow the users to analyze the blood samples.Secondly,a Feed-forward Neural Network(FNN)is designed to classify the cells into two classes.The achieved results show that the proposed techniques can be used to detect malaria,as it has achieved 92%accuracy with a database that contains 27,560 benchmark images.展开更多
In this paper,an Automated Brain Image Analysis(ABIA)system that classifies the Magnetic Resonance Imaging(MRI)of human brain is presented.The classification of MRI images into normal or low grade or high grade plays ...In this paper,an Automated Brain Image Analysis(ABIA)system that classifies the Magnetic Resonance Imaging(MRI)of human brain is presented.The classification of MRI images into normal or low grade or high grade plays a vital role for the early diagnosis.The Non-Subsampled Shearlet Transform(NSST)that captures more visual information than conventional wavelet transforms is employed for feature extraction.As the feature space of NSST is very high,a statistical t-test is applied to select the dominant directional sub-bands at each level of NSST decomposition based on sub-band energies.A combination of features that includes Gray Level Co-occurrence Matrix(GLCM)based features,Histograms of Positive Shearlet Coefficients(HPSC),and Histograms of Negative Shearlet Coefficients(HNSC)are estimated.The combined feature set is utilized in the classification phase where a hybrid approach is designed with three classifiers;k-Nearest Neighbor(kNN),Naive Bayes(NB)and Support Vector Machine(SVM)classifiers.The output of individual trained classifiers for a testing input is hybridized to take a final decision.The quantitative results of ABIA system on Repository of Molecular Brain Neoplasia Data(REMBRANDT)database show the overall improved performance in comparison with a single classifier model with accuracy of 99% for normal/abnormal classification and 98% for low and high risk classification.展开更多
This review is focused on using computer image analysis as a means of objective and quantitative characterizing optical images of the macroscopic (e.g. microbial colonies) and the microscopic (e.g. single cell) object...This review is focused on using computer image analysis as a means of objective and quantitative characterizing optical images of the macroscopic (e.g. microbial colonies) and the microscopic (e.g. single cell) objects in the microbiological research. This is the way of making many visual inspection assays more objective and less time and labor consuming. Also, it can provide new visually inaccessible information on relation between some optical parameters and various biological features of the microbial cul-tures. Of special interest is application of image analysis in fluorescence microscopy as it opens new ways of using fluorescence based methodology for single microbial cell studies. Examples of using image analysis in the studies of both the macroscopic and the microscopic microbiological objects obtained by various imaging techniques are presented and discussed.展开更多
BACKGROUND Digital pathology image(DPI)analysis has been developed by machine learning(ML)techniques.However,little attention has been paid to the reproducibility of ML-based histological classification in heterochron...BACKGROUND Digital pathology image(DPI)analysis has been developed by machine learning(ML)techniques.However,little attention has been paid to the reproducibility of ML-based histological classification in heterochronously obtained DPIs of the same hematoxylin and eosin(HE)slide.AIM To elucidate the frequency and preventable causes of discordant classification results of DPI analysis using ML for the heterochronously obtained DPIs.METHODS We created paired DPIs by scanning 298 HE stained slides containing 584 tissues twice with a virtual slide scanner.The paired DPIs were analyzed by our MLaided classification model.We defined non-flipped and flipped groups as the paired DPIs with concordant and discordant classification results,respectively.We compared differences in color and blur between the non-flipped and flipped groups by L1-norm and a blur index,respectively.RESULTS We observed discordant classification results in 23.1%of the paired DPIs obtained by two independent scans of the same microscope slide.We detected no significant difference in the L1-norm of each color channel between the two groups;however,the flipped group showed a significantly higher blur index than the non-flipped group.CONCLUSION Our results suggest that differences in the blur-not the color-of the paired DPIs may cause discordant classification results.An ML-aided classification model for DPI should be tested for this potential cause of the reduced reproducibility of the model.In a future study,a slide scanner and/or a preprocessing method of minimizing DPI blur should be developed.展开更多
Due to their unique structural features, electrospun membranes have gained considerable attention for use in applications where quality of depth filtration is a dominant performance factor. To elucidate the depth filt...Due to their unique structural features, electrospun membranes have gained considerable attention for use in applications where quality of depth filtration is a dominant performance factor. To elucidate the depth filtration phenomena it is important to quantify the intrinsic structural properties independent from the dynamics of transport media. Several methods have been proposed for structural characterization of such membranes. However, these methods do not meet the requirement for the quantification of intrinsic structural properties in depth filtration. This may be due to the complex influence of transport media dynamics and structural elements in the depth filtration process. In addition, the different morphological architectures of electrospun membranes present obstacles to precise quantification. This paper seeks to quantify the structural characteristics of electrospun membranes by introducing a robust image analysis technique and exploiting it to evaluate the permeation-filtration mechanism. To this end, a nanostructured fibrous network was simulated as an ideal membrane using adaptive local criteria in the image analysis. The reliability of the proposed approach was validated with measurements and comparison of structural characteristics in different morphological conditions. The results were found to be well compatible with empirical observations of perfect membrane structures. This approach, based on optimization of electrospinning parameters, may pave the way for producing optimal membrane structures for boosting the performance of electrospun membranes in end-use applications.展开更多
Bubble dynamics properties play a crucial and significant role in the design and optimization of gas-solid fluidized beds.In this study,the bubble dynamics properties of four B-particles were investigated in a quasi-t...Bubble dynamics properties play a crucial and significant role in the design and optimization of gas-solid fluidized beds.In this study,the bubble dynamics properties of four B-particles were investigated in a quasi-two-dimensional(quasi-2D)fluidized bed,including bubble equivalent diameter,bubble size distribution,average bubble density,bubble aspect ratio,bubble hold-up,bed expansion ratio,bubble radial position,and bubble velocity.The studies were performed by computational particle fluid dynamics(CPFD)numerical simulation and post-processed with digital image analysis(DIA)technique,at superficial gas velocities ranging from 2u_(mf) to 7u_(mf).The simulated results shown that the CPFD simulation combining with DIA technique post-processing could be used as a reliable method for simulating bubble dynamics properties in quasi-2D gas-solid fluidized beds.However,it seemed not desirable for the simulation of bubble motion near the air distributor at higher superficial gas velocity from the simulated average bubble density distribution.The superficial gas velocity significantly affected the bubble equivalent diameter and evolution,while it had little influence on bubble size distribution and bubble aspect ratio distribution for the same particles.Both time-averaged bubble hold-up and bed expansion ratio increased with the increase of superficial gas velocity.Two core-annular flow structures could be found in the fluidized bed for all cases.The average bubble rising velocity increased with the increasing bubble equivalent diameter.For bubble lateral movement,the smaller bubbles might be more susceptible,and superficial gas velocity had a little influence on the absolute lateral velocity of bubbles.The simulated results presented a valuable and novel approach for studying bubble dynamics properties.The comprehensive understanding of bubble dynamics behaviors in quasi-2D gas-solid fluidized beds would provide support in the design,operation,and optimization of gas-solid fluidized bed reactors.展开更多
In situ microscopic imaging is a useful tool in monitoring crystallization processes,including crystal nucleation,growth,aggregation and breakage,as well as possible polymorphic transition.To convert the qualitative i...In situ microscopic imaging is a useful tool in monitoring crystallization processes,including crystal nucleation,growth,aggregation and breakage,as well as possible polymorphic transition.To convert the qualitative information to be quantitative for the purpose of process optimization and control,accurate analysis of crystal images is essential.However,the accuracy of image segmentation with traditional methods is largely affected by many factors,including solid concentration and image quality.In this study,the deep learning technique using mask region-based convolutional neural network(Mask R-CNN)is investigated for the analysis of on-line images from an industrial crystallizer of 10 m^(3) operated in continuous mode with high solid concentration and overlapped particles.With detailed label points for each crystal and transfer learning technique,two models trained with 70,908 and 7,709 crystals respectively are compared for the effect of training data amount.The former model effectively segments the aggregated and overlapped crystals even at high solid concentrations.Moreover,it performs much better than the latter one and traditional multi-scale method both in terms of precision and recall,revealing the importance of large number of crystals in deep learning.Some geometrical characteristics of segmented crystals are also analyzed,involving equivalent diameter,circularity,and aspect ratio.展开更多
Transformers have dominated the field of natural language processing and have recently made an impact in the area of computer vision.In the field of medical image analysis,transformers have also been successfully used...Transformers have dominated the field of natural language processing and have recently made an impact in the area of computer vision.In the field of medical image analysis,transformers have also been successfully used in to full-stack clinical applications,including image synthesis/reconstruction,registration,segmentation,detection,and diagnosis.This paper aimed to promote awareness of the applications of transformers in medical image analysis.Specifically,we first provided an overview of the core concepts of the attention mechanism built into transformers and other basic components.Second,we reviewed various transformer architectures tailored for medical image applications and discuss their limitations.Within this review,we investigated key challenges including the use of transformers in different learning paradigms,improving model efficiency,and coupling with other techniques.We hope this review would provide a comprehensive picture of transformers to readers with an interest in medical image analysis.展开更多
Artificial Intelligence(AI)is being increasingly used for diagnosing Vision-Threatening Diabetic Retinopathy(VTDR),which is a leading cause of visual impairment and blindness worldwide.However,previous automated VTDR ...Artificial Intelligence(AI)is being increasingly used for diagnosing Vision-Threatening Diabetic Retinopathy(VTDR),which is a leading cause of visual impairment and blindness worldwide.However,previous automated VTDR detection methods have mainly relied on manual feature extraction and classification,leading to errors.This paper proposes a novel VTDR detection and classification model that combines different models through majority voting.Our proposed methodology involves preprocessing,data augmentation,feature extraction,and classification stages.We use a hybrid convolutional neural network-singular value decomposition(CNN-SVD)model for feature extraction and selection and an improved SVM-RBF with a Decision Tree(DT)and K-Nearest Neighbor(KNN)for classification.We tested our model on the IDRiD dataset and achieved an accuracy of 98.06%,a sensitivity of 83.67%,and a specificity of 100%for DR detection and evaluation tests,respectively.Our proposed approach outperforms baseline techniques and provides a more robust and accurate method for VTDR detection.展开更多
The object-based against pixel-based image analysis approaches were assessed for lithological mapping in a geologically complex terrain using Visible Near Infrared(VNIR)bands of WorldView-3(WV-3)satellite imagery.The ...The object-based against pixel-based image analysis approaches were assessed for lithological mapping in a geologically complex terrain using Visible Near Infrared(VNIR)bands of WorldView-3(WV-3)satellite imagery.The study area is Hormuz Island,southern Iran,a salt dome composed of dominant sedimentary and igneous rocks.When performing the object-based image analysis(OBLA)approach,the textural and spectral characteristics of lithological features were analyzed by the use of support vector machine(SVM)algorithm.However,in the pixelbased image analysis(PBIA),the spectra of lithological end-members,extracted from imagery,were used through the spectral angle mapper(SAM)method.Several test samples were used in a confusion matrix to assess the accuracy of classification methods quantitatively.Results showed that OBIA was capable of lithological mapping with an overall accuracy of 86.54%which was 19.33%greater than the accuracy of PBIA.OBIA also reduced the salt-and-pepper artifact pixels and produced a more realistic map with sharper lithological borders.This research showed limitations of pixel-based method due to relying merely on the spectral characteristics of rock types when applied to high-spatial-resolution VNIR bands of WorldView-3 imagery.It is concluded that the application of an object-based image analysis approach obtains a more accurate lithological classification when compared to a pixel-based image analysis algorithm.展开更多
Semantic segmentation of remote sensing images is an important but unsolved problem in the remote sensing society.Advanced image semantic segmentation models,such as DeepLabv3+,have achieved astonishing performance fo...Semantic segmentation of remote sensing images is an important but unsolved problem in the remote sensing society.Advanced image semantic segmentation models,such as DeepLabv3+,have achieved astonishing performance for semantically labeling very high resolution(VHR)remote sensing images.However,it is difficult for these models to capture the precise outlines of ground objects and explore the context information that revealing relationships among image objects for optimizing segmentation results.Consequently,this study proposes a semantic segmentation method for VHR images by incorporating deep learning semantic segmentation model(DeepLabv3+)and objectbased image analysis(OBIA),wherein DSM is employed to provide geometric information to enhance the interpretation of VHR images.The proposed method first obtains two initial probabilistic labeling predictions using a DeepLabv3+network on spectral image and a random forest(RF)classifier on hand-crafted features,respectively.These two predictions are then integrated by Dempster-Shafer(D-S)evidence theory to be fed into an object-constrained higher-order conditional random field(CRF)framework to estimate the final semantic labeling results with the consideration of the spatial contextual information.The proposed method is applied to the ISPRS 2D semantic labeling benchmark,and competitive overall accuracies of 90.6%and 85.0%are achieved for Vaihingen and Potsdam datasets,respectively.展开更多
This study analyses the effect of browning through image analysis based on colour and textural features in fresh-cut apple slices.A computer vision system(CVS)was developed for image acquisition,which consisted of a d...This study analyses the effect of browning through image analysis based on colour and textural features in fresh-cut apple slices.A computer vision system(CVS)was developed for image acquisition,which consisted of a digital camera and a florescent lamp source for illumination with a contrasting background.The CVS was calibrated using standard colour values and a model was developed by artificial neural network technique.Three varieties of apples such as Honey crisp,Granny Smith,and Golden Delicious were used for the analysis.The apples were freshly cut and subjected to image acquisition.Normalized colour features(L*,browning index,hue,and colour change)and textural features(entropy,contrast,and homogeneity)were analysed from the acquired images.The varieties Honey Crisp and Granny Smith did undergo browning within 120 min,whereas Golden delicious did not brown significantly.The study concluded that colour and textural features were important decision features for detecting browning in apples through image analysis.展开更多
Background:The carob tree(Ceratonia siliqua L.)is one of the most iconic tree species of the Mediterranean region,with valuable economic,ecological and cultural value.Carob has been exploited around the Mediterranean ...Background:The carob tree(Ceratonia siliqua L.)is one of the most iconic tree species of the Mediterranean region,with valuable economic,ecological and cultural value.Carob has been exploited around the Mediterranean region since antiquity and has been regarded as an important component of natural habitats and traditional agroecosystems.Several studies have focused on its morphological,biochemical,and genetic diversity.However,less is known about the intraspecific variation of seed traits.In this regard,and as an overall objective,we intend to evaluate the amplitude and the expression of intraspecific variations of carob seed traits at different ecological scales ranging from individual trees to different geographical landscapes.In addition,we investigated how the climate along the study area affects the extent of carob seed variability.Using image analysis techniques,we measured seven traits related to the size and the shape of 1740 seeds collected from 18 populations of spontaneous C.siliqua distributed along a latitudinal transect in Morocco under different bioclimatic conditions.Results:The morphometric analysis of carob seed showed the effectiveness of adopted approach to highlight the amount and the amplitude of intraspecific variation according to geographic and climatic factors.Seed trait analysis revealed high intraspecific variability,explained by differences between and among carob populations and geographic zones.Seed area,perimeter,length,and width showed the largest variability between geographic zones.However,circularity,aspect ratio,and seed roundness showed higher variability at the tree level.Finally,our results show that seed traits vary depending on altitude and climate condition.Conclusions:Revealing the amount and the structure of intraspecific traits variability of carob seed provides interesting insights to understand the mechanisms underlying trees adaptation to various environmental and ecological conditions.Therefore,intraspecific variation of seed traits should be integrated into trait‑based functional ecology to assess plant species responses to environmental changes.展开更多
Longitudinal image analysis plays an important role in depicting the development of the brain structure,where image regression and interpolation are two commonly used techniques.In this paper,we develop an efficient m...Longitudinal image analysis plays an important role in depicting the development of the brain structure,where image regression and interpolation are two commonly used techniques.In this paper,we develop an efficient model and approach based on a path regression on the image manifold instead of the geodesic regression to avoid the complexity of the geodesic computation.Concretely,first we model the deformation by diffeomorphism;then,a large deformation is represented by a path on the orbit of the diffeomorphism group action.This path is obtained by compositing several small deformations,which can be well approximated by its linearization.Second,we introduce some intermediate images as constraints to the model,which guides to form the best-fitting path.Thirdly,we propose an approximated quadratic model by local linearization method,where a closed form is deduced for the solution.It actually speeds up the algorithm.Finally,we evaluate the proposed model and algorithm on a synthetic data and a real longitudinal MRI data.The results show that our proposed method outperforms several state-of-the-art methods.展开更多
文摘The Ki67 index (KI) is a standard clinical marker for tumor proliferation;however, its application is hindered by intratumoral heterogeneity. In this study, we used digital image analysis to comprehensively analyze Ki67 heterogeneity and distribution patterns in breast carcinoma. Using Smart Pathology software, we digitized and analyzed 42 excised breast carcinoma Ki67 slides. Boxplots, histograms, and heat maps were generated to illustrate the KI distribution. We found that 30% of cases (13/42) exhibited discrepancies between global and hotspot KI when using a 14% KI threshold for classification. Patients with higher global or hotspot KI values displayed greater heterogenicity. Ki67 distribution patterns were categorized as randomly distributed (52%, 22/42), peripheral (43%, 18/42), and centered (5%, 2/42). Our sampling simulator indicated analyzing more than 10 high-power fields was typically required to accurately estimate global KI, with sampling size being correlated with heterogeneity. In conclusion, using digital image analysis in whole-slide images allows for comprehensive Ki67 profile assessment, shedding light on heterogeneity and distribution patterns. This spatial information can facilitate KI surveys of breast cancer and other malignancies.
基金This work was supported by Science and Technology Project of State Grid Corporation“Research on Key Technologies of Power Artificial Intelligence Open Platform”(5700-202155260A-0-0-00).
文摘The continuous growth in the scale of unmanned aerial vehicle (UAV) applications in transmission line inspection has resulted in a corresponding increase in the demand for UAV inspection image processing. Owing to its excellent performance in computer vision, deep learning has been applied to UAV inspection image processing tasks such as power line identification and insulator defect detection. Despite their excellent performance, electric power UAV inspection image processing models based on deep learning face several problems such as a small application scope, the need for constant retraining and optimization, and high R&D monetary and time costs due to the black-box and scene data-driven characteristics of deep learning. In this study, an automated deep learning system for electric power UAV inspection image analysis and processing is proposed as a solution to the aforementioned problems. This system design is based on the three critical design principles of generalizability, extensibility, and automation. Pre-trained models, fine-tuning (downstream task adaptation), and automated machine learning, which are closely related to these design principles, are reviewed. In addition, an automated deep learning system architecture for electric power UAV inspection image analysis and processing is presented. A prototype system was constructed and experiments were conducted on the two electric power UAV inspection image analysis and processing tasks of insulator self-detonation and bird nest recognition. The models constructed using the prototype system achieved 91.36% and 86.13% mAP for insulator self-detonation and bird nest recognition, respectively. This demonstrates that the system design concept is reasonable and the system architecture feasible .
基金support for this work from the Deanship of Scientific Research (DSR),University of Tabuk,Tabuk,Saudi Arabia,under grant number S-1440-0262.
文摘Medical image analysis is an active research topic,with thousands of studies published in the past few years.Transfer learning(TL)including convolutional neural networks(CNNs)focused to enhance efficiency on an innovative task using the knowledge of the same tasks learnt in advance.It has played a major role in medical image analysis since it solves the data scarcity issue along with that it saves hardware resources and time.This study develops an EnhancedTunicate SwarmOptimization withTransfer Learning EnabledMedical Image Analysis System(ETSOTL-MIAS).The goal of the ETSOTL-MIAS technique lies in the identification and classification of diseases through medical imaging.The ETSOTL-MIAS technique involves the Chan Vese segmentation technique to identify the affected regions in the medical image.For feature extraction purposes,the ETSOTL-MIAS technique designs a modified DarkNet-53 model.To avoid the manual hyperparameter adjustment process,the ETSOTLMIAS technique exploits the ETSO algorithm,showing the novelty of the work.Finally,the classification of medical images takes place by random forest(RF)classifier.The performance validation of the ETSOTL-MIAS technique is tested on a benchmark medical image database.The extensive experimental analysis showed the promising performance of the ETSOTL-MIAS technique under different measures.
基金The authors extend their appreciation to the Deanship of Scientific Research at King Khalid University for funding this work under grant number(RGP 2/158/43)Princess Nourah bint Abdulrahman University Researchers Supporting Project number(PNURSP2022R161)Princess Nourah bint Abdulrahman University,Riyadh,Saudi Arabia.The authors would like to thank the Deanship of Scientific Research at Umm Al-Qura University for supporting this work by Grant Code:(22UQU4340237DSR11).
文摘Biomedical image processing is widely utilized for disease detection and classification of biomedical images.Tongue color image analysis is an effective and non-invasive tool for carrying out secondary detection at anytime and anywhere.For removing the qualitative aspect,tongue images are quantitatively inspected,proposing a novel disease classification model in an automated way is preferable.This article introduces a novel political optimizer with deep learning enabled tongue color image analysis(PODL-TCIA)technique.The presented PODL-TCIA model purposes to detect the occurrence of the disease by examining the color of the tongue.To attain this,the PODL-TCIA model initially performs image pre-processing to enhance medical image quality.Followed by,Inception with ResNet-v2 model is employed for feature extraction.Besides,political optimizer(PO)with twin support vector machine(TSVM)model is exploited for image classification process,shows the novelty of the work.The design of PO algorithm assists in the optimal parameter selection of the TSVM model.For ensuring the enhanced outcomes of the PODL-TCIA model,a wide-ranging experimental analysis was applied and the outcomes reported the betterment of the PODL-TCIA model over the recent approaches.
基金The technical and financial support of Aluminerie Alouette Inc.the financial support of the National Science and Engineering Research Council of Canada(NSERC),Developpement economique Sept-Iles,the University of Quebec at Chicoutimi(UQAC),and the Foundation of the University of Quebec at Chicoutimi(FUQAC)are greatly appreciated.
文摘Carbon anodes are an essential part of the primary aluminum production. They are made of coal tar pitch, calcined petroleum coke, recycled anodes and butts. As pitch acts as a binder for the anode, its proper distribution in a green anode has a great impact on the properties of the baked anode. Information on cracks in anodes is important for the quality of the baked anode. There is presently no reliable method available to analyze and quantify the amount of coke, pitch and pores/cracks in a green anode sample. In this article, an image analysis technique has been described, that can analyze as well as quantify the area percentage of pores/cracks and weight percentages of pitch and coke. The novelty of the method is its capacity to differentiate the different components of anode.
基金This work is partly supported by the Fundamental Research Funds for the Central Universities of China under grants GK202003080the Natural Science Foundation of Shaanxi Province under Grants 2021JM-205the UK Engineering and Physical Sciences Research Council through grants EP/V034111/1.
文摘Malaria is an important and worldwide fatal disease that has been widely reported by the World Health Organization(WHO),and it has about 219 million cases worldwide,with 435,000 of those mortal.The common malaria diagnosis approach is heavily reliant on highly trained experts,who use a microscope to examine the samples.Therefore,there is a need to create an automated solution for the diagnosis of malaria.One of the main objectives of this work is to create a design tool that could be used to diagnose malaria from the image of a blood sample.In this paper,we firstly developed a graphical user interface that could be used to help segment red blood cells and infected cells and allow the users to analyze the blood samples.Secondly,a Feed-forward Neural Network(FNN)is designed to classify the cells into two classes.The achieved results show that the proposed techniques can be used to detect malaria,as it has achieved 92%accuracy with a database that contains 27,560 benchmark images.
文摘In this paper,an Automated Brain Image Analysis(ABIA)system that classifies the Magnetic Resonance Imaging(MRI)of human brain is presented.The classification of MRI images into normal or low grade or high grade plays a vital role for the early diagnosis.The Non-Subsampled Shearlet Transform(NSST)that captures more visual information than conventional wavelet transforms is employed for feature extraction.As the feature space of NSST is very high,a statistical t-test is applied to select the dominant directional sub-bands at each level of NSST decomposition based on sub-band energies.A combination of features that includes Gray Level Co-occurrence Matrix(GLCM)based features,Histograms of Positive Shearlet Coefficients(HPSC),and Histograms of Negative Shearlet Coefficients(HNSC)are estimated.The combined feature set is utilized in the classification phase where a hybrid approach is designed with three classifiers;k-Nearest Neighbor(kNN),Naive Bayes(NB)and Support Vector Machine(SVM)classifiers.The output of individual trained classifiers for a testing input is hybridized to take a final decision.The quantitative results of ABIA system on Repository of Molecular Brain Neoplasia Data(REMBRANDT)database show the overall improved performance in comparison with a single classifier model with accuracy of 99% for normal/abnormal classification and 98% for low and high risk classification.
文摘This review is focused on using computer image analysis as a means of objective and quantitative characterizing optical images of the macroscopic (e.g. microbial colonies) and the microscopic (e.g. single cell) objects in the microbiological research. This is the way of making many visual inspection assays more objective and less time and labor consuming. Also, it can provide new visually inaccessible information on relation between some optical parameters and various biological features of the microbial cul-tures. Of special interest is application of image analysis in fluorescence microscopy as it opens new ways of using fluorescence based methodology for single microbial cell studies. Examples of using image analysis in the studies of both the macroscopic and the microscopic microbiological objects obtained by various imaging techniques are presented and discussed.
文摘BACKGROUND Digital pathology image(DPI)analysis has been developed by machine learning(ML)techniques.However,little attention has been paid to the reproducibility of ML-based histological classification in heterochronously obtained DPIs of the same hematoxylin and eosin(HE)slide.AIM To elucidate the frequency and preventable causes of discordant classification results of DPI analysis using ML for the heterochronously obtained DPIs.METHODS We created paired DPIs by scanning 298 HE stained slides containing 584 tissues twice with a virtual slide scanner.The paired DPIs were analyzed by our MLaided classification model.We defined non-flipped and flipped groups as the paired DPIs with concordant and discordant classification results,respectively.We compared differences in color and blur between the non-flipped and flipped groups by L1-norm and a blur index,respectively.RESULTS We observed discordant classification results in 23.1%of the paired DPIs obtained by two independent scans of the same microscope slide.We detected no significant difference in the L1-norm of each color channel between the two groups;however,the flipped group showed a significantly higher blur index than the non-flipped group.CONCLUSION Our results suggest that differences in the blur-not the color-of the paired DPIs may cause discordant classification results.An ML-aided classification model for DPI should be tested for this potential cause of the reduced reproducibility of the model.In a future study,a slide scanner and/or a preprocessing method of minimizing DPI blur should be developed.
文摘Due to their unique structural features, electrospun membranes have gained considerable attention for use in applications where quality of depth filtration is a dominant performance factor. To elucidate the depth filtration phenomena it is important to quantify the intrinsic structural properties independent from the dynamics of transport media. Several methods have been proposed for structural characterization of such membranes. However, these methods do not meet the requirement for the quantification of intrinsic structural properties in depth filtration. This may be due to the complex influence of transport media dynamics and structural elements in the depth filtration process. In addition, the different morphological architectures of electrospun membranes present obstacles to precise quantification. This paper seeks to quantify the structural characteristics of electrospun membranes by introducing a robust image analysis technique and exploiting it to evaluate the permeation-filtration mechanism. To this end, a nanostructured fibrous network was simulated as an ideal membrane using adaptive local criteria in the image analysis. The reliability of the proposed approach was validated with measurements and comparison of structural characteristics in different morphological conditions. The results were found to be well compatible with empirical observations of perfect membrane structures. This approach, based on optimization of electrospinning parameters, may pave the way for producing optimal membrane structures for boosting the performance of electrospun membranes in end-use applications.
基金the financial support provided by National Key R&D Project of China(grant No.2020YFB0606303)the technical supports received from Sam Clark in CPFD Software,LLC of USA,and from Hi-Key Technology Incorporated of China.
文摘Bubble dynamics properties play a crucial and significant role in the design and optimization of gas-solid fluidized beds.In this study,the bubble dynamics properties of four B-particles were investigated in a quasi-two-dimensional(quasi-2D)fluidized bed,including bubble equivalent diameter,bubble size distribution,average bubble density,bubble aspect ratio,bubble hold-up,bed expansion ratio,bubble radial position,and bubble velocity.The studies were performed by computational particle fluid dynamics(CPFD)numerical simulation and post-processed with digital image analysis(DIA)technique,at superficial gas velocities ranging from 2u_(mf) to 7u_(mf).The simulated results shown that the CPFD simulation combining with DIA technique post-processing could be used as a reliable method for simulating bubble dynamics properties in quasi-2D gas-solid fluidized beds.However,it seemed not desirable for the simulation of bubble motion near the air distributor at higher superficial gas velocity from the simulated average bubble density distribution.The superficial gas velocity significantly affected the bubble equivalent diameter and evolution,while it had little influence on bubble size distribution and bubble aspect ratio distribution for the same particles.Both time-averaged bubble hold-up and bed expansion ratio increased with the increase of superficial gas velocity.Two core-annular flow structures could be found in the fluidized bed for all cases.The average bubble rising velocity increased with the increasing bubble equivalent diameter.For bubble lateral movement,the smaller bubbles might be more susceptible,and superficial gas velocity had a little influence on the absolute lateral velocity of bubbles.The simulated results presented a valuable and novel approach for studying bubble dynamics properties.The comprehensive understanding of bubble dynamics behaviors in quasi-2D gas-solid fluidized beds would provide support in the design,operation,and optimization of gas-solid fluidized bed reactors.
基金Financial support from the National Natural Science Foundation of China(grant No.61633006)is acknowledged。
文摘In situ microscopic imaging is a useful tool in monitoring crystallization processes,including crystal nucleation,growth,aggregation and breakage,as well as possible polymorphic transition.To convert the qualitative information to be quantitative for the purpose of process optimization and control,accurate analysis of crystal images is essential.However,the accuracy of image segmentation with traditional methods is largely affected by many factors,including solid concentration and image quality.In this study,the deep learning technique using mask region-based convolutional neural network(Mask R-CNN)is investigated for the analysis of on-line images from an industrial crystallizer of 10 m^(3) operated in continuous mode with high solid concentration and overlapped particles.With detailed label points for each crystal and transfer learning technique,two models trained with 70,908 and 7,709 crystals respectively are compared for the effect of training data amount.The former model effectively segments the aggregated and overlapped crystals even at high solid concentrations.Moreover,it performs much better than the latter one and traditional multi-scale method both in terms of precision and recall,revealing the importance of large number of crystals in deep learning.Some geometrical characteristics of segmented crystals are also analyzed,involving equivalent diameter,circularity,and aspect ratio.
基金the National Natural Science Foundation of China(Grant No.62106101)the Natural Science Foundation of Jiangsu Province(Grant No.BK20210180).
文摘Transformers have dominated the field of natural language processing and have recently made an impact in the area of computer vision.In the field of medical image analysis,transformers have also been successfully used in to full-stack clinical applications,including image synthesis/reconstruction,registration,segmentation,detection,and diagnosis.This paper aimed to promote awareness of the applications of transformers in medical image analysis.Specifically,we first provided an overview of the core concepts of the attention mechanism built into transformers and other basic components.Second,we reviewed various transformer architectures tailored for medical image applications and discuss their limitations.Within this review,we investigated key challenges including the use of transformers in different learning paradigms,improving model efficiency,and coupling with other techniques.We hope this review would provide a comprehensive picture of transformers to readers with an interest in medical image analysis.
基金This research was funded by the National Natural Science Foundation of China(Nos.71762010,62262019,62162025,61966013,12162012)the Hainan Provincial Natural Science Foundation of China(Nos.823RC488,623RC481,620RC603,621QN241,620RC602,121RC536)+1 种基金the Haikou Science and Technology Plan Project of China(No.2022-016)the Project supported by the Education Department of Hainan Province,No.Hnky2021-23.
文摘Artificial Intelligence(AI)is being increasingly used for diagnosing Vision-Threatening Diabetic Retinopathy(VTDR),which is a leading cause of visual impairment and blindness worldwide.However,previous automated VTDR detection methods have mainly relied on manual feature extraction and classification,leading to errors.This paper proposes a novel VTDR detection and classification model that combines different models through majority voting.Our proposed methodology involves preprocessing,data augmentation,feature extraction,and classification stages.We use a hybrid convolutional neural network-singular value decomposition(CNN-SVD)model for feature extraction and selection and an improved SVM-RBF with a Decision Tree(DT)and K-Nearest Neighbor(KNN)for classification.We tested our model on the IDRiD dataset and achieved an accuracy of 98.06%,a sensitivity of 83.67%,and a specificity of 100%for DR detection and evaluation tests,respectively.Our proposed approach outperforms baseline techniques and provides a more robust and accurate method for VTDR detection.
文摘The object-based against pixel-based image analysis approaches were assessed for lithological mapping in a geologically complex terrain using Visible Near Infrared(VNIR)bands of WorldView-3(WV-3)satellite imagery.The study area is Hormuz Island,southern Iran,a salt dome composed of dominant sedimentary and igneous rocks.When performing the object-based image analysis(OBLA)approach,the textural and spectral characteristics of lithological features were analyzed by the use of support vector machine(SVM)algorithm.However,in the pixelbased image analysis(PBIA),the spectra of lithological end-members,extracted from imagery,were used through the spectral angle mapper(SAM)method.Several test samples were used in a confusion matrix to assess the accuracy of classification methods quantitatively.Results showed that OBIA was capable of lithological mapping with an overall accuracy of 86.54%which was 19.33%greater than the accuracy of PBIA.OBIA also reduced the salt-and-pepper artifact pixels and produced a more realistic map with sharper lithological borders.This research showed limitations of pixel-based method due to relying merely on the spectral characteristics of rock types when applied to high-spatial-resolution VNIR bands of WorldView-3 imagery.It is concluded that the application of an object-based image analysis approach obtains a more accurate lithological classification when compared to a pixel-based image analysis algorithm.
基金was funded by the Key Laboratory of Surveying and Mapping Science and Geospatial Information Technology of Ministry of Natural Resources[grant number 2020-2-1]the National Natural Science Foundation of China[grant number 41871372].
文摘Semantic segmentation of remote sensing images is an important but unsolved problem in the remote sensing society.Advanced image semantic segmentation models,such as DeepLabv3+,have achieved astonishing performance for semantically labeling very high resolution(VHR)remote sensing images.However,it is difficult for these models to capture the precise outlines of ground objects and explore the context information that revealing relationships among image objects for optimizing segmentation results.Consequently,this study proposes a semantic segmentation method for VHR images by incorporating deep learning semantic segmentation model(DeepLabv3+)and objectbased image analysis(OBIA),wherein DSM is employed to provide geometric information to enhance the interpretation of VHR images.The proposed method first obtains two initial probabilistic labeling predictions using a DeepLabv3+network on spectral image and a random forest(RF)classifier on hand-crafted features,respectively.These two predictions are then integrated by Dempster-Shafer(D-S)evidence theory to be fed into an object-constrained higher-order conditional random field(CRF)framework to estimate the final semantic labeling results with the consideration of the spatial contextual information.The proposed method is applied to the ISPRS 2D semantic labeling benchmark,and competitive overall accuracies of 90.6%and 85.0%are achieved for Vaihingen and Potsdam datasets,respectively.
文摘This study analyses the effect of browning through image analysis based on colour and textural features in fresh-cut apple slices.A computer vision system(CVS)was developed for image acquisition,which consisted of a digital camera and a florescent lamp source for illumination with a contrasting background.The CVS was calibrated using standard colour values and a model was developed by artificial neural network technique.Three varieties of apples such as Honey crisp,Granny Smith,and Golden Delicious were used for the analysis.The apples were freshly cut and subjected to image acquisition.Normalized colour features(L*,browning index,hue,and colour change)and textural features(entropy,contrast,and homogeneity)were analysed from the acquired images.The varieties Honey Crisp and Granny Smith did undergo browning within 120 min,whereas Golden delicious did not brown significantly.The study concluded that colour and textural features were important decision features for detecting browning in apples through image analysis.
基金This study was supported by the Agencia Andaluza de Cooperación Internacional para el Desarrollo(AACID)and the project“Amélioration de la productivitédes cultures forestières d’intérêt socio-économiqueélevédans les zones rurales du nord du Maroc,n°2018004”.
文摘Background:The carob tree(Ceratonia siliqua L.)is one of the most iconic tree species of the Mediterranean region,with valuable economic,ecological and cultural value.Carob has been exploited around the Mediterranean region since antiquity and has been regarded as an important component of natural habitats and traditional agroecosystems.Several studies have focused on its morphological,biochemical,and genetic diversity.However,less is known about the intraspecific variation of seed traits.In this regard,and as an overall objective,we intend to evaluate the amplitude and the expression of intraspecific variations of carob seed traits at different ecological scales ranging from individual trees to different geographical landscapes.In addition,we investigated how the climate along the study area affects the extent of carob seed variability.Using image analysis techniques,we measured seven traits related to the size and the shape of 1740 seeds collected from 18 populations of spontaneous C.siliqua distributed along a latitudinal transect in Morocco under different bioclimatic conditions.Results:The morphometric analysis of carob seed showed the effectiveness of adopted approach to highlight the amount and the amplitude of intraspecific variation according to geographic and climatic factors.Seed trait analysis revealed high intraspecific variability,explained by differences between and among carob populations and geographic zones.Seed area,perimeter,length,and width showed the largest variability between geographic zones.However,circularity,aspect ratio,and seed roundness showed higher variability at the tree level.Finally,our results show that seed traits vary depending on altitude and climate condition.Conclusions:Revealing the amount and the structure of intraspecific traits variability of carob seed provides interesting insights to understand the mechanisms underlying trees adaptation to various environmental and ecological conditions.Therefore,intraspecific variation of seed traits should be integrated into trait‑based functional ecology to assess plant species responses to environmental changes.
基金The research was supported by the National Natural Science Foundation of China(Nos.11771276,11471208)the Capacity Construction Project of Local Universities in Shanghai(No.18010500600).
文摘Longitudinal image analysis plays an important role in depicting the development of the brain structure,where image regression and interpolation are two commonly used techniques.In this paper,we develop an efficient model and approach based on a path regression on the image manifold instead of the geodesic regression to avoid the complexity of the geodesic computation.Concretely,first we model the deformation by diffeomorphism;then,a large deformation is represented by a path on the orbit of the diffeomorphism group action.This path is obtained by compositing several small deformations,which can be well approximated by its linearization.Second,we introduce some intermediate images as constraints to the model,which guides to form the best-fitting path.Thirdly,we propose an approximated quadratic model by local linearization method,where a closed form is deduced for the solution.It actually speeds up the algorithm.Finally,we evaluate the proposed model and algorithm on a synthetic data and a real longitudinal MRI data.The results show that our proposed method outperforms several state-of-the-art methods.