Workers’exposure to excessive noise is a big universal work-related challenges.One of the major consequences of exposure to noise is permanent or transient hearing loss.The current study sought to utilize audiometric...Workers’exposure to excessive noise is a big universal work-related challenges.One of the major consequences of exposure to noise is permanent or transient hearing loss.The current study sought to utilize audiometric data to weigh and prioritize the factors affecting workers’hearing loss based using the Support Vector Machine(SVM)algorithm.This cross sectional-descriptive study was conducted in 2017 in a mining industry in southeast Iran.The participating workers(n=150)were divided into three groups of 50 based on the sound pressure level to which they were exposed(two experimental groups and one control group).Audiometric tests were carried out for all members of each group.The study generally entailed the following steps:(1)selecting predicting variables to weigh and prioritize factors affecting hearing loss;(2)conducting audiometric tests and assessing permanent hearing loss in each ear and then evaluating total hearing loss;(3)categorizing different types of hearing loss;(4)weighing and prioritizing factors that affect hearing loss based on the SVM algorithm;and(5)assessing the error rate and accuracy of the models.The collected data were fed into SPSS 18,followed by conducting linear regression and paired samples t-test.It was revealed that,in the first model(SPL<70 dBA),the frequency of 8 KHz had the greatest impact(with a weight of 33%),while noise had the smallest influence(with a weight of 5%).The accuracy of this model was 100%.In the second model(70<SPL<80 dBA),the frequency of 4 KHz had the most profound effect(with a weight of 21%),whereas the frequency of 250 Hz had the lowest impact(with a weight of 6%).The accuracy of this model was 100%too.In the third model(SPL>85 dBA),the frequency of 4 KHz had the highest impact(with a weight of 22%),while the frequency of 250 Hz had the smallest influence(with a weight of 3%).The accuracy of this model was 100%too.In the fourth model,the frequency of 4 KHz had the greatest effect(with a weight of 24%),while the frequency of 500 Hz had the smallest effect(with a weight of 4%).The accuracy of this model was found to be 94%.According to the modeling conducted using the SVM algorithm,the frequency of 4 KHz has the most profound effect on predicting changes in hearing loss.Given the high accuracy of the obtained model,this algorithm is an appropriate and powerful tool to predict and model hearing loss.展开更多
This paper presents a nonlinear model predictive control(NMPC) approach based on support vector machine(SVM) and genetic algorithm(GA) for multiple-input multiple-output(MIMO) nonlinear systems.Individual SVM is used ...This paper presents a nonlinear model predictive control(NMPC) approach based on support vector machine(SVM) and genetic algorithm(GA) for multiple-input multiple-output(MIMO) nonlinear systems.Individual SVM is used to approximate each output of the controlled plant Then the model is used in MPC control scheme to predict the outputs of the controlled plant.The optimal control sequence is calculated using GA with elite preserve strategy.Simulation results of a typical MIMO nonlinear system show that this method has a good ability of set points tracking and disturbance rejection.展开更多
In order to improve measurement accuracy of moving target signals, an automatic target recognition model of moving target signals was established based on empirical mode decomposition(EMD) and support vector machine(S...In order to improve measurement accuracy of moving target signals, an automatic target recognition model of moving target signals was established based on empirical mode decomposition(EMD) and support vector machine(SVM). Automatic target recognition process on the nonlinear and non-stationary of Doppler signals of military target by using automatic target recognition model can be expressed as follows. Firstly, the nonlinearity and non-stationary of Doppler signals were decomposed into a set of intrinsic mode functions(IMFs) using EMD. After the Hilbert transform of IMF, the energy ratio of each IMF to the total IMFs can be extracted as the features of military target. Then, the SVM was trained through using the energy ratio to classify the military targets, and genetic algorithm(GA) was used to optimize SVM parameters in the solution space. The experimental results show that this algorithm can achieve the recognition accuracies of 86.15%, 87.93%, and 82.28% for tank, vehicle and soldier, respectively.展开更多
Breast Arterial Calcification(BAC)is a mammographic decision dissimilar to cancer and commonly observed in elderly women.Thus identifying BAC could provide an expense,and be inaccurate.Recently Deep Learning(DL)method...Breast Arterial Calcification(BAC)is a mammographic decision dissimilar to cancer and commonly observed in elderly women.Thus identifying BAC could provide an expense,and be inaccurate.Recently Deep Learning(DL)methods have been introduced for automatic BAC detection and quantification with increased accuracy.Previously,classification with deep learning had reached higher efficiency,but designing the structure of DL proved to be an extremely challenging task due to overfitting models.It also is not able to capture the patterns and irregularities presented in the images.To solve the overfitting problem,an optimal feature set has been formed by Enhanced Wolf Pack Algorithm(EWPA),and their irregularities are identified by Dense-kUNet segmentation.In this paper,Dense-kUNet for segmentation and optimal feature has been introduced for classification(severe,mild,light)that integrates DenseUNet and kU-Net.Longer bound links exist among adjacent modules,allowing relatively rough data to be sent to the following component and assisting the system in finding higher qualities.The major contribution of the work is to design the best features selected by Enhanced Wolf Pack Algorithm(EWPA),and Modified Support Vector Machine(MSVM)based learning for classification.k-Dense-UNet is introduced which combines the procedure of Dense-UNet and kU-Net for image segmentation.Longer bound associations occur among nearby sections,allowing relatively granular data to be sent to the next subsystem and benefiting the system in recognizing smaller characteristics.The proposed techniques and the performance are tested using several types of analysis techniques 826 filled digitized mammography.The proposed method achieved the highest precision,recall,F-measure,and accuracy of 84.4333%,84.5333%,84.4833%,and 86.8667%when compared to other methods on the Digital Database for Screening Mammography(DDSM).展开更多
The pruning algorithms for sparse least squares support vector regression machine are common methods, and easily com- prehensible, but the computational burden in the training phase is heavy due to the retraining in p...The pruning algorithms for sparse least squares support vector regression machine are common methods, and easily com- prehensible, but the computational burden in the training phase is heavy due to the retraining in performing the pruning process, which is not favorable for their applications. To this end, an im- proved scheme is proposed to accelerate sparse least squares support vector regression machine. A major advantage of this new scheme is based on the iterative methodology, which uses the previous training results instead of retraining, and its feasibility is strictly verified theoretically. Finally, experiments on bench- mark data sets corroborate a significant saving of the training time with the same number of support vectors and predictive accuracy compared with the original pruning algorithms, and this speedup scheme is also extended to classification problem.展开更多
Word Sense Disambiguation has been a trending topic of research in Natural Language Processing and Machine Learning.Mining core features and performing the text classification still exist as a challenging task.Here the...Word Sense Disambiguation has been a trending topic of research in Natural Language Processing and Machine Learning.Mining core features and performing the text classification still exist as a challenging task.Here the features of the context such as neighboring words like adjective provide the evidence for classification using machine learning approach.This paper presented the text document classification that has wide applications in information retrieval,which uses movie review datasets.Here the document indexing based on controlled vocabulary,adjective,word sense disambiguation,generating hierarchical cate-gorization of web pages,spam detection,topic labeling,web search,document summarization,etc.Here the kernel support vector machine learning algorithm helps to classify the text and feature extract is performed by cuckoo search opti-mization.Positive review and negative review of movie dataset is presented to get the better classification accuracy.Experimental results focused with context mining,feature analysis and classification.By comparing with the previous work,proposed work designed to achieve the efficient results.Overall design is per-formed with MATLAB 2020a tool.展开更多
Objective:Circulating tumor cells(CTCs)play a critical role in cancer metastasis,but their prevalence and significance remain unclear.This study attempted to track the epithelial-mesenchymal transition(EMT)status of C...Objective:Circulating tumor cells(CTCs)play a critical role in cancer metastasis,but their prevalence and significance remain unclear.This study attempted to track the epithelial-mesenchymal transition(EMT)status of CTCs in breast cancer patients and investigate their clinical relevance.Methods:In this study,the established negFACS-IF:E/M platform was applied to isolate rare CTCs and characterize their EMT status in breast cancer.A total of 89 breast cancer patients were recruited,including stage 0–III(n=60)and late stage(n=29)cases.Results:Using the negFACS-IF:E/M platform,it was found that in human epidermal growth factor receptor 2(HER2)+patients,mesenchymal CTCs usually exhibited a high percentage of HER2+cells.Stage IV breast cancer patients had considerably more CTCs than stage 0–III patients.Among stage 0–III breast cancers,the HER2 subtype included a significantly higher percentage of mesenchymal and biphenotypic(epithelial and mesenchymal)CTCs than the luminal A or B subtypes.Among stage IV patients,CTCs were predominantly epithelial in cases with local recurrence and were more mesenchymal in cases with distant metastasis.By applying a support vector machine(SVM)algorithm,the EMT status of CTCs could distinguish between breast cancer cases with metastasis/local recurrence and those without recurrence.Conclusions:The negFACS-IF:E/M platform provides a flexible and generally acceptable method for the highly sensitive and specific detection of CTCs and their EMT traits in breast cancer.This study demonstrated that the EMT status of CTCs had high clinical relevance in breast cancer,especially in predicting the distant metastasis or local recurrence of breast cancer.展开更多
For the deficiency that the traditional single forecast methods could not forecast electronic equipment states, a combined forecast method based on the hidden Markov model(HMM) and least square support vector machin...For the deficiency that the traditional single forecast methods could not forecast electronic equipment states, a combined forecast method based on the hidden Markov model(HMM) and least square support vector machine(LS-SVM) is presented. The multi-agent genetic algorithm(MAGA) is used to estimate parameters of HMM to overcome the problem that the Baum-Welch algorithm is easy to fall into local optimal solution. The state condition probability is introduced into the HMM modeling process to reduce the effect of uncertain factors. MAGA is used to estimate parameters of LS-SVM. Moreover, pruning algorithms are used to estimate parameters to get the sparse approximation of LS-SVM so as to increase the ranging performance. On the basis of these, the combined forecast model of electronic equipment states is established. The example results show the superiority of the combined forecast model in terms of forecast precision,calculation speed and stability.展开更多
Pidan or century egg, also known as preserved egg, is one of the most traditional and popular egg products in China. The crack detection of preserved eggshell is very important to guarantee its quality. In this study,...Pidan or century egg, also known as preserved egg, is one of the most traditional and popular egg products in China. The crack detection of preserved eggshell is very important to guarantee its quality. In this study, we develop an image algorithm for preserved eggshell's crack detection by using natural light and polarized image. Four features including crack length, crack state coefficient, maximum projection and angular point are extracted from the natural light image by morphology calculus algorithms. The support vector machines(SVM) model with radial basis kernel function is established using the four features with an accuracy of about 92%. The detection accuracy is improved to 94% by using a new characteristic parameter of crack length on polarization image. The Multi-information fusion analysis indicates the potential for cracks detection by a real-time synthesis imaging system.展开更多
Near infrared spectroscopy (NIR) is now probably the most popular process analytical technology (PAT) for pharmaceutical and some other industries. However, unlike mid-IR, NIR is known to have difficulties in moni...Near infrared spectroscopy (NIR) is now probably the most popular process analytical technology (PAT) for pharmaceutical and some other industries. However, unlike mid-IR, NIR is known to have difficulties in monitoring crystallization or precipitation processes because the existence of solids could cause distortion of the spectra. This phenomenon, seen as unfavorable previously, is however an indication that NIR spectra contain rich information about both solids and liquids, giving the possibility of using the same instrument for multiple property characterization. In this study, transflectance NIR calibration data was obtained using solutions and slurries of varied solution concentration, particle size, solid concentration and temperature. The data was used to build calibration models for prediction of the multiple properties of both phases. Predictive models were developed for this challenging application using an approach that combines genetic algorithm (GA) and support vector machine (SVM). GA is used for wavelength selection and SVM for mode building. The new GA-SVM approach is shown to outperform other methods including GA-PLS (partial least squares) and traditional SVM. NIR is thus successfully applied to monitoring seeded and unseeded cooling crystallization processes of L-glutamic acid.展开更多
In this work, two chemometrics methods are applied for the modeling and prediction of electrophoretic mobilities of some organic and inorganic compounds. The successive projection algorithm, feature selection (SPA) ...In this work, two chemometrics methods are applied for the modeling and prediction of electrophoretic mobilities of some organic and inorganic compounds. The successive projection algorithm, feature selection (SPA) strategy, is used as the descriptor selection and model development method. Then, the support vector machine (SVM) and multiple linear regression (MLR) model are utilized to construct the non-linear and linear quantitative structure-property relationship models. The results obtained using the SVM model are compared with those obtained using MLR reveal that the SVM model is of much better predictive value than the MLR one. The root-mean-square errors for the training set and the test set for the SVM model were 0.1911 and 0.2569, respectively, while by the MLR model, they were 0.4908 and 0.6494, respectively. The results show that the SVM model drastically enhances the ability of prediction in QSPR studies and is superior to the MLR model.展开更多
Hot components operate in a high-temperature and high-pressure environment. The occurrence of a fault in hot components leads to high economic losses. In general, exhaust gas temperature(EGT) is used to monitor the pe...Hot components operate in a high-temperature and high-pressure environment. The occurrence of a fault in hot components leads to high economic losses. In general, exhaust gas temperature(EGT) is used to monitor the performance of hot components.However, during the early stages of a failure, the fault information is weak, and is simultaneously affected by various types of interference, such as the complex working conditions, ambient conditions, gradual performance degradation of the compressors and turbines, and noise. Additionally, inadequate effective information of the gas turbine also restricts the establishment of the detection model. To solve the above problems, this paper proposes an anomaly detection method based on frequent pattern extraction. A frequent pattern model(FPM) is applied to indicate the inherent regularity of change in EGT occurring from different types of interference. In this study, based on a genetic algorithm and support vector machine regression, the relationship model between the EGT and interference was tentatively built. The modeling accuracy was then further improved through the selection of the kernel function and training data. Experiments indicate that the optimal kernel function is linear and that the optimal training data should be balanced in addition to covering the appropriate range of operating conditions and ambient temperature. Furthermore, the thresholds based on the Pauta criterion that is automatically obtained during the modeling process, are used to determine whether hot components are operating abnormally. Moreover, the FPM is compared with the similarity theory, which demonstrates that the FPM can better suppress the effect of the component performance degradation and fuel heat value fluctuation. Finally, the effectiveness of the proposed method is validated on seven months of actual data obtained from a Titan130 gas turbine on an offshore oil platform. The results indicate that the proposed method can sensitively detect malfunctions in hot components during the early stages of a fault, and is robust to various types of interference.展开更多
基金This study stemmed from a research project(code number:96000838)which was sponsored by the Institute for Futures Studies in Health at Kerman University of Medical Sciences.
文摘Workers’exposure to excessive noise is a big universal work-related challenges.One of the major consequences of exposure to noise is permanent or transient hearing loss.The current study sought to utilize audiometric data to weigh and prioritize the factors affecting workers’hearing loss based using the Support Vector Machine(SVM)algorithm.This cross sectional-descriptive study was conducted in 2017 in a mining industry in southeast Iran.The participating workers(n=150)were divided into three groups of 50 based on the sound pressure level to which they were exposed(two experimental groups and one control group).Audiometric tests were carried out for all members of each group.The study generally entailed the following steps:(1)selecting predicting variables to weigh and prioritize factors affecting hearing loss;(2)conducting audiometric tests and assessing permanent hearing loss in each ear and then evaluating total hearing loss;(3)categorizing different types of hearing loss;(4)weighing and prioritizing factors that affect hearing loss based on the SVM algorithm;and(5)assessing the error rate and accuracy of the models.The collected data were fed into SPSS 18,followed by conducting linear regression and paired samples t-test.It was revealed that,in the first model(SPL<70 dBA),the frequency of 8 KHz had the greatest impact(with a weight of 33%),while noise had the smallest influence(with a weight of 5%).The accuracy of this model was 100%.In the second model(70<SPL<80 dBA),the frequency of 4 KHz had the most profound effect(with a weight of 21%),whereas the frequency of 250 Hz had the lowest impact(with a weight of 6%).The accuracy of this model was 100%too.In the third model(SPL>85 dBA),the frequency of 4 KHz had the highest impact(with a weight of 22%),while the frequency of 250 Hz had the smallest influence(with a weight of 3%).The accuracy of this model was 100%too.In the fourth model,the frequency of 4 KHz had the greatest effect(with a weight of 24%),while the frequency of 500 Hz had the smallest effect(with a weight of 4%).The accuracy of this model was found to be 94%.According to the modeling conducted using the SVM algorithm,the frequency of 4 KHz has the most profound effect on predicting changes in hearing loss.Given the high accuracy of the obtained model,this algorithm is an appropriate and powerful tool to predict and model hearing loss.
基金Supported by the National Natural Science Foundation of China(21076179)the National Basic Research Program of China(2012CB720500)
文摘This paper presents a nonlinear model predictive control(NMPC) approach based on support vector machine(SVM) and genetic algorithm(GA) for multiple-input multiple-output(MIMO) nonlinear systems.Individual SVM is used to approximate each output of the controlled plant Then the model is used in MPC control scheme to predict the outputs of the controlled plant.The optimal control sequence is calculated using GA with elite preserve strategy.Simulation results of a typical MIMO nonlinear system show that this method has a good ability of set points tracking and disturbance rejection.
基金Projects(61471370,61401479)supported by the National Natural Science Foundation of China
文摘In order to improve measurement accuracy of moving target signals, an automatic target recognition model of moving target signals was established based on empirical mode decomposition(EMD) and support vector machine(SVM). Automatic target recognition process on the nonlinear and non-stationary of Doppler signals of military target by using automatic target recognition model can be expressed as follows. Firstly, the nonlinearity and non-stationary of Doppler signals were decomposed into a set of intrinsic mode functions(IMFs) using EMD. After the Hilbert transform of IMF, the energy ratio of each IMF to the total IMFs can be extracted as the features of military target. Then, the SVM was trained through using the energy ratio to classify the military targets, and genetic algorithm(GA) was used to optimize SVM parameters in the solution space. The experimental results show that this algorithm can achieve the recognition accuracies of 86.15%, 87.93%, and 82.28% for tank, vehicle and soldier, respectively.
文摘Breast Arterial Calcification(BAC)is a mammographic decision dissimilar to cancer and commonly observed in elderly women.Thus identifying BAC could provide an expense,and be inaccurate.Recently Deep Learning(DL)methods have been introduced for automatic BAC detection and quantification with increased accuracy.Previously,classification with deep learning had reached higher efficiency,but designing the structure of DL proved to be an extremely challenging task due to overfitting models.It also is not able to capture the patterns and irregularities presented in the images.To solve the overfitting problem,an optimal feature set has been formed by Enhanced Wolf Pack Algorithm(EWPA),and their irregularities are identified by Dense-kUNet segmentation.In this paper,Dense-kUNet for segmentation and optimal feature has been introduced for classification(severe,mild,light)that integrates DenseUNet and kU-Net.Longer bound links exist among adjacent modules,allowing relatively rough data to be sent to the following component and assisting the system in finding higher qualities.The major contribution of the work is to design the best features selected by Enhanced Wolf Pack Algorithm(EWPA),and Modified Support Vector Machine(MSVM)based learning for classification.k-Dense-UNet is introduced which combines the procedure of Dense-UNet and kU-Net for image segmentation.Longer bound associations occur among nearby sections,allowing relatively granular data to be sent to the next subsystem and benefiting the system in recognizing smaller characteristics.The proposed techniques and the performance are tested using several types of analysis techniques 826 filled digitized mammography.The proposed method achieved the highest precision,recall,F-measure,and accuracy of 84.4333%,84.5333%,84.4833%,and 86.8667%when compared to other methods on the Digital Database for Screening Mammography(DDSM).
基金supported by the National Natural Science Foundation of China(50576033)
文摘The pruning algorithms for sparse least squares support vector regression machine are common methods, and easily com- prehensible, but the computational burden in the training phase is heavy due to the retraining in performing the pruning process, which is not favorable for their applications. To this end, an im- proved scheme is proposed to accelerate sparse least squares support vector regression machine. A major advantage of this new scheme is based on the iterative methodology, which uses the previous training results instead of retraining, and its feasibility is strictly verified theoretically. Finally, experiments on bench- mark data sets corroborate a significant saving of the training time with the same number of support vectors and predictive accuracy compared with the original pruning algorithms, and this speedup scheme is also extended to classification problem.
文摘Word Sense Disambiguation has been a trending topic of research in Natural Language Processing and Machine Learning.Mining core features and performing the text classification still exist as a challenging task.Here the features of the context such as neighboring words like adjective provide the evidence for classification using machine learning approach.This paper presented the text document classification that has wide applications in information retrieval,which uses movie review datasets.Here the document indexing based on controlled vocabulary,adjective,word sense disambiguation,generating hierarchical cate-gorization of web pages,spam detection,topic labeling,web search,document summarization,etc.Here the kernel support vector machine learning algorithm helps to classify the text and feature extract is performed by cuckoo search opti-mization.Positive review and negative review of movie dataset is presented to get the better classification accuracy.Experimental results focused with context mining,feature analysis and classification.By comparing with the previous work,proposed work designed to achieve the efficient results.Overall design is per-formed with MATLAB 2020a tool.
基金mostly supported through the funding provided by the National Natural Science Foundation of China(Grant No.81702866)the Key Program of the Natural Science Foundation of Zhejiang Province(Grant No.LZ16H160002)+2 种基金the Zhejiang Provincial Program for the Cultivation of HighLevel Innovative Health Talentsthe Foundation of the Education Department of Zhejiang Province(Grant No.Y201636451)partially supported through funding provided by the National Natural Science Foundation of China(Grant No.81472666)。
文摘Objective:Circulating tumor cells(CTCs)play a critical role in cancer metastasis,but their prevalence and significance remain unclear.This study attempted to track the epithelial-mesenchymal transition(EMT)status of CTCs in breast cancer patients and investigate their clinical relevance.Methods:In this study,the established negFACS-IF:E/M platform was applied to isolate rare CTCs and characterize their EMT status in breast cancer.A total of 89 breast cancer patients were recruited,including stage 0–III(n=60)and late stage(n=29)cases.Results:Using the negFACS-IF:E/M platform,it was found that in human epidermal growth factor receptor 2(HER2)+patients,mesenchymal CTCs usually exhibited a high percentage of HER2+cells.Stage IV breast cancer patients had considerably more CTCs than stage 0–III patients.Among stage 0–III breast cancers,the HER2 subtype included a significantly higher percentage of mesenchymal and biphenotypic(epithelial and mesenchymal)CTCs than the luminal A or B subtypes.Among stage IV patients,CTCs were predominantly epithelial in cases with local recurrence and were more mesenchymal in cases with distant metastasis.By applying a support vector machine(SVM)algorithm,the EMT status of CTCs could distinguish between breast cancer cases with metastasis/local recurrence and those without recurrence.Conclusions:The negFACS-IF:E/M platform provides a flexible and generally acceptable method for the highly sensitive and specific detection of CTCs and their EMT traits in breast cancer.This study demonstrated that the EMT status of CTCs had high clinical relevance in breast cancer,especially in predicting the distant metastasis or local recurrence of breast cancer.
文摘For the deficiency that the traditional single forecast methods could not forecast electronic equipment states, a combined forecast method based on the hidden Markov model(HMM) and least square support vector machine(LS-SVM) is presented. The multi-agent genetic algorithm(MAGA) is used to estimate parameters of HMM to overcome the problem that the Baum-Welch algorithm is easy to fall into local optimal solution. The state condition probability is introduced into the HMM modeling process to reduce the effect of uncertain factors. MAGA is used to estimate parameters of LS-SVM. Moreover, pruning algorithms are used to estimate parameters to get the sparse approximation of LS-SVM so as to increase the ranging performance. On the basis of these, the combined forecast model of electronic equipment states is established. The example results show the superiority of the combined forecast model in terms of forecast precision,calculation speed and stability.
基金Supported by the Fundamental Funds for Central University(2662014BQ062)
文摘Pidan or century egg, also known as preserved egg, is one of the most traditional and popular egg products in China. The crack detection of preserved eggshell is very important to guarantee its quality. In this study, we develop an image algorithm for preserved eggshell's crack detection by using natural light and polarized image. Four features including crack length, crack state coefficient, maximum projection and angular point are extracted from the natural light image by morphology calculus algorithms. The support vector machines(SVM) model with radial basis kernel function is established using the four features with an accuracy of about 92%. The detection accuracy is improved to 94% by using a new characteristic parameter of crack length on polarization image. The Multi-information fusion analysis indicates the potential for cracks detection by a real-time synthesis imaging system.
基金UK Engineering and Physical Sciences Research Council for funding the research (EPSRCGrant Reference: EP/C001788/1)
文摘Near infrared spectroscopy (NIR) is now probably the most popular process analytical technology (PAT) for pharmaceutical and some other industries. However, unlike mid-IR, NIR is known to have difficulties in monitoring crystallization or precipitation processes because the existence of solids could cause distortion of the spectra. This phenomenon, seen as unfavorable previously, is however an indication that NIR spectra contain rich information about both solids and liquids, giving the possibility of using the same instrument for multiple property characterization. In this study, transflectance NIR calibration data was obtained using solutions and slurries of varied solution concentration, particle size, solid concentration and temperature. The data was used to build calibration models for prediction of the multiple properties of both phases. Predictive models were developed for this challenging application using an approach that combines genetic algorithm (GA) and support vector machine (SVM). GA is used for wavelength selection and SVM for mode building. The new GA-SVM approach is shown to outperform other methods including GA-PLS (partial least squares) and traditional SVM. NIR is thus successfully applied to monitoring seeded and unseeded cooling crystallization processes of L-glutamic acid.
文摘In this work, two chemometrics methods are applied for the modeling and prediction of electrophoretic mobilities of some organic and inorganic compounds. The successive projection algorithm, feature selection (SPA) strategy, is used as the descriptor selection and model development method. Then, the support vector machine (SVM) and multiple linear regression (MLR) model are utilized to construct the non-linear and linear quantitative structure-property relationship models. The results obtained using the SVM model are compared with those obtained using MLR reveal that the SVM model is of much better predictive value than the MLR one. The root-mean-square errors for the training set and the test set for the SVM model were 0.1911 and 0.2569, respectively, while by the MLR model, they were 0.4908 and 0.6494, respectively. The results show that the SVM model drastically enhances the ability of prediction in QSPR studies and is superior to the MLR model.
文摘Hot components operate in a high-temperature and high-pressure environment. The occurrence of a fault in hot components leads to high economic losses. In general, exhaust gas temperature(EGT) is used to monitor the performance of hot components.However, during the early stages of a failure, the fault information is weak, and is simultaneously affected by various types of interference, such as the complex working conditions, ambient conditions, gradual performance degradation of the compressors and turbines, and noise. Additionally, inadequate effective information of the gas turbine also restricts the establishment of the detection model. To solve the above problems, this paper proposes an anomaly detection method based on frequent pattern extraction. A frequent pattern model(FPM) is applied to indicate the inherent regularity of change in EGT occurring from different types of interference. In this study, based on a genetic algorithm and support vector machine regression, the relationship model between the EGT and interference was tentatively built. The modeling accuracy was then further improved through the selection of the kernel function and training data. Experiments indicate that the optimal kernel function is linear and that the optimal training data should be balanced in addition to covering the appropriate range of operating conditions and ambient temperature. Furthermore, the thresholds based on the Pauta criterion that is automatically obtained during the modeling process, are used to determine whether hot components are operating abnormally. Moreover, the FPM is compared with the similarity theory, which demonstrates that the FPM can better suppress the effect of the component performance degradation and fuel heat value fluctuation. Finally, the effectiveness of the proposed method is validated on seven months of actual data obtained from a Titan130 gas turbine on an offshore oil platform. The results indicate that the proposed method can sensitively detect malfunctions in hot components during the early stages of a fault, and is robust to various types of interference.