Ore production is usually affected by multiple influencing inputs at open-pit mines.Nevertheless,the complex nonlinear relationships between these inputs and ore production remain unclear.This becomes even more challe...Ore production is usually affected by multiple influencing inputs at open-pit mines.Nevertheless,the complex nonlinear relationships between these inputs and ore production remain unclear.This becomes even more challenging when training data(e.g.truck haulage information and weather conditions)are massive.In machine learning(ML)algorithms,deep neural network(DNN)is a superior method for processing nonlinear and massive data by adjusting the amount of neurons and hidden layers.This study adopted DNN to forecast ore production using truck haulage information and weather conditions at open-pit mines as training data.Before the prediction models were built,principal component analysis(PCA)was employed to reduce the data dimensionality and eliminate the multicollinearity among highly correlated input variables.To verify the superiority of DNN,three ANNs containing only one hidden layer and six traditional ML models were established as benchmark models.The DNN model with multiple hidden layers performed better than the ANN models with a single hidden layer.The DNN model outperformed the extensively applied benchmark models in predicting ore production.This can provide engineers and researchers with an accurate method to forecast ore production,which helps make sound budgetary decisions and mine planning at open-pit mines.展开更多
We investigated the parametric optimization on incremental sheet forming of stainless steel using Grey Relational Analysis(GRA) coupled with Principal Component Analysis(PCA). AISI 316L stainless steel sheets were use...We investigated the parametric optimization on incremental sheet forming of stainless steel using Grey Relational Analysis(GRA) coupled with Principal Component Analysis(PCA). AISI 316L stainless steel sheets were used to develop double wall angle pyramid with aid of tungsten carbide tool. GRA coupled with PCA was used to plan the experiment conditions. Control factors such as Tool Diameter(TD), Step Depth(SD), Bottom Wall Angle(BWA), Feed Rate(FR) and Spindle Speed(SS) on Top Wall Angle(TWA) and Top Wall Angle Surface Roughness(TWASR) have been studied. Wall angle increases with increasing tool diameter due to large contact area between tool and workpiece. As the step depth, feed rate and spindle speed increase,TWASR decreases with increasing tool diameter. As the step depth increasing, the hydrostatic stress is raised causing severe cracks in the deformed surface. Hence it was concluded that the proposed hybrid method was suitable for optimizing the factors and response.展开更多
With integration of renewable energy and use of non-linear loads in power systems,the power quality problem is increasingly attracting attention of researchers.In China,standards for individual power quality indexes a...With integration of renewable energy and use of non-linear loads in power systems,the power quality problem is increasingly attracting attention of researchers.In China,standards for individual power quality indexes are set.However,when evaluating power quality in practice,individual indexes cannot directly reflect a comprehensive level of power quality.In this paper,a comprehensive analysis of various indexes is conducted to obtain a unified parameter for describing the characteristics of power quality from an overall perspective.First,weight values of power quality indexes are calculated by combining the subjective and objective weight.Then,based on the principal components of the projection method,projection values of boundary data and data to be evaluated are obtained.Finally,using these projection values,a grade range for power quality data is located.A practical case study is presented to show the validity of the proposed method for evaluating power quality.展开更多
The large blast furnace is essential equipment in the process of iron and steel manufacturing. Due to the complex operation process and frequent fluctuations of variables, conventional monitoring methods often bring f...The large blast furnace is essential equipment in the process of iron and steel manufacturing. Due to the complex operation process and frequent fluctuations of variables, conventional monitoring methods often bring false alarms. To address the above problem, an ensemble of greedy dynamic principal component analysis-Gaussian mixture model(EGDPCA-GMM) is proposed in this paper. First, PCA-GMM is introduced to deal with the collinearity and the non-Gaussian distribution of blast furnace data.Second, in order to explain the dynamics of data, the greedy algorithm is used to determine the extended variables and their corresponding time lags, so as to avoid introducing unnecessary noise. Then the bagging ensemble is adopted to cooperate with greedy extension to eliminate the randomness brought by the greedy algorithm and further reduce the false alarm rate(FAR) of monitoring results. Finally, the algorithm is applied to the blast furnace of a large iron and steel group in South China to verify performance.Compared with the basic algorithms, the proposed method achieves lowest FAR, while keeping missed alarm rate(MAR) remain stable.展开更多
The composition control of molten steel is one of the main functions in the ladle furnace(LF)refining process.In this study,a feasible model was established to predict the alloying element yield using principal compon...The composition control of molten steel is one of the main functions in the ladle furnace(LF)refining process.In this study,a feasible model was established to predict the alloying element yield using principal component analysis(PCA)and deep neural network(DNN).The PCA was used to eliminate collinearity and reduce the dimension of the input variables,and then the data processed by PCA were used to establish the DNN model.The prediction hit ratios for the Si element yield in the error ranges of±1%,±3%,and±5%are 54.0%,93.8%,and98.8%,respectively,whereas those of the Mn element yield in the error ranges of±1%,±2%,and±3%are 77.0%,96.3%,and 99.5%,respectively,in the PCA-DNN model.The results demonstrate that the PCA-DNN model performs better than the known models,such as the reference heat method,multiple linear regression,modified backpropagation,and DNN model.Meanwhile,the accurate prediction of the alloying element yield can greatly contribute to realizing a“narrow window”control of composition in molten steel.The construction of the prediction model for the element yield can also provide a reference for the development of an alloying control model in LF intelligent refining in the modern iron and steel industry.展开更多
Principal component analysis(PCA)was employed to determine the implications of geochemical and isotopic data from Cenozoic volcanic activities in the Southeast Asian region,including China(South China Sea(SCS),Hainan ...Principal component analysis(PCA)was employed to determine the implications of geochemical and isotopic data from Cenozoic volcanic activities in the Southeast Asian region,including China(South China Sea(SCS),Hainan Island,Fujian-Zhejiang coast,Taiwan Island),and parts of Vietnam and Thailand.We analyzed 15 trace element indicators and 5 isotopic indicators for 623 volcanic rock samples collected from the study region.Two principal components(PCs)were extracted by PCA based on the trace elements and Sr-Nd-Pb isotopic ratios,which probably indicate an enriched oceanic island basalt-type mantle plume and a depleted mid-ocean ridge basalt-type spreading ridge.The results show that the influence of the Hainan mantle plume on younger volcanic activities(<13 Ma)is stronger than that on older ones(>13 Ma)at the same location in the Southeast Asian region.PCA was employed to verify the mantle-plume-ridge interaction model of volcanic activities beneath the expansion center of SCS and refute the hypothesis that the tension of SCS is triggered by the Hainan plume.This study reveals the efficiency and applicability of PCA in discussing mantle sources of volcanic activities;thus,PCA is a suitable research method for analyzing geochemical data.展开更多
In this research,an integrated classification method based on principal component analysis-simulated annealing genetic algorithm-fuzzy cluster means(PCA-SAGA-FCM)was proposed for the unsupervised classification of tig...In this research,an integrated classification method based on principal component analysis-simulated annealing genetic algorithm-fuzzy cluster means(PCA-SAGA-FCM)was proposed for the unsupervised classification of tight sandstone reservoirs which lack the prior information and core experiments.A variety of evaluation parameters were selected,including lithology characteristic parameters,poro-permeability quality characteristic parameters,engineering quality characteristic parameters,and pore structure characteristic parameters.The PCA was used to reduce the dimension of the evaluation pa-rameters,and the low-dimensional data was used as input.The unsupervised reservoir classification of tight sandstone reservoir was carried out by the SAGA-FCM,the characteristics of reservoir at different categories were analyzed and compared with the lithological profiles.The analysis results of numerical simulation and actual logging data show that:1)compared with FCM algorithm,SAGA-FCM has stronger stability and higher accuracy;2)the proposed method can cluster the reservoir flexibly and effectively according to the degree of membership;3)the results of reservoir integrated classification match well with the lithologic profle,which demonstrates the reliability of the classification method.展开更多
This work utilizes a statistical approach of Principal Component Ana-lysis(PCA)towards the detection of Methane(CH_(4))-Carbon Monoxide(CO)Poi-soning occurring in coal mines,forestfires,drainage systems etc.where the ...This work utilizes a statistical approach of Principal Component Ana-lysis(PCA)towards the detection of Methane(CH_(4))-Carbon Monoxide(CO)Poi-soning occurring in coal mines,forestfires,drainage systems etc.where the CH_(4) and CO emissions are very high in closed buildings or confined spaces during oxi-dation processes.Both methane and carbon monoxide are highly toxic,colorless and odorless gases.Both of the gases have their own toxic levels to be detected.But during their combined presence,the toxicity of the either one goes unidentified may be due to their low levels which may lead to an explosion.By using PCA,the correlation of CO and CH_(4) data is carried out and by identifying the areas of high correlation(along the principal component axis)the explosion suppression action can be triggered earlier thus avoiding adverse effects of massive explosions.Wire-less Sensor Network is deployed and simulations are carried with heterogeneous sensors(Carbon Monoxide and Methane sensors)in NS-2 Mannasim framework.The rise in the value of CO even when CH_(4) is below the toxic level may become hazardous to the people around.Thus our proposed methodology will detect the combined presence of both the gases(CH_(4) and CO)and provide an early warning in order to avoid any human losses or toxic effects.展开更多
Intended for good productivity and perfect operation of the solar power grid a failure-free system is required.Therefore,thermal image processing with the thermal camera is the latest non-invasive(without manual conta...Intended for good productivity and perfect operation of the solar power grid a failure-free system is required.Therefore,thermal image processing with the thermal camera is the latest non-invasive(without manual contact)type fault identification technique which may give good precision in all aspects.The soiling issue,which is major productivity affecting factor may import from several reasons such as dust on the wind,bird mucks,etc.The efficient power production sufferers due to accumulated soil deposits reaching from 1%–7%in the county,such as India,to more than 25%in middle-east countries country,such as Dubai,Kuwait,etc.This research offers a solar panel soiling detection system built on thermal imaging which powers the inspection method and mitigates the requirement for physical panel inspection in a large solar production place.Hence,in this method,solar panels can be verified by working without disturbing production operation and it will save time and price of recognition.India ranks 3rd worldwide in the usage use age of Photovoltaic(PV)panels now and it is supported about 8.6%of the Nation’s electricity need in the year 2020.In the meantime,the installed PV production areas in India are aged 4–5 years old.Hence the need for inspection and maintenance of installed PV is growing fast day by day.As a result,this research focuses on finding the soiling hotspot exactly of the working solar panels with the help of Principal Components Thermal Analysis(PCTA)on MATLAB Environment.展开更多
Machine learning algorithms (MLs) can potentially improve disease diagnostics, leading to early detection and treatment of these diseases. As a malignant tumor whose primary focus is located in the bronchial mucosal e...Machine learning algorithms (MLs) can potentially improve disease diagnostics, leading to early detection and treatment of these diseases. As a malignant tumor whose primary focus is located in the bronchial mucosal epithelium, lung cancer has the highest mortality and morbidity among cancer types, threatening health and life of patients suffering from the disease. Machine learning algorithms such as Random Forest (RF), Support Vector Machine (SVM), K-Nearest Neighbor (KNN) and Naïve Bayes (NB) have been used for lung cancer prediction. However they still face challenges such as high dimensionality of the feature space, over-fitting, high computational complexity, noise and missing data, low accuracies, low precision and high error rates. Ensemble learning, which combines classifiers, may be helpful to boost prediction on new data. However, current ensemble ML techniques rarely consider comprehensive evaluation metrics to evaluate the performance of individual classifiers. The main purpose of this study was to develop an ensemble classifier that improves lung cancer prediction. An ensemble machine learning algorithm is developed based on RF, SVM, NB, and KNN. Feature selection is done based on Principal Component Analysis (PCA) and Analysis of Variance (ANOVA). This algorithm is then executed on lung cancer data and evaluated using execution time, true positives (TP), true negatives (TN), false positives (FP), false negatives (FN), false positive rate (FPR), recall (R), precision (P) and F-measure (FM). Experimental results show that the proposed ensemble classifier has the best classification of 0.9825% with the lowest error rate of 0.0193. This is followed by SVM in which the probability of having the best classification is 0.9652% at an error rate of 0.0206. On the other hand, NB had the worst performance of 0.8475% classification at 0.0738 error rate.展开更多
Total organic carbon(TOC)content is one of the most important parameters for characterizing the quality of source rocks and assessing the hydrocarbon-generating potential of shales.The Lucaogou Formation shale reservo...Total organic carbon(TOC)content is one of the most important parameters for characterizing the quality of source rocks and assessing the hydrocarbon-generating potential of shales.The Lucaogou Formation shale reservoirs in the Jimusaer Sag,Junggar Basin,NW China,is characterized by extremely complex lithology and a wide variety of mineral compositions with source rocks mainly consisting of carbonaceous mudstone and dolomitic mudstone.The logging responses of organic matter in the shale reservoirs is quite different from those in conventional reservoirs.Analyses show that the traditional△logR method is not suitable for evaluating the TOC content in the study area.Analysis of the sensitivity characteristics of TOC content to well logs reveals that the TOC content has good correlation with the separation degree of porosity logs.After a dimension reduction processing by the principal component analysis technology,the principal components are determined through correlation analysis of porosity logs.The results show that the TOC values obtained by the new method are in good agreement with that measured by core analysis.The average absolute error of the new method is only 0.555,much less when compared with 1.222 of using traditional△logR method.The proposed method can be used to produce more accurate TOC estimates,thus providing a reliable basis for source rock mapping.展开更多
Principal component analysis (PCA) was employed to examine the effect of nutritional and bioactive compounds of legume milk chocolate as well as the sensory to document the extend of variations and their significance ...Principal component analysis (PCA) was employed to examine the effect of nutritional and bioactive compounds of legume milk chocolate as well as the sensory to document the extend of variations and their significance with plant sources. PCA identified eight significant principle components, that reduce the size of the variables into one principal component in physiochemical analysis interpreting 73.5% of the total variability with/and 78.6% of total variability explained in sensory evaluation. Score plot indicates that Double Bean milk chocolate in-corporated with MOL and CML in nutritional profile have high positive correlations. In nutritional evaluation, carbohydrates and fat content shows negative/minimal correlations whereas no negative correlations were found in sensory evaluation which implies every sensorial variable had high correlation with each other.展开更多
Detecting plant health conditions plays a key role in farm pest management and crop protection. In this study, measurement of hyperspectral leaf reflectance in rice crop (Oryzasativa L.) was conducted on groups of hea...Detecting plant health conditions plays a key role in farm pest management and crop protection. In this study, measurement of hyperspectral leaf reflectance in rice crop (Oryzasativa L.) was conducted on groups of healthy and infected leaves by the fungus Bipolaris oryzae (Helminthosporium oryzae Breda. de Hann) through the wavelength range from 350 to 2 500 nm. The percentage of leaf surface lesions was estimated and defined as the disease severity. Statistical methods like multiple stepwise regression, principal component analysis and partial least-square regression were utilized to calculate and estimate the disease severity of rice brown spot at the leaf level. Our results revealed that multiple stepwise linear regressions could efficiently estimate disease severity with three wavebands in seven steps. The root mean square errors (RMSEs) for training (n=210) and testing (n=53) dataset were 6.5% and 5.8%, respectively. Principal component analysis showed that the first principal component could explain approximately 80% of the variance of the original hyperspectral reflectance. The regression model with the first two principal components predicted a disease severity with RMSEs of 16.3% and 13.9% for the training and testing dataset, respec-tively. Partial least-square regression with seven extracted factors could most effectively predict disease severity compared with other statistical methods with RMSEs of 4.1% and 2.0% for the training and testing dataset, respectively. Our research demon-strates that it is feasible to estimate the disease severity of rice brown spot using hyperspectral reflectance data at the leaf level.展开更多
Based on the least-square minimization a computationally efficient learning algorithm for the Principal Component Analysis(PCA) is derived. The dual learning rate parameters are adaptively introduced to make the propo...Based on the least-square minimization a computationally efficient learning algorithm for the Principal Component Analysis(PCA) is derived. The dual learning rate parameters are adaptively introduced to make the proposed algorithm providing the capability of the fast convergence and high accuracy for extracting all the principal components. It is shown that all the information needed for PCA can be completely represented by the unnormalized weight vector which is updated based only on the corresponding neuron input-output product. The convergence performance of the proposed algorithm is briefly analyzed.The relation between Oja’s rule and the least squares learning rule is also established. Finally, a simulation example is given to illustrate the effectiveness of this algorithm for PCA.展开更多
The aim of this study is to investigate the condition of the Ariake Sea, Japan, which has been suffering from severe environmental issues for the past few decades. Water quality data have been generated from several p...The aim of this study is to investigate the condition of the Ariake Sea, Japan, which has been suffering from severe environmental issues for the past few decades. Water quality data have been generated from several points in this area for over 30 years by the Fukuoka, Saga, Kumamoto, and Nagasaki prefectures. In order to understand the characteristics of this sea, principal component analysis (PCA) was utilized using 11 water quality parameters;transparency, temperature, salinity, dissolved oxygen (DO), chemical oxygen demand (COD), dissolved inorganic nitrogen (DIN), ammonium-nitrogen (NH4<sup style='margin-left:-7px;'>+-N), nitrate-nitrogen (NO3<sup style='margin-left:-7px;'>--N), nitrite-nitrogen (NO2<sup style='margin-left:-7px;'>--N), phosphate-phosphorus, (PO4<sup style='margin-left:-7px;'>3--P) and silica. PCA conveyed the amount of nutrients originating from the river, the organic pollution level, and seasonal changes. Subsequently, principal component scores were calculated for each point. It was concluded that the Ariake Sea environment has been affected by two main factors, which are the nutrients from the Chikugo River and anticlockwise tidal residual flow. These two factors must be considered for the environmental restoration of the Ariake Sea.展开更多
Power load forecasting accuracy related to the development of the power system. There were so many factors influencing the power load, but their effects were not the same and what factors played a leading role could n...Power load forecasting accuracy related to the development of the power system. There were so many factors influencing the power load, but their effects were not the same and what factors played a leading role could not be determined empirically. Based on the analysis of the principal component, the paper forecasted the demands of power load with the method of the multivariate linear regression model prediction. Took the rural power grid load for example, the paper analyzed the impacts of different factors on power load, selected the forecast methods which were appropriate for using in this area, forecasted its 2014-2018 electricity load, and provided a reliable basis for grid planning.展开更多
Remote sensing and GIS techniques were employed for prioritization of the Zerqa River watershed. Forty-three 4th order sub-watersheds were prioritized based on morphometric and Principal Component Analysis (PCA), in o...Remote sensing and GIS techniques were employed for prioritization of the Zerqa River watershed. Forty-three 4th order sub-watersheds were prioritized based on morphometric and Principal Component Analysis (PCA), in order to examine the effectiveness of morphometric parameters in watershed prioritization. A comparison has been carried out between the results achieved through applying the two methods of analysis (morphometric and PCA). Afterwards, suitable measures are proposed for soil and water conservation. Topo sheets and ASTER DEM have been employed to demarcate the 43 sub-watersheds, to extract the drainage networks, and to compile the required thematic maps such as slope categories and elevation. LANDSAT 8 image (April-2015) is employed to generate land use/cover maps using ENVI (v 5.1) software. The soil map of the watershed has been digitized using Arc GIS software. Prioritization of the 43 sub-watersheds was performed using ten linear and shape parameters, and three parameters which are highly correlated with components 1 and 2. Subsequently, different sub-watersheds were prioritized by ascribing ranks based on the calculated compound parameters (Cp) using the two approaches. Comparison of the results revealed that prioritization of watersheds based on morphometric analysis is more consistent and serves for better decision making in conservation planning as compared with the PCA approach. The recommended soil conservation measures are prescribed in accordance with the specified priority, in order to avoid undesirable effects on land and environment. Sub-watersheds classified under high priority class are subjected to high erosion risk, thus, creating an urgent need for applying soil and water conservation measures. It is expected that decision makers will pay sufficient attention to the present results/information, activate programs encouraging soil conservation, integrated watershed management, and will continue working on the afforestation of the government-owned sloping lands. Such a viable approach can be applied at different parts of the rainfed highland areas to minimize soil erosion loss, and to increase infiltration and soil moisture in the soil profile, thus, reducing the impact of recurrent droughts and the possibility of flooding hazards.展开更多
According to the ecological safety evaluation index data of land-use change in Ji'an City from 1999 to 2008,positive treatment on selected reverse indices is conducted by Reciprocal Method.Meanwhile,Index Method i...According to the ecological safety evaluation index data of land-use change in Ji'an City from 1999 to 2008,positive treatment on selected reverse indices is conducted by Reciprocal Method.Meanwhile,Index Method is used to standardize the selected indices,and Principal Component Analysis is applied by using year as a unit.FB is obtained,which is related with the ecological safety of land-use change from 1999 to 2008.According to the scientific,integrative,hierarchical,practical and dynamic principles,ecological safety evaluation index system of land-use change in Ji'an City is established.Principal Component Analysis and evaluation model are used to calculate four parameters,including the natural resources safety index of land use,the socio-economic safety indicators of land use,the eco-environmental safety index of land use,and the ecological safety degree of land use in Ji'an City.Result indicates that the ecological safety degree of land use in Ji'an City shows a slow upward trend as a whole.At the same time,ecological safety degree of land-use change is relatively low in Ji'an City with the safety value of 0.645,which is at a weak safety zone and needs further monitoring and maintenance.展开更多
基金This work was supported by the Pilot Seed Grant(Grant No.RES0049944)the Collaborative Research Project(Grant No.RES0043251)from the University of Alberta.
文摘Ore production is usually affected by multiple influencing inputs at open-pit mines.Nevertheless,the complex nonlinear relationships between these inputs and ore production remain unclear.This becomes even more challenging when training data(e.g.truck haulage information and weather conditions)are massive.In machine learning(ML)algorithms,deep neural network(DNN)is a superior method for processing nonlinear and massive data by adjusting the amount of neurons and hidden layers.This study adopted DNN to forecast ore production using truck haulage information and weather conditions at open-pit mines as training data.Before the prediction models were built,principal component analysis(PCA)was employed to reduce the data dimensionality and eliminate the multicollinearity among highly correlated input variables.To verify the superiority of DNN,three ANNs containing only one hidden layer and six traditional ML models were established as benchmark models.The DNN model with multiple hidden layers performed better than the ANN models with a single hidden layer.The DNN model outperformed the extensively applied benchmark models in predicting ore production.This can provide engineers and researchers with an accurate method to forecast ore production,which helps make sound budgetary decisions and mine planning at open-pit mines.
文摘We investigated the parametric optimization on incremental sheet forming of stainless steel using Grey Relational Analysis(GRA) coupled with Principal Component Analysis(PCA). AISI 316L stainless steel sheets were used to develop double wall angle pyramid with aid of tungsten carbide tool. GRA coupled with PCA was used to plan the experiment conditions. Control factors such as Tool Diameter(TD), Step Depth(SD), Bottom Wall Angle(BWA), Feed Rate(FR) and Spindle Speed(SS) on Top Wall Angle(TWA) and Top Wall Angle Surface Roughness(TWASR) have been studied. Wall angle increases with increasing tool diameter due to large contact area between tool and workpiece. As the step depth, feed rate and spindle speed increase,TWASR decreases with increasing tool diameter. As the step depth increasing, the hydrostatic stress is raised causing severe cracks in the deformed surface. Hence it was concluded that the proposed hybrid method was suitable for optimizing the factors and response.
基金supported by National Natural Science Foundation of China(NSFC)(51477111)National Key Research and Development Program of China(2016YFB0901104).
文摘With integration of renewable energy and use of non-linear loads in power systems,the power quality problem is increasingly attracting attention of researchers.In China,standards for individual power quality indexes are set.However,when evaluating power quality in practice,individual indexes cannot directly reflect a comprehensive level of power quality.In this paper,a comprehensive analysis of various indexes is conducted to obtain a unified parameter for describing the characteristics of power quality from an overall perspective.First,weight values of power quality indexes are calculated by combining the subjective and objective weight.Then,based on the principal components of the projection method,projection values of boundary data and data to be evaluated are obtained.Finally,using these projection values,a grade range for power quality data is located.A practical case study is presented to show the validity of the proposed method for evaluating power quality.
基金supported by the National Natural Science Foundation of China (61903326, 61933015)。
文摘The large blast furnace is essential equipment in the process of iron and steel manufacturing. Due to the complex operation process and frequent fluctuations of variables, conventional monitoring methods often bring false alarms. To address the above problem, an ensemble of greedy dynamic principal component analysis-Gaussian mixture model(EGDPCA-GMM) is proposed in this paper. First, PCA-GMM is introduced to deal with the collinearity and the non-Gaussian distribution of blast furnace data.Second, in order to explain the dynamics of data, the greedy algorithm is used to determine the extended variables and their corresponding time lags, so as to avoid introducing unnecessary noise. Then the bagging ensemble is adopted to cooperate with greedy extension to eliminate the randomness brought by the greedy algorithm and further reduce the false alarm rate(FAR) of monitoring results. Finally, the algorithm is applied to the blast furnace of a large iron and steel group in South China to verify performance.Compared with the basic algorithms, the proposed method achieves lowest FAR, while keeping missed alarm rate(MAR) remain stable.
基金supported by the National Natural Science Foundation of China(No.51974023)State Key Laboratory of Advanced Metallurgy,University of Science and Technology Beijing(No.41621005)。
文摘The composition control of molten steel is one of the main functions in the ladle furnace(LF)refining process.In this study,a feasible model was established to predict the alloying element yield using principal component analysis(PCA)and deep neural network(DNN).The PCA was used to eliminate collinearity and reduce the dimension of the input variables,and then the data processed by PCA were used to establish the DNN model.The prediction hit ratios for the Si element yield in the error ranges of±1%,±3%,and±5%are 54.0%,93.8%,and98.8%,respectively,whereas those of the Mn element yield in the error ranges of±1%,±2%,and±3%are 77.0%,96.3%,and 99.5%,respectively,in the PCA-DNN model.The results demonstrate that the PCA-DNN model performs better than the known models,such as the reference heat method,multiple linear regression,modified backpropagation,and DNN model.Meanwhile,the accurate prediction of the alloying element yield can greatly contribute to realizing a“narrow window”control of composition in molten steel.The construction of the prediction model for the element yield can also provide a reference for the development of an alloying control model in LF intelligent refining in the modern iron and steel industry.
基金Supported by the State Key Laboratory of Marine Environmental Science Visiting Fellowship(No.MELRS2233)the State Key Laboratory of Marine Geology,Tongji University(No.MGK202302)+4 种基金the Innovation Group Project of Southern Marine Science and Engineering Guangdong Laboratory(Zhuhai)(No.311021003)the Zhujiang Talent Project Foundation of Guangdong Province(No.2017ZT07Z066)the Fundamental Research Funds for the Central Universities,Sun Yat-sen University(Nos.22qntd2101,2021qntd23)the Major Projects of the National Natural Science Foundation of China(Nos.41790465,41590863)the National Natural Science Foundation of China(Nos.42102333,41806077,41904045)。
文摘Principal component analysis(PCA)was employed to determine the implications of geochemical and isotopic data from Cenozoic volcanic activities in the Southeast Asian region,including China(South China Sea(SCS),Hainan Island,Fujian-Zhejiang coast,Taiwan Island),and parts of Vietnam and Thailand.We analyzed 15 trace element indicators and 5 isotopic indicators for 623 volcanic rock samples collected from the study region.Two principal components(PCs)were extracted by PCA based on the trace elements and Sr-Nd-Pb isotopic ratios,which probably indicate an enriched oceanic island basalt-type mantle plume and a depleted mid-ocean ridge basalt-type spreading ridge.The results show that the influence of the Hainan mantle plume on younger volcanic activities(<13 Ma)is stronger than that on older ones(>13 Ma)at the same location in the Southeast Asian region.PCA was employed to verify the mantle-plume-ridge interaction model of volcanic activities beneath the expansion center of SCS and refute the hypothesis that the tension of SCS is triggered by the Hainan plume.This study reveals the efficiency and applicability of PCA in discussing mantle sources of volcanic activities;thus,PCA is a suitable research method for analyzing geochemical data.
基金funded by the National Natural Science Foundation of China(42174131)the Strategic Cooperation Technology Projects of CNPC and CUPB(ZLZX2020-03).
文摘In this research,an integrated classification method based on principal component analysis-simulated annealing genetic algorithm-fuzzy cluster means(PCA-SAGA-FCM)was proposed for the unsupervised classification of tight sandstone reservoirs which lack the prior information and core experiments.A variety of evaluation parameters were selected,including lithology characteristic parameters,poro-permeability quality characteristic parameters,engineering quality characteristic parameters,and pore structure characteristic parameters.The PCA was used to reduce the dimension of the evaluation pa-rameters,and the low-dimensional data was used as input.The unsupervised reservoir classification of tight sandstone reservoir was carried out by the SAGA-FCM,the characteristics of reservoir at different categories were analyzed and compared with the lithological profiles.The analysis results of numerical simulation and actual logging data show that:1)compared with FCM algorithm,SAGA-FCM has stronger stability and higher accuracy;2)the proposed method can cluster the reservoir flexibly and effectively according to the degree of membership;3)the results of reservoir integrated classification match well with the lithologic profle,which demonstrates the reliability of the classification method.
文摘This work utilizes a statistical approach of Principal Component Ana-lysis(PCA)towards the detection of Methane(CH_(4))-Carbon Monoxide(CO)Poi-soning occurring in coal mines,forestfires,drainage systems etc.where the CH_(4) and CO emissions are very high in closed buildings or confined spaces during oxi-dation processes.Both methane and carbon monoxide are highly toxic,colorless and odorless gases.Both of the gases have their own toxic levels to be detected.But during their combined presence,the toxicity of the either one goes unidentified may be due to their low levels which may lead to an explosion.By using PCA,the correlation of CO and CH_(4) data is carried out and by identifying the areas of high correlation(along the principal component axis)the explosion suppression action can be triggered earlier thus avoiding adverse effects of massive explosions.Wire-less Sensor Network is deployed and simulations are carried with heterogeneous sensors(Carbon Monoxide and Methane sensors)in NS-2 Mannasim framework.The rise in the value of CO even when CH_(4) is below the toxic level may become hazardous to the people around.Thus our proposed methodology will detect the combined presence of both the gases(CH_(4) and CO)and provide an early warning in order to avoid any human losses or toxic effects.
文摘Intended for good productivity and perfect operation of the solar power grid a failure-free system is required.Therefore,thermal image processing with the thermal camera is the latest non-invasive(without manual contact)type fault identification technique which may give good precision in all aspects.The soiling issue,which is major productivity affecting factor may import from several reasons such as dust on the wind,bird mucks,etc.The efficient power production sufferers due to accumulated soil deposits reaching from 1%–7%in the county,such as India,to more than 25%in middle-east countries country,such as Dubai,Kuwait,etc.This research offers a solar panel soiling detection system built on thermal imaging which powers the inspection method and mitigates the requirement for physical panel inspection in a large solar production place.Hence,in this method,solar panels can be verified by working without disturbing production operation and it will save time and price of recognition.India ranks 3rd worldwide in the usage use age of Photovoltaic(PV)panels now and it is supported about 8.6%of the Nation’s electricity need in the year 2020.In the meantime,the installed PV production areas in India are aged 4–5 years old.Hence the need for inspection and maintenance of installed PV is growing fast day by day.As a result,this research focuses on finding the soiling hotspot exactly of the working solar panels with the help of Principal Components Thermal Analysis(PCTA)on MATLAB Environment.
文摘Machine learning algorithms (MLs) can potentially improve disease diagnostics, leading to early detection and treatment of these diseases. As a malignant tumor whose primary focus is located in the bronchial mucosal epithelium, lung cancer has the highest mortality and morbidity among cancer types, threatening health and life of patients suffering from the disease. Machine learning algorithms such as Random Forest (RF), Support Vector Machine (SVM), K-Nearest Neighbor (KNN) and Naïve Bayes (NB) have been used for lung cancer prediction. However they still face challenges such as high dimensionality of the feature space, over-fitting, high computational complexity, noise and missing data, low accuracies, low precision and high error rates. Ensemble learning, which combines classifiers, may be helpful to boost prediction on new data. However, current ensemble ML techniques rarely consider comprehensive evaluation metrics to evaluate the performance of individual classifiers. The main purpose of this study was to develop an ensemble classifier that improves lung cancer prediction. An ensemble machine learning algorithm is developed based on RF, SVM, NB, and KNN. Feature selection is done based on Principal Component Analysis (PCA) and Analysis of Variance (ANOVA). This algorithm is then executed on lung cancer data and evaluated using execution time, true positives (TP), true negatives (TN), false positives (FP), false negatives (FN), false positive rate (FPR), recall (R), precision (P) and F-measure (FM). Experimental results show that the proposed ensemble classifier has the best classification of 0.9825% with the lowest error rate of 0.0193. This is followed by SVM in which the probability of having the best classification is 0.9652% at an error rate of 0.0206. On the other hand, NB had the worst performance of 0.8475% classification at 0.0738 error rate.
基金This research was funded by the National Natural Science Foundation of China(Grant No.41504103).
文摘Total organic carbon(TOC)content is one of the most important parameters for characterizing the quality of source rocks and assessing the hydrocarbon-generating potential of shales.The Lucaogou Formation shale reservoirs in the Jimusaer Sag,Junggar Basin,NW China,is characterized by extremely complex lithology and a wide variety of mineral compositions with source rocks mainly consisting of carbonaceous mudstone and dolomitic mudstone.The logging responses of organic matter in the shale reservoirs is quite different from those in conventional reservoirs.Analyses show that the traditional△logR method is not suitable for evaluating the TOC content in the study area.Analysis of the sensitivity characteristics of TOC content to well logs reveals that the TOC content has good correlation with the separation degree of porosity logs.After a dimension reduction processing by the principal component analysis technology,the principal components are determined through correlation analysis of porosity logs.The results show that the TOC values obtained by the new method are in good agreement with that measured by core analysis.The average absolute error of the new method is only 0.555,much less when compared with 1.222 of using traditional△logR method.The proposed method can be used to produce more accurate TOC estimates,thus providing a reliable basis for source rock mapping.
文摘Principal component analysis (PCA) was employed to examine the effect of nutritional and bioactive compounds of legume milk chocolate as well as the sensory to document the extend of variations and their significance with plant sources. PCA identified eight significant principle components, that reduce the size of the variables into one principal component in physiochemical analysis interpreting 73.5% of the total variability with/and 78.6% of total variability explained in sensory evaluation. Score plot indicates that Double Bean milk chocolate in-corporated with MOL and CML in nutritional profile have high positive correlations. In nutritional evaluation, carbohydrates and fat content shows negative/minimal correlations whereas no negative correlations were found in sensory evaluation which implies every sensorial variable had high correlation with each other.
基金the Hi-Tech Research and Development Program (863) of China (No. 2006AA10Z203)the National Scienceand Technology Task Force Project (No. 2006BAD10A01), China
文摘Detecting plant health conditions plays a key role in farm pest management and crop protection. In this study, measurement of hyperspectral leaf reflectance in rice crop (Oryzasativa L.) was conducted on groups of healthy and infected leaves by the fungus Bipolaris oryzae (Helminthosporium oryzae Breda. de Hann) through the wavelength range from 350 to 2 500 nm. The percentage of leaf surface lesions was estimated and defined as the disease severity. Statistical methods like multiple stepwise regression, principal component analysis and partial least-square regression were utilized to calculate and estimate the disease severity of rice brown spot at the leaf level. Our results revealed that multiple stepwise linear regressions could efficiently estimate disease severity with three wavebands in seven steps. The root mean square errors (RMSEs) for training (n=210) and testing (n=53) dataset were 6.5% and 5.8%, respectively. Principal component analysis showed that the first principal component could explain approximately 80% of the variance of the original hyperspectral reflectance. The regression model with the first two principal components predicted a disease severity with RMSEs of 16.3% and 13.9% for the training and testing dataset, respec-tively. Partial least-square regression with seven extracted factors could most effectively predict disease severity compared with other statistical methods with RMSEs of 4.1% and 2.0% for the training and testing dataset, respectively. Our research demon-strates that it is feasible to estimate the disease severity of rice brown spot using hyperspectral reflectance data at the leaf level.
基金Supported by the National Natural Science Foundation of Chinathe Science foundation of Guangxi Educational Administration
文摘Based on the least-square minimization a computationally efficient learning algorithm for the Principal Component Analysis(PCA) is derived. The dual learning rate parameters are adaptively introduced to make the proposed algorithm providing the capability of the fast convergence and high accuracy for extracting all the principal components. It is shown that all the information needed for PCA can be completely represented by the unnormalized weight vector which is updated based only on the corresponding neuron input-output product. The convergence performance of the proposed algorithm is briefly analyzed.The relation between Oja’s rule and the least squares learning rule is also established. Finally, a simulation example is given to illustrate the effectiveness of this algorithm for PCA.
文摘The aim of this study is to investigate the condition of the Ariake Sea, Japan, which has been suffering from severe environmental issues for the past few decades. Water quality data have been generated from several points in this area for over 30 years by the Fukuoka, Saga, Kumamoto, and Nagasaki prefectures. In order to understand the characteristics of this sea, principal component analysis (PCA) was utilized using 11 water quality parameters;transparency, temperature, salinity, dissolved oxygen (DO), chemical oxygen demand (COD), dissolved inorganic nitrogen (DIN), ammonium-nitrogen (NH4<sup style='margin-left:-7px;'>+-N), nitrate-nitrogen (NO3<sup style='margin-left:-7px;'>--N), nitrite-nitrogen (NO2<sup style='margin-left:-7px;'>--N), phosphate-phosphorus, (PO4<sup style='margin-left:-7px;'>3--P) and silica. PCA conveyed the amount of nutrients originating from the river, the organic pollution level, and seasonal changes. Subsequently, principal component scores were calculated for each point. It was concluded that the Ariake Sea environment has been affected by two main factors, which are the nutrients from the Chikugo River and anticlockwise tidal residual flow. These two factors must be considered for the environmental restoration of the Ariake Sea.
基金Supported by the Science and Technology Research Project Fund of Provincial Department of Education(12531004)Project of Heilongjiang Leading Talent Echelon Talented(2012)
文摘Power load forecasting accuracy related to the development of the power system. There were so many factors influencing the power load, but their effects were not the same and what factors played a leading role could not be determined empirically. Based on the analysis of the principal component, the paper forecasted the demands of power load with the method of the multivariate linear regression model prediction. Took the rural power grid load for example, the paper analyzed the impacts of different factors on power load, selected the forecast methods which were appropriate for using in this area, forecasted its 2014-2018 electricity load, and provided a reliable basis for grid planning.
文摘Remote sensing and GIS techniques were employed for prioritization of the Zerqa River watershed. Forty-three 4th order sub-watersheds were prioritized based on morphometric and Principal Component Analysis (PCA), in order to examine the effectiveness of morphometric parameters in watershed prioritization. A comparison has been carried out between the results achieved through applying the two methods of analysis (morphometric and PCA). Afterwards, suitable measures are proposed for soil and water conservation. Topo sheets and ASTER DEM have been employed to demarcate the 43 sub-watersheds, to extract the drainage networks, and to compile the required thematic maps such as slope categories and elevation. LANDSAT 8 image (April-2015) is employed to generate land use/cover maps using ENVI (v 5.1) software. The soil map of the watershed has been digitized using Arc GIS software. Prioritization of the 43 sub-watersheds was performed using ten linear and shape parameters, and three parameters which are highly correlated with components 1 and 2. Subsequently, different sub-watersheds were prioritized by ascribing ranks based on the calculated compound parameters (Cp) using the two approaches. Comparison of the results revealed that prioritization of watersheds based on morphometric analysis is more consistent and serves for better decision making in conservation planning as compared with the PCA approach. The recommended soil conservation measures are prescribed in accordance with the specified priority, in order to avoid undesirable effects on land and environment. Sub-watersheds classified under high priority class are subjected to high erosion risk, thus, creating an urgent need for applying soil and water conservation measures. It is expected that decision makers will pay sufficient attention to the present results/information, activate programs encouraging soil conservation, integrated watershed management, and will continue working on the afforestation of the government-owned sloping lands. Such a viable approach can be applied at different parts of the rainfed highland areas to minimize soil erosion loss, and to increase infiltration and soil moisture in the soil profile, thus, reducing the impact of recurrent droughts and the possibility of flooding hazards.
基金Supported by Major Project of Chinese National Programs for Fundamental Research and Development Program(2009CB219401)Key Project of Natural Science Foundation of China(40534019)
文摘According to the ecological safety evaluation index data of land-use change in Ji'an City from 1999 to 2008,positive treatment on selected reverse indices is conducted by Reciprocal Method.Meanwhile,Index Method is used to standardize the selected indices,and Principal Component Analysis is applied by using year as a unit.FB is obtained,which is related with the ecological safety of land-use change from 1999 to 2008.According to the scientific,integrative,hierarchical,practical and dynamic principles,ecological safety evaluation index system of land-use change in Ji'an City is established.Principal Component Analysis and evaluation model are used to calculate four parameters,including the natural resources safety index of land use,the socio-economic safety indicators of land use,the eco-environmental safety index of land use,and the ecological safety degree of land use in Ji'an City.Result indicates that the ecological safety degree of land use in Ji'an City shows a slow upward trend as a whole.At the same time,ecological safety degree of land-use change is relatively low in Ji'an City with the safety value of 0.645,which is at a weak safety zone and needs further monitoring and maintenance.