期刊文献+
共找到39,902篇文章
< 1 2 250 >
每页显示 20 50 100
基于贝叶斯超参数优化的Gradient Boosting方法的导弹气动特性预测
1
作者 崔榕峰 马海 +2 位作者 郭承鹏 李鸿岩 刘哲 《航空科学技术》 2023年第7期22-28,共7页
在导弹设计与研发的初期阶段,需要寻求高效且低成本的导弹气动力特性的分析方法。然而,气动性能分析过程中往往存在试验成本高、周期长、局限性大等问题。因此,本文采用基于提升(Boosting)的机器学习集成算法进行导弹气动特性预测,通过... 在导弹设计与研发的初期阶段,需要寻求高效且低成本的导弹气动力特性的分析方法。然而,气动性能分析过程中往往存在试验成本高、周期长、局限性大等问题。因此,本文采用基于提升(Boosting)的机器学习集成算法进行导弹气动特性预测,通过输入导弹的气动外形参数、马赫数和迎角,对于导弹气动力系数实现快速预测。结果表明,Boosting能够对导弹气动力系数进行精准高效预测。为进一步提升预测精度,与传统的机器学习参数调整方法相比,采用贝叶斯优化方法对梯度提升(Gradient Boosting)算法超参数进行优化,调优后的Gradient Boosting方法预测的导弹气动力系数与实际值吻合度得到提升,并将贝叶斯优化的Gradient Boosting方法与XGBoost、LightGBM、Adaboost方法进行了对比,贝叶斯优化的Gradient Boosting方法预测精度优于其他Boosting方法,证明了优化方法的可行性与有效性。 展开更多
关键词 导弹 气动特性 boosting gradient boosting 贝叶斯优化
下载PDF
Light gradient boosting machine with optimized hyperparameters for identification of malicious access in IoT network
2
作者 Debasmita Mishra Bighnaraj Naik +3 位作者 Janmenjoy Nayak Alireza Souri Pandit Byomakesha Dash S.Vimal 《Digital Communications and Networks》 SCIE CSCD 2023年第1期125-137,共13页
In this paper,an advanced and optimized Light Gradient Boosting Machine(LGBM)technique is proposed to identify the intrusive activities in the Internet of Things(IoT)network.The followings are the major contributions:... In this paper,an advanced and optimized Light Gradient Boosting Machine(LGBM)technique is proposed to identify the intrusive activities in the Internet of Things(IoT)network.The followings are the major contributions:i)An optimized LGBM model has been developed for the identification of malicious IoT activities in the IoT network;ii)An efficient evolutionary optimization approach has been adopted for finding the optimal set of hyper-parameters of LGBM for the projected problem.Here,a Genetic Algorithm(GA)with k-way tournament selection and uniform crossover operation is used for efficient exploration of hyper-parameter search space;iii)Finally,the performance of the proposed model is evaluated using state-of-the-art ensemble learning and machine learning-based model to achieve overall generalized performance and efficiency.Simulation outcomes reveal that the proposed approach is superior to other considered methods and proves to be a robust approach to intrusion detection in an IoT environment. 展开更多
关键词 IoT security Ensemble method Light gradient boosting machine Machine learning Intrusion detection
下载PDF
A Data-Driven Oil Production Prediction Method Based on the Gradient Boosting Decision Tree Regression
3
作者 Hongfei Ma Wenqi Zhao +1 位作者 Yurong Zhao Yu He 《Computer Modeling in Engineering & Sciences》 SCIE EI 2023年第3期1773-1790,共18页
Accurate prediction ofmonthly oil and gas production is essential for oil enterprises tomake reasonable production plans,avoid blind investment and realize sustainable development.Traditional oil well production trend... Accurate prediction ofmonthly oil and gas production is essential for oil enterprises tomake reasonable production plans,avoid blind investment and realize sustainable development.Traditional oil well production trend prediction methods are based on years of oil field production experience and expertise,and the application conditions are very demanding.With the rapid development of artificial intelligence technology,big data analysis methods are gradually applied in various sub-fields of the oil and gas reservoir development.Based on the data-driven artificial intelligence algorithmGradient BoostingDecision Tree(GBDT),this paper predicts the initial single-layer production by considering geological data,fluid PVT data and well data.The results show that the GBDT algorithm prediction model has great accuracy,significantly improving efficiency and strong universal applicability.The GBDTmethod trained in this paper can predict production,which is helpful for well site optimization,perforation layer optimization and engineering parameter optimization and has guiding significance for oilfield development. 展开更多
关键词 gradient boosting decision tree production prediction data analysis
下载PDF
Object-Based Burned Area Mapping with Extreme Gradient Boosting Using Sentinel-2 Imagery
4
作者 Dimitris Stavrakoudis Ioannis Z. Gitas 《Journal of Geographic Information System》 2023年第1期53-72,共20页
The Sentinel-2 satellites are providing an unparalleled wealth of high-resolution remotely sensed information with a short revisit cycle, which is ideal for mapping burned areas both accurately and timely. This paper ... The Sentinel-2 satellites are providing an unparalleled wealth of high-resolution remotely sensed information with a short revisit cycle, which is ideal for mapping burned areas both accurately and timely. This paper proposes an automated methodology for mapping burn scars using pairs of Sentinel-2 imagery, exploiting the state-of-the-art eXtreme Gradient Boosting (XGB) machine learning framework. A large database of 64 reference wildfire perimeters in Greece from 2016 to 2019 is used to train the classifier. An empirical methodology for appropriately sampling the training patterns from this database is formulated, which guarantees the effectiveness of the approach and its computational efficiency. A difference (pre-fire minus post-fire) spectral index is used for this purpose, upon which we appropriately identify the clear and fuzzy value ranges. To reduce the data volume, a super-pixel segmentation of the images is also employed, implemented via the QuickShift algorithm. The cross-validation results showcase the effectiveness of the proposed algorithm, with the average commission and omission errors being 9% and 2%, respectively, and the average Matthews correlation coefficient (MCC) equal to 0.93. 展开更多
关键词 Operational Burned Area Mapping Sentinel-2 Extreme gradient boosting (XGB) QuickShift Segmentation Machine Learning
下载PDF
A Hybrid Ensemble Learning Approach Utilizing Light Gradient Boosting Machine and Category Boosting Model for Lifestyle-Based Prediction of Type-II Diabetes Mellitus
5
作者 Mahadi Nagassou Ronald Waweru Mwangi Euna Nyarige 《Journal of Data Analysis and Information Processing》 2023年第4期480-511,共32页
Addressing classification and prediction challenges, tree ensemble models have gained significant importance. Boosting ensemble techniques are commonly employed for forecasting Type-II diabetes mellitus. Light Gradien... Addressing classification and prediction challenges, tree ensemble models have gained significant importance. Boosting ensemble techniques are commonly employed for forecasting Type-II diabetes mellitus. Light Gradient Boosting Machine (LightGBM) is a widely used algorithm known for its leaf growth strategy, loss reduction, and enhanced training precision. However, LightGBM is prone to overfitting. In contrast, CatBoost utilizes balanced base predictors known as decision tables, which mitigate overfitting risks and significantly improve testing time efficiency. CatBoost’s algorithm structure counteracts gradient boosting biases and incorporates an overfitting detector to stop training early. This study focuses on developing a hybrid model that combines LightGBM and CatBoost to minimize overfitting and improve accuracy by reducing variance. For the purpose of finding the best hyperparameters to use with the underlying learners, the Bayesian hyperparameter optimization method is used. By fine-tuning the regularization parameter values, the hybrid model effectively reduces variance (overfitting). Comparative evaluation against LightGBM, CatBoost, XGBoost, Decision Tree, Random Forest, AdaBoost, and GBM algorithms demonstrates that the hybrid model has the best F1-score (99.37%), recall (99.25%), and accuracy (99.37%). Consequently, the proposed framework holds promise for early diabetes prediction in the healthcare industry and exhibits potential applicability to other datasets sharing similarities with diabetes. 展开更多
关键词 boosting Ensemble Learning Category boosting Light gradient boosting Machine
下载PDF
Prediction of undrained shear strength using extreme gradient boosting and random forest based on Bayesian optimization 被引量:33
6
作者 Wengang Zhang Chongzhi Wu +2 位作者 Haiyi Zhong Yongqin Li Lin Wang 《Geoscience Frontiers》 SCIE CAS CSCD 2021年第1期469-477,共9页
Accurate assessment of undrained shear strength(USS)for soft sensitive clays is a great concern in geotechnical engineering practice.This study applies novel data-driven extreme gradient boosting(XGBoost)and random fo... Accurate assessment of undrained shear strength(USS)for soft sensitive clays is a great concern in geotechnical engineering practice.This study applies novel data-driven extreme gradient boosting(XGBoost)and random forest(RF)ensemble learning methods for capturing the relationships between the USS and various basic soil parameters.Based on the soil data sets from TC304 database,a general approach is developed to predict the USS of soft clays using the two machine learning methods above,where five feature variables including the preconsolidation stress(PS),vertical effective stress(VES),liquid limit(LL),plastic limit(PL)and natural water content(W)are adopted.To reduce the dependence on the rule of thumb and inefficient brute-force search,the Bayesian optimization method is applied to determine the appropriate model hyper-parameters of both XGBoost and RF.The developed models are comprehensively compared with three comparison machine learning methods and two transformation models with respect to predictive accuracy and robustness under 5-fold cross-validation(CV).It is shown that XGBoost-based and RF-based methods outperform these approaches.Besides,the XGBoostbased model provides feature importance ranks,which makes it a promising tool in the prediction of geotechnical parameters and enhances the interpretability of model. 展开更多
关键词 Undrained shear strength Extreme gradient boosting Random forest Bayesian optimization k-fold CV
下载PDF
基于优化的Inception ResNet A模块与Gradient Boosting的人群计数方法 被引量:8
7
作者 郭瑞琴 陈雄杰 +1 位作者 骆炜 符长虹 《同济大学学报(自然科学版)》 EI CAS CSCD 北大核心 2019年第8期1216-1224,共9页
针对人群计数问题,基于优化Inception-ResNet-A模块,使用集成学习中的Gradient Boosting方法提出了一种可用于稀疏人群和密集人群的人群计数方法,并给出此方法实现的具体细节.通过在三个公开数据集和真实场景(含光照和视角变化)中进行测... 针对人群计数问题,基于优化Inception-ResNet-A模块,使用集成学习中的Gradient Boosting方法提出了一种可用于稀疏人群和密集人群的人群计数方法,并给出此方法实现的具体细节.通过在三个公开数据集和真实场景(含光照和视角变化)中进行测试,检验了该方法对于光照、人群密度、视角等变化的鲁棒性.实验结果表明,该方法对于以上变化具有较强的鲁棒性,并且相比于之前的人群计数方法在准确性和稳定性方面具有更好的性能. 展开更多
关键词 人群计数 优化Inception-ResNet-A模块 gradient boosting 多尺度特征 感知野
下载PDF
Forecasting Multi-Step Ahead Monthly Reference Evapotranspiration Using Hybrid Extreme Gradient Boosting with Grey Wolf Optimization Algorithm 被引量:1
8
作者 Xianghui Lu Junliang Fan +1 位作者 Lifeng Wu Jianhua Dong 《Computer Modeling in Engineering & Sciences》 SCIE EI 2020年第11期699-723,共25页
It is important for regional water resources management to know the agricultural water consumption information several months in advance.Forecasting reference evapotranspiration(ET_(0))in the next few months is import... It is important for regional water resources management to know the agricultural water consumption information several months in advance.Forecasting reference evapotranspiration(ET_(0))in the next few months is important for irrigation and reservoir management.Studies on forecasting of multiple-month ahead ET_(0) using machine learning models have not been reported yet.Besides,machine learning models such as the XGBoost model has multiple parameters that need to be tuned,and traditional methods can get stuck in a regional optimal solution and fail to obtain a global optimal solution.This study investigated the performance of the hybrid extreme gradient boosting(XGBoost)model coupled with the Grey Wolf Optimizer(GWO)algorithm for forecasting multi-step ahead ET_(0)(1-3 months ahead),compared with three conventional machine learning models,i.e.,standalone XGBoost,multi-layer perceptron(MLP)and M5 model tree(M5)models in the subtropical zone of China.The results showed that theGWO-XGB model generally performed better than the other three machine learning models in forecasting 1-3 months ahead ET_(0),followed by the XGB,M5 and MLP models with very small differences among the three models.The GWO-XGB model performed best in autumn,while the MLP model performed slightly better than the other three models in summer.It is thus suggested to apply the MLP model for ET_(0) forecasting in summer but use the GWO-XGB model in other seasons. 展开更多
关键词 Reference evapotranspiration extreme gradient boosting Grey Wolf Optimizer multi-layer perceptron M5 model tree
下载PDF
Gradient Boosting算法在典型浅埋煤层液压支架选型中的应用 被引量:4
9
作者 张杰 孙遥 +3 位作者 谢党虎 蔡维山 刘清洲 龙晶晶 《煤矿安全》 CAS 北大核心 2020年第7期166-170,175,共6页
针对目前工作面液压支架阻力确定方法的不足,提出了1种新的预测方法,采用改进后的逻辑斯提算法(LR)来优化梯度提升回归(GBRT)模型,以此来预测液压支架阻力。在GBRT中加入学习速率来限制子模型的学习速率,防止其过拟合;应用LR对样本参数... 针对目前工作面液压支架阻力确定方法的不足,提出了1种新的预测方法,采用改进后的逻辑斯提算法(LR)来优化梯度提升回归(GBRT)模型,以此来预测液压支架阻力。在GBRT中加入学习速率来限制子模型的学习速率,防止其过拟合;应用LR对样本参数进行优化,建立LR-GBRT回归预测模型;将该预测模型应用于液压支架阻力的预测,预测结果与LR(线性回归模型)、SVM(支持向量机模型)、DTR(决策树回归模型)、EN(弹性网回归模型)进行对比分析。结果表明:LR-GBRT模型具有较强的泛化能力,较高的预测精度,可以对液压支架阻力进行有效预测。 展开更多
关键词 梯度提升回归算法 逻辑斯谛算法 工作面液压支架阻力 预测 学习速率
下载PDF
Predicted Oil Recovery Scaling-Law Using Stochastic Gradient Boosting Regression Model
10
作者 Mohamed F.El-Amin Abdulhamit Subasi +1 位作者 Mahmoud M.Selim Awad Mousa 《Computers, Materials & Continua》 SCIE EI 2021年第8期2349-2362,共14页
In the process of oil recovery,experiments are usually carried out on core samples to evaluate the recovery of oil,so the numerical data are fitted into a non-dimensional equation called scaling-law.This will be essen... In the process of oil recovery,experiments are usually carried out on core samples to evaluate the recovery of oil,so the numerical data are fitted into a non-dimensional equation called scaling-law.This will be essential for determining the behavior of actual reservoirs.The global non-dimensional time-scale is a parameter for predicting a realistic behavior in the oil field from laboratory data.This non-dimensional universal time parameter depends on a set of primary parameters that inherit the properties of the reservoir fluids and rocks and the injection velocity,which dynamics of the process.One of the practical machine learning(ML)techniques for regression/classification problems is gradient boosting(GB)regression.The GB produces a prediction model as an ensemble of weak prediction models that can be done at each iteration by matching a least-squares base-learner with the current pseudoresiduals.Using a randomization process increases the execution speed and accuracy of GB.Hence in this study,we developed a stochastic regression model of gradient boosting(SGB)to forecast oil recovery.Different nondimensional time-scales have been used to generate data to be used with machine learning techniques.The SGB method has been found to be the best machine learning technique for predicting the non-dimensional time-scale,which depends on oil/rock properties. 展开更多
关键词 Machine learning stochastic gradient boosting linear regression TIME-SCALE oil recovery
下载PDF
Prediction of COVID-19 Confirmed Cases Using Gradient Boosting Regression Method
11
作者 Abdu Gumaei Mabrook Al-Rakhami +4 位作者 Mohamad Mahmoud Al Rahhal Fahad Raddah H.Albogamy Eslam Al Maghayreh Hussain AlSalman 《Computers, Materials & Continua》 SCIE EI 2021年第1期315-329,共15页
The fast spread of coronavirus disease(COVID-19)caused by SARSCoV-2 has become a pandemic and a serious threat to the world.As of May 30,2020,this disease had infected more than 6 million people globally,with hundreds... The fast spread of coronavirus disease(COVID-19)caused by SARSCoV-2 has become a pandemic and a serious threat to the world.As of May 30,2020,this disease had infected more than 6 million people globally,with hundreds of thousands of deaths.Therefore,there is an urgent need to predict confirmed cases so as to analyze the impact of COVID-19 and practice readiness in healthcare systems.This study uses gradient boosting regression(GBR)to build a trained model to predict the daily total confirmed cases of COVID-19.The GBR method can minimize the loss function of the training process and create a single strong learner from weak learners.Experiments are conducted on a dataset of daily confirmed COVID-19 cases from January 22,2020,to May 30,2020.The results are evaluated on a set of evaluation performance measures using 10-fold cross-validation to demonstrate the effectiveness of the GBR method.The results reveal that the GBR model achieves 0.00686 root mean square error,the lowest among several comparative models. 展开更多
关键词 COVID-19 coronavirus disease SARS-CoV-2 machine learning gradient boosting regression(GBR)method
下载PDF
Grasshopper KUWAHARA and Gradient Boosting Tree for Optimal Features Classifications
12
作者 Rabab Hamed M.Aly Aziza I.Hussein Kamel H.Rahouma 《Computers, Materials & Continua》 SCIE EI 2022年第8期3985-3997,共13页
This paper aims to design an optimizer followed by a Kawahara filter for optimal classification and prediction of employees’performance.The algorithm starts by processing data by a modified K-means technique as a hie... This paper aims to design an optimizer followed by a Kawahara filter for optimal classification and prediction of employees’performance.The algorithm starts by processing data by a modified K-means technique as a hierarchical clustering method to quickly obtain the best features of employees to reach their best performance.The work of this paper consists of two parts.The first part is based on collecting data of employees to calculate and illustrate the performance of each employee.The second part is based on the classification and prediction techniques of the employee performance.This model is designed to help companies in their decisions about the employees’performance.The classification and prediction algorithms use the Gradient Boosting Tree classifier to classify and predict the features.Results of the paper give the percentage of employees which are expected to leave the company after predicting their performance for the coming years.Results also show that the Grasshopper Optimization,followed by“KF”with the Gradient Boosting Tree as classifier and predictor,is characterized by a high accuracy.The proposed algorithm is compared with other known techniques where our results are fund to be superior. 展开更多
关键词 Metaheuristic algorithm KUWAHARA filter Grasshopper optimization algorithm and gradient boosting tree
下载PDF
Coal Rock Condition Detection Model Using Acoustic Emission and Light Gradient Boosting Machine
13
作者 Jing Li Yong Yang +2 位作者 Hongmei Ge Li Zhao Ruxue Guo 《Computers, Materials & Continua》 SCIE EI 2020年第4期151-162,共12页
Coal rock mass instability fracture may result in serious hazards to underground coal mining.Acoustic emissions(AE)stimulated by internal structure fracture should carry lots of favorable information about health cond... Coal rock mass instability fracture may result in serious hazards to underground coal mining.Acoustic emissions(AE)stimulated by internal structure fracture should carry lots of favorable information about health condition of rock mass.AE as a sensitive non-destructive test method is gradually utilized to detect anomaly conditions of coal rock.This paper proposes an improved multi-resolution feature to extract AE waveform at different frequency resolutions using Coilflet Wavelet Transform method(CWT).It is further adopt an efficient Light Gradient Boosting Machine(LightGBM)by several cascaded sub weak classifier models to merge AE features at different views of frequency for coal rock anomaly damage recognition.The results denote that the proposed method achieves excellent recognition performance on anomaly damage levels of coal rock.It is an effective method to detect the critical stability further to predict the rock mass bursting in time. 展开更多
关键词 Acoustic emission light gradient boosting machine coal rock stability
下载PDF
Stochastic Gradient Boosting Model for Twitter Spam Detection
14
作者 K.Kiruthika Devi G.A.Sathish Kumar 《Computer Systems Science & Engineering》 SCIE EI 2022年第5期849-859,共11页
In today’s world of connectivity there is a huge amount of data than we could imagine.The number of network users are increasing day by day and there are large number of social networks which keeps the users connecte... In today’s world of connectivity there is a huge amount of data than we could imagine.The number of network users are increasing day by day and there are large number of social networks which keeps the users connected all the time.These social networks give the complete independence to the user to post the data either political,commercial or entertainment value.Some data may be sensitive and have a greater impact on the society as a result.The trustworthiness of data is important when it comes to public social networking sites like facebook and twitter.Due to the large user base and its openness there is a huge possibility to spread spam messages in this network.Spam detection is a technique to identify and mark data as a false data value.There are lot of machine learning approaches proposed to detect spam in social networks.The efficiency of any spam detection algorithm is determined by its cost factor and accuracy.Aiming to improve the detection of spam in the social networks this study proposes using statistical based features that are modelled through the supervised boosting approach called Stochastic gradient boosting to evaluate the twitter data sets in the English language.The performance of the proposed model is evaluated using simulation results. 展开更多
关键词 TWITTER SPAM stochastic gradient boosting
下载PDF
Flight Delay Prediction Using Gradient Boosting Machine Learning Classifiers
15
作者 Mingdao Lu Peng Wei +1 位作者 Mingshu He Yinglei Teng 《Journal of Quantum Computing》 2021年第1期1-12,共12页
With the increasing of civil aviation business,flight delay has become a key problem in civil aviation field in recent years,which has brought a considerable economic impact to airlines and related industries.The dela... With the increasing of civil aviation business,flight delay has become a key problem in civil aviation field in recent years,which has brought a considerable economic impact to airlines and related industries.The delay prediction of specific flights is very important for airlines’plan,airport resource allocation,insurance company strategy and personal arrangement.The influence factors of flight delay have high complexity and non-linear relationship.The different situations of various regions and airports,and even the deviation of airport or airline arrangement all have certain influence on flight delay,which makes the prediction more difficult.In view of the limitations of the existing delay prediction models,this paper proposes a flight delay prediction model with more generalization ability and corresponding machine learning classification algorithm.This model fully exploits temporal and spatial characteristics of higher dimensions,such as the influence of preceding flights,the situation of departure and landing airports,and the overall situation of flights on the same route.In the process of machine learning,the model is trained with historical data and tested with the latest actual data.The test result shows that the model and this machine learning algorithm can provide an effective method for the prediction of flight delay. 展开更多
关键词 Delay prediction machine learning gradient boosting
下载PDF
基于Gradient Boosting的车载LiDAR点云分类 被引量:5
16
作者 赵刚 杨必胜 《地理信息世界》 2016年第3期47-52,共6页
车载LiDAR点云中包含地面、建筑物、行道树、路灯等丰富地物类别,自动对这些不同类别点云进行分类,对点云中目标的识别、提取及重建都具有重要意义。本文提出了一种基于Gradient Boosting的自动分类方法。该方法首先对车载激光点云进行... 车载LiDAR点云中包含地面、建筑物、行道树、路灯等丰富地物类别,自动对这些不同类别点云进行分类,对点云中目标的识别、提取及重建都具有重要意义。本文提出了一种基于Gradient Boosting的自动分类方法。该方法首先对车载激光点云进行数据预处理,然后计算点云的协方差矩阵、密度比、高程相关特征、局部平面特征、投影特征等,再计算点云特征直方图与垂直分布直方图,采用K-means方法对这两者分别进行聚类,并将其聚类类别值也作为特征,从而构建出20维的点云特征向量,应用Gradient Boosting分类方法进行自动分类。为了验证本文方法的有效性,从某城镇场景的车载激光点云数据中选取部分代表区域共144W点作为训练数据集,然后选取另一较大区域的点云共312W点作为测试数据集。使用训练好的分类器对测试数据集进行分类,分类结果总体准确率达到了93.38%,耗时631s,说明此分类方法具有较高的分类准确率,同时也具备较高的效率。 展开更多
关键词 点云分类 特征向量 特征直方图 聚类 gradient boosting
下载PDF
Mapping landslide susceptibility at the Three Gorges Reservoir, China, using gradient boosting decision tree,random forest and information value models 被引量:7
17
作者 CHEN Tao ZHU Li +3 位作者 NIU Rui-qing TRINDER C John PENG Ling LEI Tao 《Journal of Mountain Science》 SCIE CSCD 2020年第3期670-685,共16页
This work was to generate landslide susceptibility maps for the Three Gorges Reservoir(TGR) area, China by using different machine learning models. Three advanced machine learning methods, namely, gradient boosting de... This work was to generate landslide susceptibility maps for the Three Gorges Reservoir(TGR) area, China by using different machine learning models. Three advanced machine learning methods, namely, gradient boosting decision tree(GBDT), random forest(RF) and information value(InV) models, were used, and the performances were assessed and compared. In total, 202 landslides were mapped by using a series of field surveys, aerial photographs, and reviews of historical and bibliographical data. Nine causative factors were then considered in landslide susceptibility map generation by using the GBDT, RF and InV models. All of the maps of the causative factors were resampled to a resolution of 28.5 m. Of the 486289 pixels in the area,28526 pixels were landslide pixels, and 457763 pixels were non-landslide pixels. Finally, landslide susceptibility maps were generated by using the three machine learning models, and their performances were assessed through receiver operating characteristic(ROC) curves, the sensitivity, specificity,overall accuracy(OA), and kappa coefficient(KAPPA). The results showed that the GBDT, RF and In V models in overall produced reasonable accurate landslide susceptibility maps. Among these three methods, the GBDT method outperforms the other two machine learning methods, which can provide strong technical support for producing landslide susceptibility maps in TGR. 展开更多
关键词 MAPPING LANDSLIDE SUSCEPTIBILITY gradient boosting decision tree Random forest Information value model Three Gorges RESERVOIR
原文传递
基于Gradient Boosting算法的小企业信用风险评估 被引量:2
18
作者 杨俊 夏晨琦 《浙江金融》 2017年第9期44-50,共7页
信用风险是导致银行破产的主要原因之一。传统上基于专家规则的信用风险评分模型虽然具有较好的业务解释性,但对建模人员的业务经验和理论水平有较高要求,也无法挖掘变量之间复杂的相关关系从而实现完全的数据驱动建模。本文使用Gradien... 信用风险是导致银行破产的主要原因之一。传统上基于专家规则的信用风险评分模型虽然具有较好的业务解释性,但对建模人员的业务经验和理论水平有较高要求,也无法挖掘变量之间复杂的相关关系从而实现完全的数据驱动建模。本文使用Gradient Boosting算法对我行小企业信贷客户数据建模,并和逻辑回归以及专家规则模型进行横向比较和分析。实验结果表明,以违约样本召回率和ROC为模型评估指标,Gradient Boosting算法的模型精度和模型稳定性显著优于另外两种模型,另外,Gradient Boosting和逻辑回归两种基于机器学习的模型表现要明显好于专家规则模型。 展开更多
关键词 信用风险 信用评分 梯度提升 逻辑回归 专家规则
原文传递
Semantic Based Greedy Levy Gradient Boosting Algorithm for Phishing Detection
19
作者 R.Sakunthala Jenni S.Shankar 《Computer Systems Science & Engineering》 SCIE EI 2022年第5期525-538,共14页
The detection of phishing and legitimate websites is considered a great challenge for web service providers because the users of such websites are indistinguishable.Phishing websites also create traffic in the entire ... The detection of phishing and legitimate websites is considered a great challenge for web service providers because the users of such websites are indistinguishable.Phishing websites also create traffic in the entire network.Another phishing issue is the broadening malware of the entire network,thus highlighting the demand for their detection while massive datasets(i.e.,big data)are processed.Despite the application of boosting mechanisms in phishing detection,these methods are prone to significant errors in their output,specifically due to the combination of all website features in the training state.The upcoming big data system requires MapReduce,a popular parallel programming,to process massive datasets.To address these issues,a probabilistic latent semantic and greedy levy gradient boosting(PLS-GLGB)algorithm for website phishing detection using MapReduce is proposed.A feature selection-based model is provided using a probabilistic intersective latent semantic preprocessing model to minimize errors in website phishing detection.Here,the missing data in each URL are identified and discarded for further processing to ensure data quality.Subsequently,with the preprocessed features(URLs),feature vectors are updated by the greedy levy divergence gradient(model)that selects the optimal features in the URL and accurately detects the websites.Thus,greedy levy efficiently differentiates between phishing websites and legitimate websites.Experiments are conducted using one of the largest public corpora of a website phish tank dataset.Results show that the PLS-GLGB algorithm for website phishing detection outperforms stateof-the-art phishing detection methods.Significant amounts of phishing detection time and errors are also saved during the detection of website phishing. 展开更多
关键词 Web service providers probabilistic intersective latent semantic greedy levy DIVERGENCE gradient phishing detection big data
下载PDF
Rapid Prediction Model for Urban Floods Based on a Light Gradient Boosting Machine Approach and Hydrological–Hydraulic Model
20
作者 Kui Xu Zhentao Han +1 位作者 Hongshi Xu Lingling Bin 《International Journal of Disaster Risk Science》 SCIE CSCD 2023年第1期79-97,共19页
Global climate change and sea level rise have led to increased losses from flooding.Accurate prediction of floods is essential to mitigating flood losses in coastal cities.Physically based models cannot satisfy the de... Global climate change and sea level rise have led to increased losses from flooding.Accurate prediction of floods is essential to mitigating flood losses in coastal cities.Physically based models cannot satisfy the demand for real-time prediction for urban flooding due to their computational complexity.In this study,we proposed a hybrid modeling approach for rapid prediction of urban floods,coupling the physically based model with the light gradient boosting machine(LightGBM)model.A hydrological–hydraulic model was used to provide sufficient data for the LightGBM model based on the personal computer storm water management model(PCSWMM).The variables related to rainfall,tide level,and the location of flood points were used as the input for the LightGBM model.To improve the prediction accuracy,the hyperparameters of the LightGBM model are optimized by grid search algorithm and K-fold cross-validation.Taking Haidian Island,Hainan Province,China as a case study,the optimum values of the learning rate,number of estimators,and number of leaves of the LightGBM model are 0.11,450,and 12,respectively.The Nash-Sutcliffe efficiency coefficient(NSE)of the LightGBM model on the test set is 0.9896,indicating that the LightGBM model has reliable predictions and outperforms random forest(RF),extreme gradient boosting(XGBoost),and k-nearest neighbor(KNN).From the LightGBM model,the variables related to tide level were analyzed as the dominant variables for predicting the inundation depth based on the Gini index in the study area.The proposed LightGBM model provides a scientific reference for flood control in coastal cities considering its superior performance and computational efficiency. 展开更多
关键词 China Flood prediction HAINAN Hydrological-hydraulic model Light gradient boosting machine Urban floods
原文传递
上一页 1 2 250 下一页 到第
使用帮助 返回顶部