期刊文献+
共找到1,728篇文章
< 1 2 87 >
每页显示 20 50 100
An Adaptive Sequential Replacement Method for Variable Selection in Linear Regression Analysis
1
作者 Jixiang Wu Johnie N. Jenkins Jack C. McCarty Jr. 《Open Journal of Statistics》 2023年第5期746-760,共15页
With the rapid development of DNA technologies, high throughput genomic data have become a powerful leverage to locate desirable genetic loci associated with traits of importance in various crop species. However, curr... With the rapid development of DNA technologies, high throughput genomic data have become a powerful leverage to locate desirable genetic loci associated with traits of importance in various crop species. However, current genetic association mapping analyses are focused on identifying individual QTLs. This study aimed to identify a set of QTLs or genetic markers, which can capture genetic variability for marker-assisted selection. Selecting a set with k loci that can maximize genetic variation out of high throughput genomic data is a challenging issue. In this study, we proposed an adaptive sequential replacement (ASR) method, which is considered a variant of the sequential replacement (SR) method. Through Monte Carlo simulation and comparing with four other selection methods: exhaustive, SR method, forward, and backward methods we found that the ASR method sustains consistent and repeatable results comparable to the exhaustive method with much reduced computational intensity. 展开更多
关键词 Adaptive Sequential Replacement Association Mapping Exhaustive Method Global Optimal Solution Sequential Replacement variable selection
下载PDF
Modeling method for SSC prediction in pomelo using Vis-NIRS with wavelength selection and latent variable updating
2
作者 Hao Tian Shuai Wang +1 位作者 Huirong Xu Yibin Ying 《International Journal of Agricultural and Biological Engineering》 SCIE 2024年第1期251-260,共10页
The aim of this study was in-line,rapid,and non-destructive detection for soluble solid content(SSC)in pomelos using visible and near-infrared spectroscopy(Vis-NIRS).However,the large size and thick rind of pomelo aff... The aim of this study was in-line,rapid,and non-destructive detection for soluble solid content(SSC)in pomelos using visible and near-infrared spectroscopy(Vis-NIRS).However,the large size and thick rind of pomelo affect the stability of spectral acquisition and the biological variabilities affect the robustness of models.Given these issues,in this study,an efficient prototype in-line detection system in transmittance mode was designed and evaluated in comparison with an off-line detection system.Data from the years 2019 and 2020 were used for modeling and the external validation data were obtained by the inline detection system in 2021.The wavelength selection methods of changeable size moving window(CSMW),random frog(RF),and competitive adaptive reweighted sampling(CARS)were used to improve the prediction accuracy of partial least squares regression(PLSR)models.The best performance of internal prediction was obtained by CARS-PLSR and the determination coefficient of prediction(),root mean square error of prediction(RMSEP),and residual predictive deviation(RPD)were 0.958,0.204%,and 4.821,respectively.However,all models obtained large prediction biases in external validation.The latent variable updating(LVU)method was proposed to update models and improve the performance in external validation.Ten samples from the external validation set were randomly selected to update the models.Compared with the recalibration method,LVU could effectively modify the original models which matched the SSC range of the external validation set.The CSMW-PLSR models were more robust in external validations.The off-line model with LVU performed best with a root mean square error of validation(RMSEV)of 0.599%and the in-line model with recalibration obtained RMSEV of 0.864%.These results demonstrated the application potential of the transmittance Vis-NIRS for in-line rapid prediction of SSC in pomelos and the modeling and updating methods could be applied to samples with biological variabilities. 展开更多
关键词 Vis-NIRS in-line detection external validation wavelength selection model updating POMELO SSC
原文传递
Variable selection-based SPC procedures for high-dimensional multistage processes 被引量:2
3
作者 KIM Sangahn 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2019年第1期144-153,共10页
Monitoring high-dimensional multistage processes becomes crucial to ensure the quality of the final product in modern industry environments. Few statistical process monitoring(SPC) approaches for monitoring and contro... Monitoring high-dimensional multistage processes becomes crucial to ensure the quality of the final product in modern industry environments. Few statistical process monitoring(SPC) approaches for monitoring and controlling quality in highdimensional multistage processes are studied. We propose a deviance residual-based multivariate exponentially weighted moving average(MEWMA) control chart with a variable selection procedure. We demonstrate that it outperforms the existing multivariate SPC charts in terms of out-of-control average run length(ARL) for the detection of process mean shift. 展开更多
关键词 diagnosis procedure deviance RESIDUAL fault identification MODEL-BASED control CHART MULTISTAGE process monitoring variable selection.
下载PDF
Quantitative analysis of the content of nitrogen and sulfur in coal based on laserinduced breakdown spectroscopy: effects of variable selection 被引量:3
4
作者 邓凡 丁宇 +2 位作者 陈雨娟 朱绍农 陈非凡 《Plasma Science and Technology》 SCIE EI CAS CSCD 2020年第7期36-43,共8页
Coal is a crucial fossil energy in today’s society,and the detection of sulfir(S) and nitrogen(N)in coal is essential for the evaluation of coal quality.Therefore,an efficient method is needed to quantitatively analy... Coal is a crucial fossil energy in today’s society,and the detection of sulfir(S) and nitrogen(N)in coal is essential for the evaluation of coal quality.Therefore,an efficient method is needed to quantitatively analyze N and S content in coal,to achieve the purpose of clean utilization of coal.This study applied laser-induced breakdown spectroscopy(LIBS) to test coal quality,and combined two variable selection algorithms,competitive adaptive reweighted sampling(CARS) and the successive projections algorithm(SPA),to establish the corresponding partial least square(PLS) model.The results of the experiment were as follows.The PLS modeled with the full spectrum of 27,620 variables has poor accuracy,the coefficient of determination of the test set(R^2 P) and root mean square error of the test set(RMSEP) of nitrogen were 0.5172 and 0.2263,respectively,and those of sulfur were0.5784 and 0.5811,respectively.The CARS-PLS screened 37 and 25 variables respectively in the detection of N and S elements,but the prediction ability of the model did not improve significantly.SPA-PLS finally screened 14 and 11 variables respectively through successive projections,and obtained the best prediction effect among the three methods.The R^2 P and RMSEP of nitrogen were0.9873 and 0.0208,respectively,and those of sulfur were 0.9451 and 0.2082,respectively.In general,the predictive results of the two elements increased by about 90% for RMSEP and 60% for R2 P compared with PLS.The results show that LIBS combined with SPA-PLS has good potential for detecting N and S content in coal,and is a very promising technology for industrial application. 展开更多
关键词 variable selection LIBS COAL CARS and SPA
下载PDF
Cross-Validation, Shrinkage and Variable Selection in Linear Regression Revisited 被引量:3
5
作者 Hans C. van Houwelingen Willi Sauerbrei 《Open Journal of Statistics》 2013年第2期79-102,共24页
In deriving a regression model analysts often have to use variable selection, despite of problems introduced by data- dependent model building. Resampling approaches are proposed to handle some of the critical issues.... In deriving a regression model analysts often have to use variable selection, despite of problems introduced by data- dependent model building. Resampling approaches are proposed to handle some of the critical issues. In order to assess and compare several strategies, we will conduct a simulation study with 15 predictors and a complex correlation structure in the linear regression model. Using sample sizes of 100 and 400 and estimates of the residual variance corresponding to R2 of 0.50 and 0.71, we consider 4 scenarios with varying amount of information. We also consider two examples with 24 and 13 predictors, respectively. We will discuss the value of cross-validation, shrinkage and backward elimination (BE) with varying significance level. We will assess whether 2-step approaches using global or parameterwise shrinkage (PWSF) can improve selected models and will compare results to models derived with the LASSO procedure. Beside of MSE we will use model sparsity and further criteria for model assessment. The amount of information in the data has an influence on the selected models and the comparison of the procedures. None of the approaches was best in all scenarios. The performance of backward elimination with a suitably chosen significance level was not worse compared to the LASSO and BE models selected were much sparser, an important advantage for interpretation and transportability. Compared to global shrinkage, PWSF had better performance. Provided that the amount of information is not too small, we conclude that BE followed by PWSF is a suitable approach when variable selection is a key part of data analysis. 展开更多
关键词 Cross-Validation LASSO SHRINKAGE SIMULATION STUDY variable selection
下载PDF
Logistic and SVM Credit Score Models Based on Lasso Variable Selection 被引量:2
6
作者 Qingqing Li 《Journal of Applied Mathematics and Physics》 2019年第5期1131-1148,共18页
There are many factors influencing personal credit. We introduce Lasso technique to personal credit evaluation, and establish Lasso-logistic, Lasso-SVM and Group lasso-logistic models respectively. Variable selection ... There are many factors influencing personal credit. We introduce Lasso technique to personal credit evaluation, and establish Lasso-logistic, Lasso-SVM and Group lasso-logistic models respectively. Variable selection and parameter estimation are also conducted simultaneously. Based on the personal credit data set from a certain lending platform, it can be concluded through experiments that compared with the full-variable Logistic model and the stepwise Logistic model, the variable selection ability of Group lasso-logistic model was the strongest, followed by Lasso-logistic and Lasso-SVM respectively. All three models based on Lasso variable selection have better filtering capability than stepwise selection. In the meantime, the Group lasso-logistic model can eliminate or retain relevant virtual variables as a group to facilitate model interpretation. In terms of prediction accuracy, Lasso-SVM had the highest prediction accuracy for default users in the training set, while in the test set, Group lasso-logistic had the best classification accuracy for default users. Whether in the training set or in the test set, the Lasso-logistic model has the best classification accuracy for non-default users. The model based on Lasso variable selection can also better screen out the key factors influencing personal credit risk. 展开更多
关键词 CREDIT Evaluation LOGISTIC ALGORITHM SVM ALGORITHM Lasso variable selection
下载PDF
Joint Variable Selection of Mean-Covariance Model for Longitudinal Data 被引量:2
7
作者 Dengke Xu Zhongzhan Zhang Liucang Wu 《Open Journal of Statistics》 2013年第1期27-35,共9页
In this paper we reparameterize covariance structures in longitudinal data analysis through the modified Cholesky decomposition of itself. Based on this modified Cholesky decomposition, the within-subject covariance m... In this paper we reparameterize covariance structures in longitudinal data analysis through the modified Cholesky decomposition of itself. Based on this modified Cholesky decomposition, the within-subject covariance matrix is decomposed into a unit lower triangular matrix involving moving average coefficients and a diagonal matrix involving innovation variances, which are modeled as linear functions of covariates. Then, we propose a penalized maximum likelihood method for variable selection in joint mean and covariance models based on this decomposition. Under certain regularity conditions, we establish the consistency and asymptotic normality of the penalized maximum likelihood estimators of parameters in the models. Simulation studies are undertaken to assess the finite sample performance of the proposed variable selection procedure. 展开更多
关键词 JOINT Mean and COVARIANCE Models variable selection Cholesky DECOMPOSITION Longitudinal Data Penalized MAXIMUM LIKELIHOOD Method
下载PDF
Variable Selection of Partially Linear Single-index Models 被引量:1
8
作者 L U Yi-qiang HU Bin 《Chinese Quarterly Journal of Mathematics》 CSCD 2014年第3期392-399,共8页
In this article, we study the variable selection of partially linear single-index model(PLSIM). Based on the minimized average variance estimation, the variable selection of PLSIM is done by minimizing average varianc... In this article, we study the variable selection of partially linear single-index model(PLSIM). Based on the minimized average variance estimation, the variable selection of PLSIM is done by minimizing average variance with adaptive l1 penalty. Implementation algorithm is given. Under some regular conditions, we demonstrate the oracle properties of aLASSO procedure for PLSIM. Simulations are used to investigate the effectiveness of the proposed method for variable selection of PLSIM. 展开更多
关键词 variable selection adaptive LASSO minimized average variance estimation(MAVE) partially linear single-index model
下载PDF
Automatic Variable Selection for High-Dimensional Linear Models with Longitudinal Data 被引量:1
9
作者 Ruiqin Tian Liugen Xue 《Open Journal of Statistics》 2014年第1期38-48,共11页
High-dimensional longitudinal data arise frequently in biomedical and genomic research. It is important to select relevant covariates when the dimension of the parameters diverges as the sample size increases. We cons... High-dimensional longitudinal data arise frequently in biomedical and genomic research. It is important to select relevant covariates when the dimension of the parameters diverges as the sample size increases. We consider the problem of variable selection in high-dimensional linear models with longitudinal data. A new variable selection procedure is proposed using the smooth-threshold generalized estimating equation and quadratic inference functions (SGEE-QIF) to incorporate correlation information. The proposed procedure automatically eliminates inactive predictors by setting the corresponding parameters to be zero, and simultaneously estimates the nonzero regression coefficients by solving the SGEE-QIF. The proposed procedure avoids the convex optimization problem and is flexible and easy to implement. We establish the asymptotic properties in a high-dimensional framework where the number of covariates increases as the number of cluster increases. Extensive Monte Carlo simulation studies are conducted to examine the finite sample performance of the proposed variable selection procedure. 展开更多
关键词 variable selection Diverging Number of Parameters Longitudinal Data QUADRATIC INFERENCE FUNCTIONS GENERALIZED ESTIMATING Equation
下载PDF
Fast Variable Selection by Block Addition and Block Deletion 被引量:1
10
作者 Takashi Nagatani Seiichi Ozawa Shigeo Abe 《Journal of Intelligent Learning Systems and Applications》 2010年第4期200-211,共12页
We propose the threshold updating method for terminating variable selection and two variable selection methods. In the threshold updating method, we update the threshold value when the approximation error smaller than... We propose the threshold updating method for terminating variable selection and two variable selection methods. In the threshold updating method, we update the threshold value when the approximation error smaller than the current threshold value is obtained. The first variable selection method is the combination of forward selection by block addi-tion and backward selection by block deletion. In this method, starting from the empty set of the input variables, we add several input variables at a time until the approximation error is below the threshold value. Then we search deletable variables by block deletion. The second method is the combination of the first method and variable selection by Linear Programming Support Vector Regressors (LPSVRs). By training an LPSVR with linear kernels, we evaluate the weights of the decision function and delete the input variables whose associated absolute weights are zero. Then we carry out block addition and block deletion. By computer experiments using benchmark data sets, we show that the proposed methods can perform faster variable selection than the method only using block deletion, and that by the threshold updating method, the approximation error is lower than that by the fixed threshold method. We also compare our method with an imbedded method, which determines the optimal variables during training, and show that our method gives comparable or better variable selection performance. 展开更多
关键词 Backward selection Forward selection Least SQUARES SUPPORT VECTOR MACHINES Linear Programming SUPPORT VECTOR MACHINES SUPPORT VECTOR MACHINES variable selection
下载PDF
Bayesian Variable Selection for Mixture Process Variable Design Experiment 被引量:1
11
作者 Sadiah M. A. Aljeddani 《Open Journal of Modelling and Simulation》 2022年第4期391-416,共26页
This paper discussed Bayesian variable selection methods for models from split-plot mixture designs using samples from Metropolis-Hastings within the Gibbs sampling algorithm. Bayesian variable selection is easy to im... This paper discussed Bayesian variable selection methods for models from split-plot mixture designs using samples from Metropolis-Hastings within the Gibbs sampling algorithm. Bayesian variable selection is easy to implement due to the improvement in computing via MCMC sampling. We described the Bayesian methodology by introducing the Bayesian framework, and explaining Markov Chain Monte Carlo (MCMC) sampling. The Metropolis-Hastings within Gibbs sampling was used to draw dependent samples from the full conditional distributions which were explained. In mixture experiments with process variables, the response depends not only on the proportions of the mixture components but also on the effects of the process variables. In many such mixture-process variable experiments, constraints such as time or cost prohibit the selection of treatments completely at random. In these situations, restrictions on the randomisation force the level combinations of one group of factors to be fixed and the combinations of the other group of factors are run. Then a new level of the first-factor group is set and combinations of the other factors are run. We discussed the computational algorithm for the Stochastic Search Variable Selection (SSVS) in linear mixed models. We extended the computational algorithm of SSVS to fit models from split-plot mixture design by introducing the algorithm of the Stochastic Search Variable Selection for Split-plot Design (SSVS-SPD). The motivation of this extension is that we have two different levels of the experimental units, one for the whole plots and the other for subplots in the split-plot mixture design. 展开更多
关键词 variable selection Bayesian Analysis Mixture Experiment Split-Plot Design
下载PDF
Variable Selection in Randomized Block Design Experiment 被引量:1
12
作者 Sadiah Mohammed Aljeddani 《American Journal of Computational Mathematics》 2022年第2期216-231,共16页
In the experimental field, researchers need very often to select the best subset model as well as reach the best model estimation simultaneously. Selecting the best subset of variables will improve the prediction accu... In the experimental field, researchers need very often to select the best subset model as well as reach the best model estimation simultaneously. Selecting the best subset of variables will improve the prediction accuracy as noninformative variables will be removed. Having a model with high prediction accuracy allows the researchers to use the model for future forecasting. In this paper, we investigate the differences between various variable selection methods. The aim is to compare the analysis of the frequentist methodology (the backward elimination), penalised shrinkage method (the Adaptive LASSO) and the Least Angle Regression (LARS) for selecting the active variables for data produced by the blocked design experiment. The result of the comparative study supports the utilization of the LARS method for statistical analysis of data from blocked experiments. 展开更多
关键词 variable selection Shrinkage Methods Linear Mixed Model Blocked Designs
下载PDF
Spectroscopic Multicomponent Analysis Using Multi-objective Optimization for Variable Selection 被引量:1
13
作者 Anderson da Silva Soares Telma Woerle de Lima +3 位作者 Daniel Vitor de LuPcena Rogerio Lopes Salvini GustavoTeodoro Laureano Clarimar Jose Coelho 《Computer Technology and Application》 2013年第9期466-475,共10页
关键词 近红外光谱 变量选择 多组分分析 多目标优化 偏最小二乘法 预测误差 算法选择 PARETO
下载PDF
Variable Selection for Partially Linear Varying Coefficient Transformation Models with Censored Data
14
作者 Jiang Du Zhongzhan Zhang Ying Lu 《Open Journal of Statistics》 2012年第5期565-570,共6页
In this paper, we study the problem of variable selection for varying coefficient transformation models with censored data. We fit the varying coefficient transformation models by maximizing the marginal likelihood su... In this paper, we study the problem of variable selection for varying coefficient transformation models with censored data. We fit the varying coefficient transformation models by maximizing the marginal likelihood subject to a shrink- age-type penalty, which encourages sparse solutions and hence facilitates the process of variable selection. We further provide an efficient computation algorithm to implement the proposed methods. A simulation study is conducted to evaluate the performance of the proposed methods and a real dataset is analyzed as an illustration. 展开更多
关键词 variable selection Maximum LIKELIHOOD Estimation SPLINE SMOOTHING
下载PDF
Automatic Variable Selection for Single-Index Random Effects Models with Longitudinal Data
15
作者 Suigen Yang Liugen Xue 《Open Journal of Statistics》 2014年第3期230-237,共8页
We consider the problem of variable selection for the single-index random effects models with longitudinal data. An automatic variable selection procedure is developed using smooth-threshold. The proposed method share... We consider the problem of variable selection for the single-index random effects models with longitudinal data. An automatic variable selection procedure is developed using smooth-threshold. The proposed method shares some of the desired features of existing variable selection methods: the resulting estimator enjoys the oracle property;the proposed procedure avoids the convex optimization problem and is flexible and easy to implement. Moreover, we use the penalized weighted deviance criterion for a data-driven choice of the tuning parameters. Simulation studies are carried out to assess the performance of our method, and a real dataset is analyzed for further illustration. 展开更多
关键词 variable selection Single-Index MODEL RANDOM Effects Longitudinal DATA
下载PDF
Group Variable Selection via a Combination of <i>L</i><sub>q</sub>Norm and Correlation-Based Penalty
16
作者 Ning Mao Wanzhou Ye 《Advances in Pure Mathematics》 2017年第1期51-65,共15页
Considering the problem of feature selection in linear regression model, a new method called LqCP is proposed simultaneously to select variables and favor a grouping effect, where strongly correlated predictors tend t... Considering the problem of feature selection in linear regression model, a new method called LqCP is proposed simultaneously to select variables and favor a grouping effect, where strongly correlated predictors tend to be in or out of the model together. LqCP is based on penalized least squares with a penalty function that combines the Lq (0n. In addition, a simulation about grouped variable selection is performed. Finally, The model is applied to two real data: US Crime Data and Gasoline Data. In terms of prediction error and estimation error, empirical studies show the efficiency of LqCP. 展开更多
关键词 Linear Regression variable selection ELASTIC NET Adaptive ELASTIC NET
下载PDF
Variable Selection via Biased Estimators in the Linear Regression Model
17
作者 Manickavasagar Kayanan Pushpakanthie Wijekoon 《Open Journal of Statistics》 2020年第1期113-126,共14页
Least Absolute Shrinkage and Selection Operator (LASSO) is used for variable selection as well as for handling the multicollinearity problem simultaneously in the linear regression model. LASSO produces estimates havi... Least Absolute Shrinkage and Selection Operator (LASSO) is used for variable selection as well as for handling the multicollinearity problem simultaneously in the linear regression model. LASSO produces estimates having high variance if the number of predictors is higher than the number of observations and if high multicollinearity exists among the predictor variables. To handle this problem, Elastic Net (ENet) estimator was introduced by combining LASSO and Ridge estimator (RE). The solutions of LASSO and ENet have been obtained using Least Angle Regression (LARS) and LARS-EN algorithms, respectively. In this article, we proposed an alternative algorithm to overcome the issues in LASSO that can be combined LASSO with other exiting biased estimators namely Almost Unbiased Ridge Estimator (AURE), Liu Estimator (LE), Almost Unbiased Liu Estimator (AULE), Principal Component Regression Estimator (PCRE), r-k class estimator and r-d class estimator. Further, we examine the performance of the proposed algorithm using a Monte-Carlo simulation study and real-world examples. The results showed that the LARS-rk and LARS-rd algorithms,?which are combined LASSO with r-k class estimator and r-d class estimator,?outperformed other algorithms under the moderated and severe multicollinearity. 展开更多
关键词 variable selection Least ABSOLUTE SHRINKAGE and selection OPERATOR (LASSO) Least Angle Regression (LARS) Elastic Net (ENet) Biased ESTIMATORS
下载PDF
Semi entropy of uncertain random variables and its application to portfolio selection
18
作者 GAO Jin-wu Hamed Ahmadzade Mehran Farahikia 《Applied Mathematics(A Journal of Chinese Universities)》 SCIE CSCD 2022年第3期383-395,共13页
Semi entropy is a measure to characterize the indeterminacy of the uncertain random variable considering the values of the uncertain random variable which are lower than the mean.As important roles of semi entropy in ... Semi entropy is a measure to characterize the indeterminacy of the uncertain random variable considering the values of the uncertain random variable which are lower than the mean.As important roles of semi entropy in finance,this paper presents the concept of semi entropy for uncertain random variables.In order to compute semi entropy for uncertain random variables,Monte-Carlo approach is provided.As an application of semi entropy,portfolio selection problems are optimized based on mean-semi entropy mode. 展开更多
关键词 chance theory uncertain random variable semi entropy portfolio selection Monte-Carlo simulation
下载PDF
Clustering of the Values of a Response Variable and Simultaneous Covariate Selection Using a Stepwise Algorithm
19
作者 Olivier Collignon Jean-Marie Monnez 《Applied Mathematics》 2016年第15期1639-1648,共10页
In supervised learning the number of values of a response variable can be very high. Grouping these values in a few clusters can be useful to perform accurate supervised classification analyses. On the other hand sele... In supervised learning the number of values of a response variable can be very high. Grouping these values in a few clusters can be useful to perform accurate supervised classification analyses. On the other hand selecting relevant covariates is a crucial step to build robust and efficient prediction models. We propose in this paper an algorithm that simultaneously groups the values of a response variable into a limited number of clusters and selects stepwise the best covariates that discriminate this clustering. These objectives are achieved by alternate optimization of a user-defined model selection criterion. This process extends a former version of the algorithm to a more general framework. Moreover possible further developments are discussed in detail. 展开更多
关键词 Classification variable selection Supervised Learning Akaike Information Criterion Wilks’ Lambda
下载PDF
上一页 1 2 87 下一页 到第
使用帮助 返回顶部