The development of defect prediction plays a significant role in improving software quality. Such predictions are used to identify defective modules before the testing and to minimize the time and cost. The software w...The development of defect prediction plays a significant role in improving software quality. Such predictions are used to identify defective modules before the testing and to minimize the time and cost. The software with defects negatively impacts operational costs and finally affects customer satisfaction. Numerous approaches exist to predict software defects. However, the timely and accurate software bugs are the major challenging issues. To improve the timely and accurate software defect prediction, a novel technique called Nonparametric Statistical feature scaled QuAdratic regressive convolution Deep nEural Network (SQADEN) is introduced. The proposed SQADEN technique mainly includes two major processes namely metric or feature selection and classification. First, the SQADEN uses the nonparametric statistical Torgerson–Gower scaling technique for identifying the relevant software metrics by measuring the similarity using the dice coefficient. The feature selection process is used to minimize the time complexity of software fault prediction. With the selected metrics, software fault perdition with the help of the Quadratic Censored regressive convolution deep neural network-based classification. The deep learning classifier analyzes the training and testing samples using the contingency correlation coefficient. The softstep activation function is used to provide the final fault prediction results. To minimize the error, the Nelder–Mead method is applied to solve non-linear least-squares problems. Finally, accurate classification results with a minimum error are obtained at the output layer. Experimental evaluation is carried out with different quantitative metrics such as accuracy, precision, recall, F-measure, and time complexity. The analyzed results demonstrate the superior performance of our proposed SQADEN technique with maximum accuracy, sensitivity and specificity by 3%, 3%, 2% and 3% and minimum time and space by 13% and 15% when compared with the two state-of-the-art methods.展开更多
Dispersion and attenuation analysis can be used to determine formation anisotropy induced by fractures,or stresses.In this paper,we propose a nonparametric spectrum estimation method to get phase dispersion characteri...Dispersion and attenuation analysis can be used to determine formation anisotropy induced by fractures,or stresses.In this paper,we propose a nonparametric spectrum estimation method to get phase dispersion characteristics and attenuation coefficient.By designing an appropriate vector filter,phase velocity,attenuation coefficient and amplitude can be inverted from the waveform recorded by the receiver array.Performance analysis of this algorithm is compared with Extended Prony Method(EPM)and Forward and Backward Matrix Pencil(FBMP)method.Based on the analysis results,the proposed method is capable of achieving high resolution and precision as the parametric spectrum estimation methods.At the meantime,it also keeps high stability as the other nonparametric spectrum estimation methods.At last,applications to synthetic waveforms modeled using finite difference method and real data show its efficiency.The real data processing results show that the P-wave attenuation log is more sensitive to oil formation compared to S-wave;and the S-wave attenuation log is more sensitive to shale formation compared to P-wave.展开更多
Choosing appropriate statistical tests is crucial but deciding which tests to use can be challenging. Different tests suit different types of data and research questions, so it is important to choose the right one. Kn...Choosing appropriate statistical tests is crucial but deciding which tests to use can be challenging. Different tests suit different types of data and research questions, so it is important to choose the right one. Knowing how to select an appropriate test can lead to more accurate results. Invalid results and misleading conclusions may be drawn from a study if an incorrect statistical test is used. Therefore, to avoid these it is essential to understand the nature of the data, the research question, and the assumptions of the tests before selecting one. This is because there are a wide variety of tests available. This paper provides a step-by-step approach to selecting the right statistical test for any study, with an explanation of when it is appropriate to use it and relevant examples of each statistical test. Furthermore, this guide provides a comprehensive overview of the assumptions of each test and what to do if these assumptions are violated.展开更多
Choosing appropriate statistical tests is crucial but deciding which tests to use can be challenging. Different tests suit different types of data and research questions, so it is important to choose the right one. Kn...Choosing appropriate statistical tests is crucial but deciding which tests to use can be challenging. Different tests suit different types of data and research questions, so it is important to choose the right one. Knowing how to select an appropriate test can lead to more accurate results. Invalid results and misleading conclusions may be drawn from a study if an incorrect statistical test is used. Therefore, to avoid these it is essential to understand the nature of the data, the research question, and the assumptions of the tests before selecting one. This is because there are a wide variety of tests available. This paper provides a step-by-step approach to selecting the right statistical test for any study, with an explanation of when it is appropriate to use it and relevant examples of each statistical test. Furthermore, this guide provides a comprehensive overview of the assumptions of each test and what to do if these assumptions are violated.展开更多
In this paper, we study the strong consistency for partitioning estimation of regression function under samples that axe φ-mixing sequences with identically distribution.Key words: nonparametric regression function; ...In this paper, we study the strong consistency for partitioning estimation of regression function under samples that axe φ-mixing sequences with identically distribution.Key words: nonparametric regression function; partitioning estimation; strong convergence;φ-mixing sequences.展开更多
This paper studies evolutionary mechanism of parameter selection in the construction of weightfunction for Nearest Neighbour Estimate in nonparametrlc regresslon. Construct an algorithm which adagetively evolves fine ...This paper studies evolutionary mechanism of parameter selection in the construction of weightfunction for Nearest Neighbour Estimate in nonparametrlc regresslon. Construct an algorithm which adagetively evolves fine weight and makes good prediction about unknown points. The numerical experiments indicate that this method is errective. It is a meaningful discussion about practicability of nonparametric regression and methodology or adaptive model-building.展开更多
This poaper is devoted to the performance evaluation of the Generalized Sigu(GS). Trimmed Generalized Sign(TGS), Modified Savage(MS). Mann-Whitney(MW) and a new proposed detector in multiple target situations. The ana...This poaper is devoted to the performance evaluation of the Generalized Sigu(GS). Trimmed Generalized Sign(TGS), Modified Savage(MS). Mann-Whitney(MW) and a new proposed detector in multiple target situations. The analysis is carried out for both fluctuating and nonfluctuating received signals. The simulation results show that the new proposed detector has the best detection performance in homogeneous as well as nonhomogeneous background conditions, while TGS procedure is better than the GS detector in distinguishing the primary target from the secondary interfering ones.展开更多
Higher-order almost cyclostationary complex processes are complex random signals with almost periodically time-varying statistics, which is important to the research of non-Gaussian signals in information system. In t...Higher-order almost cyclostationary complex processes are complex random signals with almost periodically time-varying statistics, which is important to the research of non-Gaussian signals in information system. In tins paper, smoothed polyperiodograms are proposed for related to cyclic polyspectral estimation and are shown to be consistent and asymptotically complex normal. Asymptotic covariance expressions are derived along with their computable forms.展开更多
Nonparametric method based on the mutual information is an efficient technique for the image segmentation. In this method, the image is divided into the internal and external labeled regions, and the segmentation prob...Nonparametric method based on the mutual information is an efficient technique for the image segmentation. In this method, the image is divided into the internal and external labeled regions, and the segmentation problem constrained by the total length of the region boundaries will be changed into the maximization of the mutual information between the region labels and the image pixel intensities. The maximization problem can be solved by deriving the associated gradient flows and the curve evolutions. One of the advantages for this method does not need to choose the segmentation parameter;another is not sensitive to the noise. By employing the narrowband levelset and Fast Gauss Transformation, the computation time is reduced clearly and the algorithm efficiency is greatly improved.展开更多
Atrial fibrillation (Afib) is related with heart failure, stroke, and high mortality rates. In frequency domain analysis, pre-requisite for Afib detection has been the estimation of reliable dominant frequency (DF) of...Atrial fibrillation (Afib) is related with heart failure, stroke, and high mortality rates. In frequency domain analysis, pre-requisite for Afib detection has been the estimation of reliable dominant frequency (DF) of atrial signals via different spectral estimation techniques. DF further characterizes Afib, and helps in its treatment. This paper aims at finding the most appropriate nonparametric FFT-based spectral estimation technique to estimate reliable DF for Afib detection. In this work, real-time intra-atrial electrograms have been acquired and pre-processed for frequency analysis. DF is estimated via Bartlett using Hanning window, and Welch methods. Regularity index (RI), a parameter to ensure reliability of DF, is calculated using Simpson 3/8 and Trapezoidal rules. The best method is declared based upon high accuracy of Afib detection using reliable DF. On comparison, Welch method is found to be more appropriate to estimate reliable DF for Afib detection with 98% accuracy.展开更多
The composite quantile regression should provide estimation efficiency gain over a single quantile regression. In this paper, we extend composite quantile regression to nonparametric model with random censored data. T...The composite quantile regression should provide estimation efficiency gain over a single quantile regression. In this paper, we extend composite quantile regression to nonparametric model with random censored data. The asymptotic normality of the proposed estimator is established. The proposed methods are applied to the lung cancer data. Extensive simulations are reported, showing that the proposed method works well in practical settings.展开更多
A number of statistical tests are proposed for the purpose of change-point detection in a general nonparametric regression model under mild conditions. New proofs are given to prove the weak convergence of the underly...A number of statistical tests are proposed for the purpose of change-point detection in a general nonparametric regression model under mild conditions. New proofs are given to prove the weak convergence of the underlying processes which assume remove the stringent condition of bounded total variation of the regression function and need only second moments. Since many quantities, such as the regression function, the distribution of the covariates and the distribution of the errors, are unspecified, the results are not distribution-free. A weighted bootstrap approach is proposed to approximate the limiting distributions. Results of a simulation study for this paper show good performance for moderate samples sizes.展开更多
In this paper, by using some inequalities of negatively orthant dependent(NOD,in short) random variables and the truncated method of random variables, we investigate the nonparametric regression model. The complete co...In this paper, by using some inequalities of negatively orthant dependent(NOD,in short) random variables and the truncated method of random variables, we investigate the nonparametric regression model. The complete consistency result for the estimator of g(x) is presented.展开更多
Estimation of the bivariate survival function under the competing risks caseis considered.We give an explicit formula for the estimator from a decomposition of thebivariate survival function based on competing risks,w...Estimation of the bivariate survival function under the competing risks caseis considered.We give an explicit formula for the estimator from a decomposition of thebivariate survival function based on competing risks,which is almost sure consistent.展开更多
In this paper, we investigate the nonparametric regression model based on ρ-mixing errors, which are stochastically dominated by a nonnegative random variable. Weobtain the convergence rate for the weighted estimator...In this paper, we investigate the nonparametric regression model based on ρ-mixing errors, which are stochastically dominated by a nonnegative random variable. Weobtain the convergence rate for the weighted estimator of unknown function g(x) in pth-mean, which yields the convergence rate in probability. Moreover, an example of the nearestneighbor estimator is also illustrated and the convergence rates of estimator are presented.展开更多
We consider the problem of estimating a function g in nonparametric regression model when only some of covariates are measured with errors with the assistance of validation data. Without specifying any error model str...We consider the problem of estimating a function g in nonparametric regression model when only some of covariates are measured with errors with the assistance of validation data. Without specifying any error model structure between the surrogate and true covariables, we propose an estimator which integrates orthogonal series estimation and truncated series approximation method. Under general regularity conditions, we get the convergence rate of this estimator. Simulations demonstrate the finite-sample properties of the new estimator.展开更多
In this article we study the estimation method of nonparametric regression measurement error model based on a validation data. The estimation procedures are based on orthogonal series estimation and truncated series a...In this article we study the estimation method of nonparametric regression measurement error model based on a validation data. The estimation procedures are based on orthogonal series estimation and truncated series approximation methods without specifying any structure equation and the distribution assumption. The convergence rates of the proposed estimator are derived. By example and through simulation, the method is robust against the misspecification of a measurement error model.展开更多
In this article, we develop estimation approaches for nonparametric multiple regression measurement error models when both independent validation data on covariables and primary data on the response variable and surro...In this article, we develop estimation approaches for nonparametric multiple regression measurement error models when both independent validation data on covariables and primary data on the response variable and surrogate covariables are available. An estimator which integrates Fourier series estimation and truncated series approximation methods is derived without any error model structure assumption between the true covariables and surrogate variables. Most importantly, our proposed methodology can be readily extended to the case that only some of covariates are measured with errors with the assistance of validation data. Under mild conditions, we derive the convergence rates of the proposed estimators. The finite-sample properties of the estimators are investigated through simulation studies.展开更多
We propose a new nonparametric test based on the rank difference between the paired sample for testing the equality of the marginal distributions from a bivariate distribution. We also consider a modification of the n...We propose a new nonparametric test based on the rank difference between the paired sample for testing the equality of the marginal distributions from a bivariate distribution. We also consider a modification of the novel nonparametric test based on the test proposed by Baumgartern, Weiβ, and Schindler (1998). An extensive numerical power comparison for various parametric and nonparametric tests was conducted under a wide range of bivariate distributions for small sample sizes. The two new nonparametric tests have comparable power to the paired t test for the data simulated from bivariate normal distributions, and are generally more powerful than the paired t test and other commonly used nonparametric tests in several important bivariate distributions.展开更多
文摘The development of defect prediction plays a significant role in improving software quality. Such predictions are used to identify defective modules before the testing and to minimize the time and cost. The software with defects negatively impacts operational costs and finally affects customer satisfaction. Numerous approaches exist to predict software defects. However, the timely and accurate software bugs are the major challenging issues. To improve the timely and accurate software defect prediction, a novel technique called Nonparametric Statistical feature scaled QuAdratic regressive convolution Deep nEural Network (SQADEN) is introduced. The proposed SQADEN technique mainly includes two major processes namely metric or feature selection and classification. First, the SQADEN uses the nonparametric statistical Torgerson–Gower scaling technique for identifying the relevant software metrics by measuring the similarity using the dice coefficient. The feature selection process is used to minimize the time complexity of software fault prediction. With the selected metrics, software fault perdition with the help of the Quadratic Censored regressive convolution deep neural network-based classification. The deep learning classifier analyzes the training and testing samples using the contingency correlation coefficient. The softstep activation function is used to provide the final fault prediction results. To minimize the error, the Nelder–Mead method is applied to solve non-linear least-squares problems. Finally, accurate classification results with a minimum error are obtained at the output layer. Experimental evaluation is carried out with different quantitative metrics such as accuracy, precision, recall, F-measure, and time complexity. The analyzed results demonstrate the superior performance of our proposed SQADEN technique with maximum accuracy, sensitivity and specificity by 3%, 3%, 2% and 3% and minimum time and space by 13% and 15% when compared with the two state-of-the-art methods.
基金This research was supported by the National Natural Science Foundation of China(No.42274141)Science Foundation of China University of Petroleum,Beijing(No.2462020YXZZ007).
文摘Dispersion and attenuation analysis can be used to determine formation anisotropy induced by fractures,or stresses.In this paper,we propose a nonparametric spectrum estimation method to get phase dispersion characteristics and attenuation coefficient.By designing an appropriate vector filter,phase velocity,attenuation coefficient and amplitude can be inverted from the waveform recorded by the receiver array.Performance analysis of this algorithm is compared with Extended Prony Method(EPM)and Forward and Backward Matrix Pencil(FBMP)method.Based on the analysis results,the proposed method is capable of achieving high resolution and precision as the parametric spectrum estimation methods.At the meantime,it also keeps high stability as the other nonparametric spectrum estimation methods.At last,applications to synthetic waveforms modeled using finite difference method and real data show its efficiency.The real data processing results show that the P-wave attenuation log is more sensitive to oil formation compared to S-wave;and the S-wave attenuation log is more sensitive to shale formation compared to P-wave.
文摘Choosing appropriate statistical tests is crucial but deciding which tests to use can be challenging. Different tests suit different types of data and research questions, so it is important to choose the right one. Knowing how to select an appropriate test can lead to more accurate results. Invalid results and misleading conclusions may be drawn from a study if an incorrect statistical test is used. Therefore, to avoid these it is essential to understand the nature of the data, the research question, and the assumptions of the tests before selecting one. This is because there are a wide variety of tests available. This paper provides a step-by-step approach to selecting the right statistical test for any study, with an explanation of when it is appropriate to use it and relevant examples of each statistical test. Furthermore, this guide provides a comprehensive overview of the assumptions of each test and what to do if these assumptions are violated.
文摘Choosing appropriate statistical tests is crucial but deciding which tests to use can be challenging. Different tests suit different types of data and research questions, so it is important to choose the right one. Knowing how to select an appropriate test can lead to more accurate results. Invalid results and misleading conclusions may be drawn from a study if an incorrect statistical test is used. Therefore, to avoid these it is essential to understand the nature of the data, the research question, and the assumptions of the tests before selecting one. This is because there are a wide variety of tests available. This paper provides a step-by-step approach to selecting the right statistical test for any study, with an explanation of when it is appropriate to use it and relevant examples of each statistical test. Furthermore, this guide provides a comprehensive overview of the assumptions of each test and what to do if these assumptions are violated.
基金Supported by the Science Development Foundation of HFUT(041002F)
文摘In this paper, we study the strong consistency for partitioning estimation of regression function under samples that axe φ-mixing sequences with identically distribution.Key words: nonparametric regression function; partitioning estimation; strong convergence;φ-mixing sequences.
文摘This paper studies evolutionary mechanism of parameter selection in the construction of weightfunction for Nearest Neighbour Estimate in nonparametrlc regresslon. Construct an algorithm which adagetively evolves fine weight and makes good prediction about unknown points. The numerical experiments indicate that this method is errective. It is a meaningful discussion about practicability of nonparametric regression and methodology or adaptive model-building.
文摘This poaper is devoted to the performance evaluation of the Generalized Sigu(GS). Trimmed Generalized Sign(TGS), Modified Savage(MS). Mann-Whitney(MW) and a new proposed detector in multiple target situations. The analysis is carried out for both fluctuating and nonfluctuating received signals. The simulation results show that the new proposed detector has the best detection performance in homogeneous as well as nonhomogeneous background conditions, while TGS procedure is better than the GS detector in distinguishing the primary target from the secondary interfering ones.
文摘Higher-order almost cyclostationary complex processes are complex random signals with almost periodically time-varying statistics, which is important to the research of non-Gaussian signals in information system. In tins paper, smoothed polyperiodograms are proposed for related to cyclic polyspectral estimation and are shown to be consistent and asymptotically complex normal. Asymptotic covariance expressions are derived along with their computable forms.
文摘Nonparametric method based on the mutual information is an efficient technique for the image segmentation. In this method, the image is divided into the internal and external labeled regions, and the segmentation problem constrained by the total length of the region boundaries will be changed into the maximization of the mutual information between the region labels and the image pixel intensities. The maximization problem can be solved by deriving the associated gradient flows and the curve evolutions. One of the advantages for this method does not need to choose the segmentation parameter;another is not sensitive to the noise. By employing the narrowband levelset and Fast Gauss Transformation, the computation time is reduced clearly and the algorithm efficiency is greatly improved.
文摘Atrial fibrillation (Afib) is related with heart failure, stroke, and high mortality rates. In frequency domain analysis, pre-requisite for Afib detection has been the estimation of reliable dominant frequency (DF) of atrial signals via different spectral estimation techniques. DF further characterizes Afib, and helps in its treatment. This paper aims at finding the most appropriate nonparametric FFT-based spectral estimation technique to estimate reliable DF for Afib detection. In this work, real-time intra-atrial electrograms have been acquired and pre-processed for frequency analysis. DF is estimated via Bartlett using Hanning window, and Welch methods. Regularity index (RI), a parameter to ensure reliability of DF, is calculated using Simpson 3/8 and Trapezoidal rules. The best method is declared based upon high accuracy of Afib detection using reliable DF. On comparison, Welch method is found to be more appropriate to estimate reliable DF for Afib detection with 98% accuracy.
文摘The composite quantile regression should provide estimation efficiency gain over a single quantile regression. In this paper, we extend composite quantile regression to nonparametric model with random censored data. The asymptotic normality of the proposed estimator is established. The proposed methods are applied to the lung cancer data. Extensive simulations are reported, showing that the proposed method works well in practical settings.
文摘A number of statistical tests are proposed for the purpose of change-point detection in a general nonparametric regression model under mild conditions. New proofs are given to prove the weak convergence of the underlying processes which assume remove the stringent condition of bounded total variation of the regression function and need only second moments. Since many quantities, such as the regression function, the distribution of the covariates and the distribution of the errors, are unspecified, the results are not distribution-free. A weighted bootstrap approach is proposed to approximate the limiting distributions. Results of a simulation study for this paper show good performance for moderate samples sizes.
基金Supported by the Research Teaching Model Curriculum of Anhui University(xjyjkc1407)Supported by the Students Innovative Training Project of Anhui University(201310357004,201410357117,201410357249)Supported by the Quality Improvement Projects for Undergraduate Education of Anhui University(ZLTS2015035)
文摘In this paper, by using some inequalities of negatively orthant dependent(NOD,in short) random variables and the truncated method of random variables, we investigate the nonparametric regression model. The complete consistency result for the estimator of g(x) is presented.
文摘Estimation of the bivariate survival function under the competing risks caseis considered.We give an explicit formula for the estimator from a decomposition of thebivariate survival function based on competing risks,which is almost sure consistent.
基金Supported by National Natural Science Foundation of China(11426032,11501005)Natural Science Foundation of Anhui Province(1408085QA02,1508085QA01,1508085J06)+5 种基金Provincial Natural Science Research Project of Anhui Colleges(KJ2014A010,KJ2014A020,KJ2015A065)Higher Education Talent Revitalization Project of Anhui Province(2013SQRL005ZD)Quality Engineering Project of Anhui Province(2015jyxm054,2015jyxm057)Students Science Research Training Program of Anhui University(KYXL2014016,KYXL2014013)Applied Teaching Model Curriculum of Anhui University(XJYYKC1401,ZLTS2015052,ZLTS2015053)Doctoral Research Start-up Funds Projects of Anhui University
文摘In this paper, we investigate the nonparametric regression model based on ρ-mixing errors, which are stochastically dominated by a nonnegative random variable. Weobtain the convergence rate for the weighted estimator of unknown function g(x) in pth-mean, which yields the convergence rate in probability. Moreover, an example of the nearestneighbor estimator is also illustrated and the convergence rates of estimator are presented.
文摘We consider the problem of estimating a function g in nonparametric regression model when only some of covariates are measured with errors with the assistance of validation data. Without specifying any error model structure between the surrogate and true covariables, we propose an estimator which integrates orthogonal series estimation and truncated series approximation method. Under general regularity conditions, we get the convergence rate of this estimator. Simulations demonstrate the finite-sample properties of the new estimator.
文摘In this article we study the estimation method of nonparametric regression measurement error model based on a validation data. The estimation procedures are based on orthogonal series estimation and truncated series approximation methods without specifying any structure equation and the distribution assumption. The convergence rates of the proposed estimator are derived. By example and through simulation, the method is robust against the misspecification of a measurement error model.
文摘In this article, we develop estimation approaches for nonparametric multiple regression measurement error models when both independent validation data on covariables and primary data on the response variable and surrogate covariables are available. An estimator which integrates Fourier series estimation and truncated series approximation methods is derived without any error model structure assumption between the true covariables and surrogate variables. Most importantly, our proposed methodology can be readily extended to the case that only some of covariates are measured with errors with the assistance of validation data. Under mild conditions, we derive the convergence rates of the proposed estimators. The finite-sample properties of the estimators are investigated through simulation studies.
文摘We propose a new nonparametric test based on the rank difference between the paired sample for testing the equality of the marginal distributions from a bivariate distribution. We also consider a modification of the novel nonparametric test based on the test proposed by Baumgartern, Weiβ, and Schindler (1998). An extensive numerical power comparison for various parametric and nonparametric tests was conducted under a wide range of bivariate distributions for small sample sizes. The two new nonparametric tests have comparable power to the paired t test for the data simulated from bivariate normal distributions, and are generally more powerful than the paired t test and other commonly used nonparametric tests in several important bivariate distributions.