In this paper,we consider the limit distribution of the error density function estima-tor in the rst-order autoregressive models with negatively associated and positively associated random errors.Under mild regularity...In this paper,we consider the limit distribution of the error density function estima-tor in the rst-order autoregressive models with negatively associated and positively associated random errors.Under mild regularity assumptions,some asymptotic normality results of the residual density estimator are obtained when the autoregressive models are stationary process and explosive process.In order to illustrate these results,some simulations such as con dence intervals and mean integrated square errors are provided in this paper.It shows that the residual density estimator can replace the density\estimator"which contains errors.展开更多
The Extended Exponentially Weighted Moving Average(extended EWMA)control chart is one of the control charts and can be used to quickly detect a small shift.The performance of control charts can be evaluated with the a...The Extended Exponentially Weighted Moving Average(extended EWMA)control chart is one of the control charts and can be used to quickly detect a small shift.The performance of control charts can be evaluated with the average run length(ARL).Due to the deriving explicit formulas for the ARL on a two-sided extended EWMA control chart for trend autoregressive or trend AR(p)model has not been reported previously.The aim of this study is to derive the explicit formulas for the ARL on a two-sided extended EWMA con-trol chart for the trend AR(p)model as well as the trend AR(1)and trend AR(2)models with exponential white noise.The analytical solution accuracy was obtained with the extended EWMA control chart and was compared to the numer-ical integral equation(NIE)method.The results show that the ARL obtained by the explicit formula and the NIE method is hardly different,but the explicit for-mula can help decrease the computational(CPU)time.Furthermore,this is also expanded to comparative performance with the Exponentially Weighted Moving Average(EWMA)control chart.The performance of the extended EWMA control chart is better than the EWMA control chart for all situations,both the trend AR(1)and trend AR(2)models.Finally,the analytical solution of ARL is applied to real-world data in the healthfield,such as COVID-19 data in the United Kingdom and Sweden,to demonstrate the efficacy of the proposed method.展开更多
Based on analyzing the limitations of the commonly used back-propagation neural network (BPNN), a wavelet neural network (WNN) is adopted as the nonlinear river channel flood forecasting method replacing the BPNN....Based on analyzing the limitations of the commonly used back-propagation neural network (BPNN), a wavelet neural network (WNN) is adopted as the nonlinear river channel flood forecasting method replacing the BPNN. The WNN has the characteristics of fast convergence and improved capability of nonlinear approximation. For the purpose of adapting the timevarying characteristics of flood routing, the WNN is coupled with an AR real-time correction model. The AR model is utilized to calculate the forecast error. The coefficients of the AR real-time correction model are dynamically updated by an adaptive fading factor recursive least square(RLS) method. The application of the flood forecasting method in the cross section of Xijiang River at Gaoyao shows its effectiveness.展开更多
Wavelets are applied to detection of the jump points of a regression function in nonlinear autoregressive model x(t) = T(x(t-1)) + epsilon t. By checking the empirical wavelet coefficients of the data,which have signi...Wavelets are applied to detection of the jump points of a regression function in nonlinear autoregressive model x(t) = T(x(t-1)) + epsilon t. By checking the empirical wavelet coefficients of the data,which have significantly large absolute values across fine scale levels, the number of the jump points and locations where the jumps occur are estimated. The jump heights are also estimated. All estimators are shown to be consistent. Wavelet method ia also applied to the threshold AR(1) model(TAR(1)). The simple estimators of the thresholds are given,which are shown to be consistent.展开更多
In this paper, we not only construct the confidence region for parameters in a mixed integer-valued autoregressive process using the empirical likelihood method, but also establish the empirical log-likelihood ratio s...In this paper, we not only construct the confidence region for parameters in a mixed integer-valued autoregressive process using the empirical likelihood method, but also establish the empirical log-likelihood ratio statistic and obtain its limiting distribution. And then, via simulation studies we give coverage probabilities for the parameters of interest. The results show that the empirical likelihood method performs very well.展开更多
Consider the model Yt = βYt-1+g(Yt-2)+εt for 3 〈 t 〈 T. Hereg is anunknown function, β is an unknown parameter, εt are i.i.d, random errors with mean 0 andvariance σ2 and the fourth moment α4, and α4 are ...Consider the model Yt = βYt-1+g(Yt-2)+εt for 3 〈 t 〈 T. Hereg is anunknown function, β is an unknown parameter, εt are i.i.d, random errors with mean 0 andvariance σ2 and the fourth moment α4, and α4 are independent of Y8 for all t ≥ 3 and s = 1, 2.Pseudo-LS estimators σ, σ2T α4τ and D2T of σ^2,α4 and Var(ε2↑3) are respectively constructedbased on piecewise polynomial approximator of g. The weak consistency of α4T and D2T are proved. The asymptotic normality of σ2T is given, i.e., √T(σ2T -σ^2)/DT converges indistribution to N(0, 1). The result can be used to establish large sample interval estimatesof σ^2 or to make large sample tests for σ^2.展开更多
The classical autoregressive(AR)model has been widely applied to predict future data usingmpast observations over five decades.As the classical AR model required m unknown parameters,this paper implements the AR model...The classical autoregressive(AR)model has been widely applied to predict future data usingmpast observations over five decades.As the classical AR model required m unknown parameters,this paper implements the AR model by reducing m parameters to two parameters to obtain a new model with an optimal delay called as the m-delay AR model.We derive the m-delay AR formula for approximating two unknown parameters based on the least squares method and develop an algorithm to determine optimal delay based on a brute-force technique.The performance of them-delay AR model was tested by comparing with the classical AR model.The results,obtained from Monte Carlo simulation using the monthly mean minimum temperature in PerthWestern Australia from the Bureau of Meteorology,are no significant difference compared to those obtained from the classical AR model.This confirms that the m-delay AR model is an effective model for time series analysis.展开更多
The use of historical data is important in making the predictions, for instance in the exchange rate. However, in the construction of a model, extreme data or dirtiness of data is inevitable. In this study, AR model i...The use of historical data is important in making the predictions, for instance in the exchange rate. However, in the construction of a model, extreme data or dirtiness of data is inevitable. In this study, AR model is used with the exchange rate historical data (January 2007 until December 2007) for USD/MYR and is divided into 1-, 3- and 6-horizontal months respectively. Since the presence of extreme data will affect the accuracy of the results obtained in a prediction. Therefore, to obtain a more accurate prediction results, the bootstrap approach was implemented by hybrid with AR model coins as the Bootstrap Autoregressive model (BAR). The effectiveness of the proposed model is investigated by comparing the existing and the proposed model through the statistical performance methods which are RMSE, MAE and MAD. The comparison involves 1%, 5% and 10% for each horizontal month. The results showed that the BAR model performed better than the AR model in terms of sensitivity to extreme data, the accuracy of forecasting models, efficiency and predictability of the model prediction. In conclusion, bootstrap method can alleviate the sensitivity of the model to the extreme data, thereby improving the accuracy of forecasting model which also have high prediction efficiency and that can increase the predictability of the model.展开更多
This paper proposes a new method for extracting ENF (electric network frequency) fluctuations from digital audio recordings for the purpose of forensic authentication. It is shown that the extraction of ENF componen...This paper proposes a new method for extracting ENF (electric network frequency) fluctuations from digital audio recordings for the purpose of forensic authentication. It is shown that the extraction of ENF components from audio recordings is realizable by applying a parametric approach based on an AR (autoregressive) model. The proposed method is compared to the existing STFT (short-time Fourier transform) based ENF extraction method. Experimental results from recorded electrical grid signals and recorded audio signals show that the proposed approach can improve the time resolution in the extracted ENF fluctuations and improve the detection of tampering with short alterations in longer audio recordings.展开更多
The identification of the inter-electrode gap size in the high frequency group pulse micro-electrochemical machining (HGPECM) is mainly discussed. The auto-regressive(AR) model of group pulse current flowing acros...The identification of the inter-electrode gap size in the high frequency group pulse micro-electrochemical machining (HGPECM) is mainly discussed. The auto-regressive(AR) model of group pulse current flowing across the cathode and the anode are created under different situations with different processing parameters and inter-electrode gap size. The AR model based on the current signals indicates that the order of the AR model is obviously different relating to the different processing conditions and the inter-electrode gap size; Moreover, it is different about the stability of the dynamic system, i.e. the white noise response of the Green's function of the dynamic system is diverse. In addition, power spectrum method is used in the analysis of the dynamic time series about the current signals with different inter-electrode gap size, the results show that there exists a strongest power spectrum peak, characteristic power spectrum(CPS), to the current signals related to the different inter-electrode gap size in the range of 0~5 kHz. Therefore, the CPS of current signals can implement the identification of the inter-electrode gap.展开更多
Since OpenAI opened access to ChatGPT,large language models(LLMs)become an increasingly popular topic attracting researchers’attention from abundant domains.However,public researchers meet some problems when developi...Since OpenAI opened access to ChatGPT,large language models(LLMs)become an increasingly popular topic attracting researchers’attention from abundant domains.However,public researchers meet some problems when developing LLMs given that most of the LLMs are produced by industries and the training details are typically unrevealed.Since datasets are an important setup of LLMs,this paper does a holistic survey on the training datasets used in both the pre-train and fine-tune processes.The paper first summarizes 16 pre-train datasets and 16 fine-tune datasets used in the state-of-the-art LLMs.Secondly,based on the properties of the pre-train and fine-tune processes,it comments on pre-train datasets from quality,quantity,and relation with models,and comments on fine-tune datasets from quality,quantity,and concerns.This study then critically figures out the problems and research trends that exist in current LLM datasets.The study helps public researchers train and investigate LLMs by visual cases and provides useful comments to the research community regarding data development.To the best of our knowledge,this paper is the first to summarize and discuss datasets used in both autoregressive and chat LLMs.The survey offers insights and suggestions to researchers and LLM developers as they build their models,and contributes to the LLM study by pointing out the existing problems of LLM studies from the perspective of data.展开更多
This paper considers the random coefficient autoregressive model with time-functional variance noises,hereafter the RCA-TFV model.We first establish the consistency and asymptotic normality of the conditional least sq...This paper considers the random coefficient autoregressive model with time-functional variance noises,hereafter the RCA-TFV model.We first establish the consistency and asymptotic normality of the conditional least squares estimator for the constant coefficient.The semiparametric least squares estimator for the variance of the random coefficient and the nonparametric estimator for the variance function are constructed,and their asymptotic results are reported.A simulation study is presented along with an analysis of real data to assess the performance of our method in finite samples.展开更多
In many application fields of regression analysis,prior information about how explanatory variables affect response variable of interest is often available and can be formulated as constraints on regression coefficien...In many application fields of regression analysis,prior information about how explanatory variables affect response variable of interest is often available and can be formulated as constraints on regression coefficients.In this paper,the authors consider statistical inference of partially linear spatial autoregressive model under constraint conditions.By combining series approximation method,twostage least squares method and Lagrange multiplier method,the authors obtain constrained estimators of the parameters and function in the partially linear spatial autoregressive model and investigate their asymptotic properties.Furthermore,the authors propose a testing method to check whether the parameters in the parametric component of the partially linear spatial autoregressive model satisfy linear constraint conditions,and derive asymptotic distributions of the resulting test statistic under both null and alternative hypotheses.Simulation results show that the proposed constrained estimators have better finite sample performance than the unconstrained estimators and the proposed testing method performs well in finite samples.Furthermore,a real example is provided to illustrate the application of the proposed estimation and testing methods.展开更多
We propose a novel polynomial network autoregressive model by incorporating higher-order connected relationships to simultaneously model the effects of both direct and indirect connections. A quasimaximum likelihood e...We propose a novel polynomial network autoregressive model by incorporating higher-order connected relationships to simultaneously model the effects of both direct and indirect connections. A quasimaximum likelihood estimation method is proposed to estimate the unknown influence parameters, and we demonstrate its consistency and asymptotic normality without imposing any distribution assumption. Moreover,an extended Bayesian information criterion is set for order selection with a divergent upper order. The application of the proposed polynomial network autoregressive model is demonstrated through both the simulation and the real data analysis.展开更多
We discuss formulas and techniques for finding maximum-likelihood estimators of parameters of autoregressive (with particular emphasis on Markov and Yule) models, computing their asymptotic variance-covariance matrix ...We discuss formulas and techniques for finding maximum-likelihood estimators of parameters of autoregressive (with particular emphasis on Markov and Yule) models, computing their asymptotic variance-covariance matrix and displaying the resulting confidence regions;Monte Carlo simulation is then used to establish the accuracy of the corresponding level of confidence. The results indicate that a direct application of the Central Limit Theorem yields errors too large to be acceptable;instead, we recommend using a technique based directly on the natural logarithm of the likelihood function, verifying its substantially higher accuracy. Our study is then extended to the case of estimating only a subset of a model’s parameters, when the remaining ones (called nuisance) are of no interest to us.展开更多
The spatial and spatiotemporal autoregressive conditional heteroscedasticity(STARCH) models receive increasing attention. In this paper, we introduce a spatiotemporal autoregressive(STAR) model with STARCH errors, whi...The spatial and spatiotemporal autoregressive conditional heteroscedasticity(STARCH) models receive increasing attention. In this paper, we introduce a spatiotemporal autoregressive(STAR) model with STARCH errors, which can capture the spatiotemporal dependence in mean and variance simultaneously. The Bayesian estimation and model selection are considered for our model. By Monte Carlo simulations, it is shown that the Bayesian estimator performs better than the corresponding maximum-likelihood estimator, and the Bayesian model selection can select out the true model in most times. Finally, two empirical examples are given to illustrate the superiority of our models in fitting those data.展开更多
The stock market is a vital component of the broader financial system,with its dynamics closely linked to economic growth.The challenges associated with analyzing and forecasting stock prices have persisted since the ...The stock market is a vital component of the broader financial system,with its dynamics closely linked to economic growth.The challenges associated with analyzing and forecasting stock prices have persisted since the inception of financial markets.By examining historical transaction data,latent opportunities for profit can be uncovered,providing valuable insights for both institutional and individual investors to make more informed decisions.This study focuses on analyzing historical transaction data from four banks to predict closing price trends.Various models,including decision trees,random forests,and Long Short-Term Memory(LSTM)networks,are employed to forecast stock price movements.Historical stock transaction data serves as the input for training these models,which are then used to predict upward or downward stock price trends.The study’s empirical results indicate that these methods are effective to a degree in predicting stock price movements.The LSTM-based deep neural network model,in particular,demonstrates a commendable level of predictive accuracy.This conclusion is reached following a thorough evaluation of model performance,highlighting the potential of LSTM models in stock market forecasting.The findings offer significant implications for advancing financial forecasting approaches,thereby improving the decision-making capabilities of investors and financial institutions.展开更多
This paper is devoted to the goodness-of-fit test for the general autoregressive models in time series. By averaging for the weighted residuals, we construct a score type test which is asymptotically standard chi-squa...This paper is devoted to the goodness-of-fit test for the general autoregressive models in time series. By averaging for the weighted residuals, we construct a score type test which is asymptotically standard chi-squared under the null and has some desirable power properties under the alternatives. Specifically, the test is sensitive to alternatives and can detect the alternatives approaching, along a direction, the null at a rate that is arbitrarily close to n-1/2. Furthermore, when the alternatives are not directional, we construct asymptotically distribution-free maximin tests for a large class of alternatives. The performance of the tests is evaluated through simulation studies.展开更多
This paper introduces a new model-based soft decoding techniqt, e to restore the widely used joint photographic expert group (JPEG) streams. The image is modeled as a two dimensional (2D) piecewise stationary auto...This paper introduces a new model-based soft decoding techniqt, e to restore the widely used joint photographic expert group (JPEG) streams. The image is modeled as a two dimensional (2D) piecewise stationary autoregressive process, and the decoding task is formulated as a constrained optimization problem. All the constraints are given by the quantization intervals which available at the decoder freely. The autoregressive model serves as an important regularization term of the objective function of the optimization, and the model parameters are solved on the decoded image locally using a weighted total least square method. In addition, a novel bilateral dualside weighting scheme is proposed to minimize the influence of the blocking artifact on the accuracy of parameter estimation. Extensive experimental results suggest that the proposed algorithm systematically improves the quality of JPEG images and also outperforms existing JPEG postprocessing algorithms in a wide bit-rate range both in terms of peak signal-to-noise ratio (PSNR) and subjective quality展开更多
Detection and clarification of cause-effect relationships among variables is an important problem in time series analysis. Traditional causality inference methods have a salient limitation that the model must be linea...Detection and clarification of cause-effect relationships among variables is an important problem in time series analysis. Traditional causality inference methods have a salient limitation that the model must be linear and with Gaussian noise. Although additive model regression can effectively infer the nonlinear causal relationships of additive nonlinear time series, it suffers from the limitation that contemporaneous causal relationships of variables must be linear and not always valid to test conditional independence relations. This paper provides a nonparametric method that employs both mutual information and conditional mutual information to identify causal structure of a class of nonlinear time series models, which extends the additive nonlinear times series to nonlinear structural vector autoregressive models. An algorithm is developed to learn the contemporaneous and the lagged causal relationships of variables. Simulations demonstrate the effectiveness of the nroosed method.展开更多
基金supported by the National Natural Science Foundation of China(12131015,12071422)。
文摘In this paper,we consider the limit distribution of the error density function estima-tor in the rst-order autoregressive models with negatively associated and positively associated random errors.Under mild regularity assumptions,some asymptotic normality results of the residual density estimator are obtained when the autoregressive models are stationary process and explosive process.In order to illustrate these results,some simulations such as con dence intervals and mean integrated square errors are provided in this paper.It shows that the residual density estimator can replace the density\estimator"which contains errors.
基金Thailand Science ResearchInnovation Fund,and King Mongkut's University of Technology North Bangkok Contract No.KMUTNB-FF-65-45.
文摘The Extended Exponentially Weighted Moving Average(extended EWMA)control chart is one of the control charts and can be used to quickly detect a small shift.The performance of control charts can be evaluated with the average run length(ARL).Due to the deriving explicit formulas for the ARL on a two-sided extended EWMA control chart for trend autoregressive or trend AR(p)model has not been reported previously.The aim of this study is to derive the explicit formulas for the ARL on a two-sided extended EWMA con-trol chart for the trend AR(p)model as well as the trend AR(1)and trend AR(2)models with exponential white noise.The analytical solution accuracy was obtained with the extended EWMA control chart and was compared to the numer-ical integral equation(NIE)method.The results show that the ARL obtained by the explicit formula and the NIE method is hardly different,but the explicit for-mula can help decrease the computational(CPU)time.Furthermore,this is also expanded to comparative performance with the Exponentially Weighted Moving Average(EWMA)control chart.The performance of the extended EWMA control chart is better than the EWMA control chart for all situations,both the trend AR(1)and trend AR(2)models.Finally,the analytical solution of ARL is applied to real-world data in the healthfield,such as COVID-19 data in the United Kingdom and Sweden,to demonstrate the efficacy of the proposed method.
基金The National Natural Science Foundation of China(No.50479017).
文摘Based on analyzing the limitations of the commonly used back-propagation neural network (BPNN), a wavelet neural network (WNN) is adopted as the nonlinear river channel flood forecasting method replacing the BPNN. The WNN has the characteristics of fast convergence and improved capability of nonlinear approximation. For the purpose of adapting the timevarying characteristics of flood routing, the WNN is coupled with an AR real-time correction model. The AR model is utilized to calculate the forecast error. The coefficients of the AR real-time correction model are dynamically updated by an adaptive fading factor recursive least square(RLS) method. The application of the flood forecasting method in the cross section of Xijiang River at Gaoyao shows its effectiveness.
文摘Wavelets are applied to detection of the jump points of a regression function in nonlinear autoregressive model x(t) = T(x(t-1)) + epsilon t. By checking the empirical wavelet coefficients of the data,which have significantly large absolute values across fine scale levels, the number of the jump points and locations where the jumps occur are estimated. The jump heights are also estimated. All estimators are shown to be consistent. Wavelet method ia also applied to the threshold AR(1) model(TAR(1)). The simple estimators of the thresholds are given,which are shown to be consistent.
基金Supported by National Natural Science Foundation of China(11731015,11571051,J1310022,11501241)Natural Science Foundation of Jilin Province(20150520053JH,20170101057JC,20180101216JC)+2 种基金Program for Changbaishan Scholars of Jilin Province(2015010)Science and Technology Program of Jilin Educational Department during the "13th Five-Year" Plan Period(2016-399)Science and Technology Research Program of Education Department in Jilin Province for the 13th Five-Year Plan(2016213)
文摘In this paper, we not only construct the confidence region for parameters in a mixed integer-valued autoregressive process using the empirical likelihood method, but also establish the empirical log-likelihood ratio statistic and obtain its limiting distribution. And then, via simulation studies we give coverage probabilities for the parameters of interest. The results show that the empirical likelihood method performs very well.
基金Supported by the National Natural Science Foundation of China(60375003) Supported by the Chinese Aviation Foundation(03153059)
文摘Consider the model Yt = βYt-1+g(Yt-2)+εt for 3 〈 t 〈 T. Hereg is anunknown function, β is an unknown parameter, εt are i.i.d, random errors with mean 0 andvariance σ2 and the fourth moment α4, and α4 are independent of Y8 for all t ≥ 3 and s = 1, 2.Pseudo-LS estimators σ, σ2T α4τ and D2T of σ^2,α4 and Var(ε2↑3) are respectively constructedbased on piecewise polynomial approximator of g. The weak consistency of α4T and D2T are proved. The asymptotic normality of σ2T is given, i.e., √T(σ2T -σ^2)/DT converges indistribution to N(0, 1). The result can be used to establish large sample interval estimatesof σ^2 or to make large sample tests for σ^2.
文摘The classical autoregressive(AR)model has been widely applied to predict future data usingmpast observations over five decades.As the classical AR model required m unknown parameters,this paper implements the AR model by reducing m parameters to two parameters to obtain a new model with an optimal delay called as the m-delay AR model.We derive the m-delay AR formula for approximating two unknown parameters based on the least squares method and develop an algorithm to determine optimal delay based on a brute-force technique.The performance of them-delay AR model was tested by comparing with the classical AR model.The results,obtained from Monte Carlo simulation using the monthly mean minimum temperature in PerthWestern Australia from the Bureau of Meteorology,are no significant difference compared to those obtained from the classical AR model.This confirms that the m-delay AR model is an effective model for time series analysis.
文摘The use of historical data is important in making the predictions, for instance in the exchange rate. However, in the construction of a model, extreme data or dirtiness of data is inevitable. In this study, AR model is used with the exchange rate historical data (January 2007 until December 2007) for USD/MYR and is divided into 1-, 3- and 6-horizontal months respectively. Since the presence of extreme data will affect the accuracy of the results obtained in a prediction. Therefore, to obtain a more accurate prediction results, the bootstrap approach was implemented by hybrid with AR model coins as the Bootstrap Autoregressive model (BAR). The effectiveness of the proposed model is investigated by comparing the existing and the proposed model through the statistical performance methods which are RMSE, MAE and MAD. The comparison involves 1%, 5% and 10% for each horizontal month. The results showed that the BAR model performed better than the AR model in terms of sensitivity to extreme data, the accuracy of forecasting models, efficiency and predictability of the model prediction. In conclusion, bootstrap method can alleviate the sensitivity of the model to the extreme data, thereby improving the accuracy of forecasting model which also have high prediction efficiency and that can increase the predictability of the model.
文摘This paper proposes a new method for extracting ENF (electric network frequency) fluctuations from digital audio recordings for the purpose of forensic authentication. It is shown that the extraction of ENF components from audio recordings is realizable by applying a parametric approach based on an AR (autoregressive) model. The proposed method is compared to the existing STFT (short-time Fourier transform) based ENF extraction method. Experimental results from recorded electrical grid signals and recorded audio signals show that the proposed approach can improve the time resolution in the extracted ENF fluctuations and improve the detection of tampering with short alterations in longer audio recordings.
基金This project is supported by the 10th Five-year Plan Pre-research Project Foundation of China Weapon Industry Company, China(No.42001080701).
文摘The identification of the inter-electrode gap size in the high frequency group pulse micro-electrochemical machining (HGPECM) is mainly discussed. The auto-regressive(AR) model of group pulse current flowing across the cathode and the anode are created under different situations with different processing parameters and inter-electrode gap size. The AR model based on the current signals indicates that the order of the AR model is obviously different relating to the different processing conditions and the inter-electrode gap size; Moreover, it is different about the stability of the dynamic system, i.e. the white noise response of the Green's function of the dynamic system is diverse. In addition, power spectrum method is used in the analysis of the dynamic time series about the current signals with different inter-electrode gap size, the results show that there exists a strongest power spectrum peak, characteristic power spectrum(CPS), to the current signals related to the different inter-electrode gap size in the range of 0~5 kHz. Therefore, the CPS of current signals can implement the identification of the inter-electrode gap.
文摘Since OpenAI opened access to ChatGPT,large language models(LLMs)become an increasingly popular topic attracting researchers’attention from abundant domains.However,public researchers meet some problems when developing LLMs given that most of the LLMs are produced by industries and the training details are typically unrevealed.Since datasets are an important setup of LLMs,this paper does a holistic survey on the training datasets used in both the pre-train and fine-tune processes.The paper first summarizes 16 pre-train datasets and 16 fine-tune datasets used in the state-of-the-art LLMs.Secondly,based on the properties of the pre-train and fine-tune processes,it comments on pre-train datasets from quality,quantity,and relation with models,and comments on fine-tune datasets from quality,quantity,and concerns.This study then critically figures out the problems and research trends that exist in current LLM datasets.The study helps public researchers train and investigate LLMs by visual cases and provides useful comments to the research community regarding data development.To the best of our knowledge,this paper is the first to summarize and discuss datasets used in both autoregressive and chat LLMs.The survey offers insights and suggestions to researchers and LLM developers as they build their models,and contributes to the LLM study by pointing out the existing problems of LLM studies from the perspective of data.
基金supported by the National Natural Science Foundation of China(Grant No.52338009)the National Science Fund for Distinguished Young Scholars(Grant No.52025085)+4 种基金the Graduate Research Innovation Project of Hunan Province(Grant No.CX20220952)Xiaohui Liu’s research is supported by the NSF of China(Grant No.11971208)the National Social Science Foundation of China(Grant No.21&ZD152)the Outstanding Youth Fund Project of the Science and Technology Department of Jiangxi Province(Grant No.20224ACB211003)the NSF of China(Grant No.92358303).
文摘This paper considers the random coefficient autoregressive model with time-functional variance noises,hereafter the RCA-TFV model.We first establish the consistency and asymptotic normality of the conditional least squares estimator for the constant coefficient.The semiparametric least squares estimator for the variance of the random coefficient and the nonparametric estimator for the variance function are constructed,and their asymptotic results are reported.A simulation study is presented along with an analysis of real data to assess the performance of our method in finite samples.
基金supported by the Natural Science Foundation of Shaanxi Province under Grant No.2021JM349the Natural Science Foundation of China under Grant Nos.11972273 and 52170172。
文摘In many application fields of regression analysis,prior information about how explanatory variables affect response variable of interest is often available and can be formulated as constraints on regression coefficients.In this paper,the authors consider statistical inference of partially linear spatial autoregressive model under constraint conditions.By combining series approximation method,twostage least squares method and Lagrange multiplier method,the authors obtain constrained estimators of the parameters and function in the partially linear spatial autoregressive model and investigate their asymptotic properties.Furthermore,the authors propose a testing method to check whether the parameters in the parametric component of the partially linear spatial autoregressive model satisfy linear constraint conditions,and derive asymptotic distributions of the resulting test statistic under both null and alternative hypotheses.Simulation results show that the proposed constrained estimators have better finite sample performance than the unconstrained estimators and the proposed testing method performs well in finite samples.Furthermore,a real example is provided to illustrate the application of the proposed estimation and testing methods.
基金supported by the Fundamental Research Funds for the Central Universities(Grant No.JBK2207075)The second author was supported by National Natural Science Foundation of China(Grant Nos.71991472,12171395,11931014 and 71532001)+1 种基金the Joint Lab of Data Science and Business Intelligence at Southwestern University of Finance and Economics and the Fundamental Research Funds for the Central Universities(Grant No.JBK1806002)The fourth author was supported by the Humanity and Social Science Youth Foundation of Ministry of Education of China(Grant No.19YJC790204)。
文摘We propose a novel polynomial network autoregressive model by incorporating higher-order connected relationships to simultaneously model the effects of both direct and indirect connections. A quasimaximum likelihood estimation method is proposed to estimate the unknown influence parameters, and we demonstrate its consistency and asymptotic normality without imposing any distribution assumption. Moreover,an extended Bayesian information criterion is set for order selection with a divergent upper order. The application of the proposed polynomial network autoregressive model is demonstrated through both the simulation and the real data analysis.
文摘We discuss formulas and techniques for finding maximum-likelihood estimators of parameters of autoregressive (with particular emphasis on Markov and Yule) models, computing their asymptotic variance-covariance matrix and displaying the resulting confidence regions;Monte Carlo simulation is then used to establish the accuracy of the corresponding level of confidence. The results indicate that a direct application of the Central Limit Theorem yields errors too large to be acceptable;instead, we recommend using a technique based directly on the natural logarithm of the likelihood function, verifying its substantially higher accuracy. Our study is then extended to the case of estimating only a subset of a model’s parameters, when the remaining ones (called nuisance) are of no interest to us.
基金supported by National Natural Science Foundation of China (No.12271206)Natural Science Foundation of Jilin Province (No.20210101143JC)Science and Technology Research Planning Project of Jilin Provincial Department of Education (No.JJKH20231122KJ)。
文摘The spatial and spatiotemporal autoregressive conditional heteroscedasticity(STARCH) models receive increasing attention. In this paper, we introduce a spatiotemporal autoregressive(STAR) model with STARCH errors, which can capture the spatiotemporal dependence in mean and variance simultaneously. The Bayesian estimation and model selection are considered for our model. By Monte Carlo simulations, it is shown that the Bayesian estimator performs better than the corresponding maximum-likelihood estimator, and the Bayesian model selection can select out the true model in most times. Finally, two empirical examples are given to illustrate the superiority of our models in fitting those data.
文摘The stock market is a vital component of the broader financial system,with its dynamics closely linked to economic growth.The challenges associated with analyzing and forecasting stock prices have persisted since the inception of financial markets.By examining historical transaction data,latent opportunities for profit can be uncovered,providing valuable insights for both institutional and individual investors to make more informed decisions.This study focuses on analyzing historical transaction data from four banks to predict closing price trends.Various models,including decision trees,random forests,and Long Short-Term Memory(LSTM)networks,are employed to forecast stock price movements.Historical stock transaction data serves as the input for training these models,which are then used to predict upward or downward stock price trends.The study’s empirical results indicate that these methods are effective to a degree in predicting stock price movements.The LSTM-based deep neural network model,in particular,demonstrates a commendable level of predictive accuracy.This conclusion is reached following a thorough evaluation of model performance,highlighting the potential of LSTM models in stock market forecasting.The findings offer significant implications for advancing financial forecasting approaches,thereby improving the decision-making capabilities of investors and financial institutions.
基金grant from the Research Grants Council of Hong Kong
文摘This paper is devoted to the goodness-of-fit test for the general autoregressive models in time series. By averaging for the weighted residuals, we construct a score type test which is asymptotically standard chi-squared under the null and has some desirable power properties under the alternatives. Specifically, the test is sensitive to alternatives and can detect the alternatives approaching, along a direction, the null at a rate that is arbitrarily close to n-1/2. Furthermore, when the alternatives are not directional, we construct asymptotically distribution-free maximin tests for a large class of alternatives. The performance of the tests is evaluated through simulation studies.
基金supported by the National Natural Science Foundation of China(61033004,61070138,61072104,61003148)
文摘This paper introduces a new model-based soft decoding techniqt, e to restore the widely used joint photographic expert group (JPEG) streams. The image is modeled as a two dimensional (2D) piecewise stationary autoregressive process, and the decoding task is formulated as a constrained optimization problem. All the constraints are given by the quantization intervals which available at the decoder freely. The autoregressive model serves as an important regularization term of the objective function of the optimization, and the model parameters are solved on the decoded image locally using a weighted total least square method. In addition, a novel bilateral dualside weighting scheme is proposed to minimize the influence of the blocking artifact on the accuracy of parameter estimation. Extensive experimental results suggest that the proposed algorithm systematically improves the quality of JPEG images and also outperforms existing JPEG postprocessing algorithms in a wide bit-rate range both in terms of peak signal-to-noise ratio (PSNR) and subjective quality
基金supported by the National Natural Science Foundation of China under Grant Nos.60972150 and 10926197
文摘Detection and clarification of cause-effect relationships among variables is an important problem in time series analysis. Traditional causality inference methods have a salient limitation that the model must be linear and with Gaussian noise. Although additive model regression can effectively infer the nonlinear causal relationships of additive nonlinear time series, it suffers from the limitation that contemporaneous causal relationships of variables must be linear and not always valid to test conditional independence relations. This paper provides a nonparametric method that employs both mutual information and conditional mutual information to identify causal structure of a class of nonlinear time series models, which extends the additive nonlinear times series to nonlinear structural vector autoregressive models. An algorithm is developed to learn the contemporaneous and the lagged causal relationships of variables. Simulations demonstrate the effectiveness of the nroosed method.