摘要
运行在服务器集群的软件系统需要Web日志的大规模数据集以满足性能测试的需求,但现有仿真生成算法因模型单一而无法满足要求。针对此问题,提出一种基于alpha稳态过程的多分形Web日志的仿真生成算法。首先,在长相关尺度(LRD)下采用alpha稳态过程来描述Web日志的自相似性;其次,在短相关尺度(RSD)下采用二项式b模型描述Web日志的多重分形性;最后,将长相关模型和短相关模型融合于改进的ON/OFF框架中。与单一的模型相比,新算法的参数物理意义明确,具有良好的自相似性和多分形性。实验结果表明,该算法能够较准确地模拟真实Web日志,可以有效地应用于Web日志大规模数据集的仿真生成。
The software system running on the server cluster needs large-scale data sets of Web log to meet the performance test requirement, but the existing simulation generation algorithm cannot meet the requirements due to the single model. Aiming at this problem, a new muhi-fractal Web log simulation generation algorithm based on alpha stable process was proposed. Firstly, the self-similarity of Web log was described by alpha stable process in Long Range Dependence ( LRD). Secondly, the multi-fractal of Web log was described by binomial-b model in Short Range Dependence (SRD). Finally, the model of long range dependence and the model of short range dependence were integrated into the improved ON/OFF framework. Compared with the single model, the parameters of the proposed algorithm has clear physical meaning equipped with good performance of self-similarity and multi-fractal. The experimental results show that the proposed algorithm can accurately simulate the real Web log and be effectively applied in Web log simulation generation with large-scale data sets.
作者
彭行雄
肖如良
PENG Xingxiong XIAO Ruliang(Faculty of Software, Fujian Normal University, Fuzhou Fujian 350117, China Fujian Provincial Engineering Research Center of Public Service Big Data Analysis and Application, Fuzhou Fujian 350117, China)
出处
《计算机应用》
CSCD
北大核心
2017年第2期587-592,共6页
journal of Computer Applications
基金
福建省高校产学合作项目(2016H6007)~~
关键词
稳态过程
多重分形
自相似
时间序列
日志分析
仿真生成
stable process
muhi-fractal
self-similarity
time series
log analysis
simulation generation