期刊文献+

基于可信推断的流数据序列模式分析算法

New Sequential Pattern Analysis Algorithm for Data Stream
下载PDF
导出
摘要 序列模式在基因分析、金融预测等方面有着重要的应用,是数据挖掘的一个主要分支.鉴于数据流应用的日益增多,本文在研究传统序列模式挖掘算法的基础上,提出了一种基于可扩展滑动窗口和贝叶斯概率过滤的面向数据流的序列模式挖掘算法(BM SP-DS算法),目的是简化序列模式发现的中间结果,提高挖掘效率,以便在小的存储空间和低的运算时间内快速发现流数据的频繁序列模式,同时算法也减少了因主观支持度取值不当对模式发现造成的负面影响.实验结果表明,该算法是可行、较优的. Although mining sequential pattern is becoming increasing essential m many scientific and commerciat domains, it is challenging to extend it to data stream. In this paper, we present a new efficient BMSP-DS algorithm of sequential patterns mining for data stream, which based on extendable sliding window and Bayesian probability filtration. This algorithm can reduce temp data in mining process by eliminating low probability sequence candidates, and quicken frequent sequential patterns mining in limited time and restricted space. Finally, the experiment result demonstrates the algorithm is effective.
作者 赵峰 李庆华
出处 《小型微型计算机系统》 CSCD 北大核心 2006年第7期1292-1295,共4页 Journal of Chinese Computer Systems
基金 国家自然科学基金项目(60273075)资助.
关键词 数据流 序列模式 滑动窗口 贝叶斯概率 data stream sequential patterns sliding window bayesian probability
  • 相关文献

参考文献8

  • 1Babcock B,Babu S,Datar M,et al.Models and issues in data stream systems[C].In:Proc.2002 ACM Symp.Principles of Database Systems(PODS'02),pages 1-16,Madison,WI,June 2002.
  • 2Chen Y,Dong G,Han J,et al.Multi-dimensional regression analysis of time-series data streams[C].In:Proceedings of the 28th International Conference on Very Large Data Bases,pages 323-334,August 2002.
  • 3Guha S,Koudas N.Approximating a data stream for querying and estimation:algorithms and performance evaluation.In:Proceedings of the 16th ICDE Conference,2002.
  • 4Datar M,Gionis A,Ndyk P,et al.Maintaining stream statistics over sliding window[C].In:ACM-SIAM Symposium on Discrete Algorithms(SODA),2002.
  • 5Lin Qiao,Agrawal D,El Abbadi A.Supporting sliding window queries for continuous data streams[C].In:Conference on Scientific and Statistical Database Management,2003,15th International,9-11 July 2003.
  • 6Agrawal R,Srikant R.Mining sequential patterns[C].In:Proc.1995 Int.Conf.Data Engineer(ICDE'95),3-14,Taipei,Taiwan,Mar.1995.
  • 7Wei-Guang Teng,Ming-Syan Chen,Philip S Yu.A regression-based temporal pattern mining scheme for data streams[C].In:Proceedings of the 29th VLDB Conference,2003.
  • 8Heckeman D.Bayesian network for data mining[J].Data Mining and Knowledge Discovery,1997,3(1):79-119

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部