一种无冗余的Web日志挖掘算法

A New Web Log Mining Algorithm without Redundancy

下载PDF

导出

摘要 Web日志挖掘是Web数据挖掘的一个重要研究领域。Web日志挖掘通过发现Web日志中用户的访问规律和模式,可以提取出其中潜在的规律和信息,人们对这个领域的研究也日益重视。然而,传统的基于关联规则的Web日志挖掘算法都是基于所有关联规则的。这种方式往往挖掘产生大量的候选规则,而且存在大量冗余的规则。提出了一种新的无冗余的Web日志挖掘算法,该算法通过引入频繁闭项集合最小关联规则的概念,从而解决了以往基于所有关联规则挖掘算法中出现的上述问题。 Web log mining is one of the important research areas in Web data mining,through which the access law and mode of the user could be found. More attention is also paid to the Web log mining. However, the traditional Web log mining algorithm are all based on all of the association rules. And the traditional way often produces not only large amounts of the candidate frequent itemsets, but also lots of redundant rules. This paper puts forward a new Web log mining algorithm without redundancy,and the concept of frequent closed itemsets and minimal association rules are proposed to solve the problems appeared in mining Web log based on all frequent itemset association rules.

作者秦东霞姚遥

机构地区周口师范学院计算机科学与技术学院周口师范学院物理与电子工程系

出处《智能计算机与应用》 2012年第1期31-34,共4页 Intelligent Computer and Applications

基金周口师范学院青年科研基金资助项目(zknuqn201031A)

关键词 WEB日志挖掘闭频繁项集格结构最小关联规则 Web Log Mining Frequent Closed Itemsets Lattice Minimal Association Rules

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献2

1周增国,庞有军.Cookie技术在Web日志挖掘预处理中的应用[J].大连大学学报,2006,27(2):59-62. 被引量：4
2朱玉全,宋余庆.频繁闭项目集挖掘算法研究[J].计算机研究与发展,2007,44(7):1177-1183. 被引量：10

二级参考文献20

1靳风荣,郑雪峰.Web日志挖掘的预处理过程及算法[J].微型电脑应用,2004,20(6):44-45. 被引量：5
2[1]COOLEY R,MOBASHER B,SRIVASTAVA J.Data Preparation for Mining World Wide Web Browing Patterns[J].Journal of Knowledge and Information System,1999.
3[2]WUKLYUPS,BALLMAN A.Speed Tracer:A Web Usage Mining and Analysis tool[J].IBM System Journal,1998(1):44-48.
4[3]PEI J,HAN J,MORTAZAVI-ASL B,et al.Mining Access Pattern efficiently from Web logs[C]//Proc.2000 Pacific-Asia Conf.on Knowledge Discovery and Data Mining,Japan,Kyoto,2000.
5[4]COOLEY R,MOBASHER B,SRIVASTAVA J.Web mining:Information and Pattern discovery on the World Wide Web[J].Proc.IEEE Intl.Conf.Tools with AI,Dec.1997,2(16):25-28.
6R Agrawal,R Srikant.Fast algorithm for mining association rules[C].The 20th Int'l Conf on VLDB,Santiago,Chile,1994.
7Liu Pei-Qi,Li Zeng-Zhi,Zhao Yin-Liang.Effective algorithm of mining frequent itemsets for association rules[C].In:Proc of the 3rd Int'l Conf on Machine Learning and Cybernetics.Piseataway,NJ:IEEE Press,2004.1447-1451.
8Chang Chin-Chen,Li Yu-Chiang,Lee Jung-San.An efficient algorithm for incremental mining of association rules[C].In:Proc of 15th Int'l Workshop on Research Issues in Data Engineering:Stream Data Mining and Applications.Piscataway.NJ:IEEE Press,2005.3-10.
9Xu Yong,Zhou Sen-Xin.Research on the distributed treatment of frequent itemsets extraction based on pruned concept lattices[C].In:Proc of the 5th Int'l Conf on Machine Learning and Cybernetics.Piscataway,NJ:IEEE Press,2006.1332-1336.
10Dao-I Lin,Z M Kedem.Pincer.Search:A new algorithm for discovering the maximum frequent set[C].In:H J Schek,F Saltor,I Ramos,et al.eds.Proc of the 6th European Conf on Extending Database Technology.Heidelberg:Springer,1998.105-119.

共引文献11

1朱玉全,吕晓,陈耿.频繁闭项目集更新算法[J].江苏大学学报（自然科学版）,2008,29(4):335-338.
2应维云,覃正,李秀.Web站点综合分析挖掘系统框架研究[J].情报杂志,2008,27(7):3-5.
3任永功,张亮,付玉,吕君义.基于FC-tree的频繁闭项目集挖掘算法[J].计算机科学,2008,35(9):149-152. 被引量：1
4任永功,张亮,付玉.一种基于频繁模式树的最大频繁项目集挖掘算法[J].小型微型计算机系统,2010,31(2):317-321. 被引量：6
5张炘,廖频,郭波.一种挖掘频繁闭项集的深度优先算法[J].计算机应用,2010,30(3):806-809. 被引量：2
6王璇.基于关联图的频繁闭模式挖掘[J].辽东学院学报（自然科学版）,2011,18(2):154-158. 被引量：2
7秦东霞,周航,张栋梁,吴文欢.基于频繁闭项集的Web日志挖掘算法[J].周口师范学院学报,2012,29(2):97-100.
8李爱国.基于Cookie的购物车设计与实现[J].信息技术,2013,37(6):60-62. 被引量：2
9王敬华,刘建银,张国燕,赵新想.情感语音合成中韵律参数的基频研究[J].小型微型计算机系统,2013,34(9):2047-2050. 被引量：2
10方刚,王佳乐,应宏,汤小斌.基于粒度计算的频繁闭项目集挖掘[J].计算机工程与应用,2014,50(20):130-134. 被引量：1

1秦东霞,周航,张栋梁,吴文欢.基于频繁闭项集的Web日志挖掘算法[J].周口师范学院学报,2012,29(2):97-100.
2孟军,王蓬,张静,王秀坤.基于项集依赖的最小关联规则挖掘[J].计算机科学,2013,40(1):183-186. 被引量：10
3赵妍,逄玉俊,文东丽.从样本数据中提取模糊规则的算法研究[J].石油化工高等学校学报,2004,17(3):83-88. 被引量：4
4胡晓欧.词性分析C4.5算法的候选属性规则优化[J].科技通报,2016,32(7):172-175.
5高中文,余飞.中国国家企业信息网决策支持系统设计[J].自动化技术与应用,2010,29(2):28-30.
6孙文乾.数据挖掘在提高web用户网络访问速度上的研究[J].计算机光盘软件与应用,2010(3):47-47.
7郭涵阳,高曼如,沈良忠.Moodle平台师生访问行为日志统计与挖掘研究[J].计算机技术与发展,2016,26(11):168-171. 被引量：7
8李云,蔡俊杰,刘宗田,陈崚,李拓.利用量化规则格分布获取关联规则(英文)[J].郑州大学学报（理学版）,2007,39(2):83-87.
9王金城,王晓琳,庞古风.关联规则挖掘算法及其在冷轧生产中的应用[J].清华大学学报（自然科学版）,2007,47(z2):1761-1765. 被引量：3
10杨昆,秦拯.一种报文二层预处理策略在高速NIDS上的应用[J].东莞理工学院学报,2009,16(3):67-72.

智能计算机与应用

2012年第1期

浏览历史

内容加载中请稍等...

一种无冗余的Web日志挖掘算法

参考文献2

二级参考文献20

共引文献11

相关作者

相关机构

相关主题

浏览历史