Abstract
Apriori is a classic algorithm for mining frequent itemsets in association rule mining. Building on Apriori, this paper proposes an SQL-based Apriori algorithm that exploits the capabilities of relational database management systems and the simplicity and efficiency of SQL. The algorithm is simple and fast to implement, and it effectively reduces the size of the database that must be scanned. It is applied to a Web log file database that has undergone data preprocessing, and the experimental results show that the algorithm is effective.
Source
《计算机工程与设计》
CSCD
Peking University Core Journals (PKU Core)
2006, Issue 23, pp. 4494-4497 (4 pages)
Computer Engineering and Design
Keywords
data mining
association rule
frequent itemsets
structured query language (SQL)
data preprocessing
Web log file