期刊文献+

基于改进朴素贝叶斯算法的垃圾邮件过滤器的研究 被引量:27

Implementing Spam Filter by Improving Naive Bayesian Algorithm
下载PDF
导出
摘要 基于朴素贝叶斯算法的垃圾邮件过滤器是目前比较高效、经济的垃圾邮件过滤技术之一,它已经广泛应用到垃圾邮件过滤领域。文章在对朴素贝叶斯过滤器分析的基础上,针对朴素贝叶斯算法的缺陷结合损失最小化的思想,并根据垃圾邮件的特性对朴素贝叶斯算法做了改进,提出了改进朴素贝叶斯算法,该算法能够通过调整k值,降低合法邮件被错判为垃圾邮件的概率,从而最大程度减少用户的损失。 Our aim is to decrease the probability under which the spam filter misjudges legal e-mail as spam by adjusting the k value of the naive Bayesian algorithm,thus minimizing Internet users' economic loss.Section 1 of the full paper analyzes the classification deficiencies of the naive Bayesian algorithm.Section 2 implements the spam filter by improving the naive Bayesian algorithm through obtaining the k value as shown in eq.(8).Section 3 tested the spam filter by adjusting the k value of our improved Bayesian algorithm;the test results,presented in Table 2,and their comparison,given in Figs.1,2 and 3,show preliminarily that the spam filter that uses our improved Bayesian algorithm can increase the recall rate by 10% and the accuracy by 5%,thus effectively decreasing the probability of misjudging legal e-mails as spams.
出处 《西北工业大学学报》 EI CAS CSCD 北大核心 2010年第4期622-627,共6页 Journal of Northwestern Polytechnical University
关键词 概率 朴素贝叶斯 垃圾邮件过滤器 algorithms probability naive Bayesian algorithm spam filter
  • 相关文献

参考文献6

  • 1Zhang H. Exploring Conditions for the Optimality of Naive Bayes. International Journal of Pattern Recognition and Artificial Intelligence, 2005, 19(2) : 183 - 198.
  • 2Vangelis Metsis,Ion Androutsopoulos, Georgios Paliouras. Spam Filtering with Naive Bayes Which Naive Bayes? CEAS 2006 Third Conference on Email and AntiSpam, 2006.
  • 3Mehran Sahami, Susan Dumais, David Heckerman, Eric Horvitz. A Bayesian Approach to Filtering Junk E-Mail. AAAI Workshop, Madison, Wisconsin. 1998:55 - 62.
  • 4Johan Hovold. Naive Bayes Spare Filtering Using Word-Position-Based Attributes. 2nd Conference on Email and Anti-Spare, Stanford, CA, 2005.
  • 5Zhang I E, Zhu Jingbao, Yao Tianshun. An Evaluation of Statistical Spare Filtering Techniques. ACM Trans on Asian Language Information Processing, 2004, 3 (4) : 243 - 269.
  • 6Aris Kosmopoulos, Georglos Paliouras, Ion Androutsopoulos. Adaptive Spare Filtering Using Only Naive Bayes Text Classifiers. CEAS 2008 Fifth Conference on Email and AntiSpam, 2008, Mountain View, California USA.

同被引文献219

引证文献27

二级引证文献79

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部