期刊文献+

An Efficient Webpage Classification Algorithm Based on LSH

下载PDF
导出
摘要 With the explosive growth of Internet information, it is more and more important to fetch real-time and related information. And it puts forward higher requirement on the speed of webpage classification which is one of common methods to retrieve and manage information. To get a more efficient classifier, this paper proposes a webpage classification method based on locality sensitive hash function. In which, three innovative modules including building feature dictionary, mapping feature vectors to fingerprints using Localitysensitive hashing, and extending webpage features are contained. The compare results show that the proposed algorithm has better performance in lower time than the naive bayes one.
出处 《国际计算机前沿大会会议论文集》 2015年第1期73-75,共3页 International Conference of Pioneering Computer Scientists, Engineers and Educators(ICPCSEE)
  • 相关文献

参考文献2

二级参考文献15

  • 1贾泂,梁久祯.基于支持向量机的中文网页自动分类[J].计算机工程,2005,31(10):145-147. 被引量:12
  • 2苏金树,张博锋,徐昕.基于机器学习的文本分类技术研究进展[J].软件学报,2006,17(9):1848-1859. 被引量:387
  • 3Cooper W.Some Inconsistencies and Misnomers in Probabilistic Information Retrieval[C]//Proc of the Int'l ACM SIGIR Conf on Research and Development in Information Retrieval,1991:57-61.
  • 4Lewis D D.Naive (Bayes) at Forty:The Independence Assumption in Information Retrieval[C]//Proc of the 10th European Conf on Machine Learning,1998:4-18.
  • 5Joachims T.Learning to Classify Text Using Support Vector Machines[M].Kluwer,2002.
  • 6Zipf G K.Human Behavior and the Principle of Least Effort:an Introduction to Human Ecology[M].MA:Addison-Wesley,1949.
  • 7Cortes C,Vapnik V N.Support-Vector Networks[J].Machine Learning Journal,1995,20(3):273-297.
  • 8Vapnik V N.Estimation of Dependences Based on Empirical Data[M].Springer,1982.
  • 9Liu T Y,Yang Y,Wan H,et al.Support Vector Machines Classification with a Very Large-Scale Taxonomy[J].SIGKDD Explor Newsl,2005,7(1):36-43.
  • 10Vapnik.统计学习理论[M].张学工,译.北京:电子工业出版社,2004.

共引文献27

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部