
Research of Crawler's Search Strategy Based on Content Evaluation (基于内容评价的爬虫搜索策略研究)

Cited by: 4
Abstract: Harmful content on the Internet is growing and has become a serious social problem, which makes monitoring the Internet an urgent task. Web crawlers play an important role in information search. This paper therefore studies a content-based mechanism for evaluating the value of links, analyzes the specific factors that affect link value, and computes link values from those factors to guide the crawler's search. Experimental results show that the method helps the crawler discover target pages earlier.
Source: Microelectronics & Computer (《微电子学与计算机》), indexed in CSCD and the Peking University Core Journals list, 2008, No. 11, pp. 25-28 (4 pages)
Funding: National Natural Science Foundation of China (60673041)
Keywords: information security; content security; content evaluation; web crawler
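
The abstract describes computing a value for each link from content factors and using that value to order the crawler's search. The record does not list the factors or the formula, so the following is only a minimal sketch under stated assumptions: the two factors (anchor-text relevance and parent-page relevance), the weights, the keyword set, and all function names are hypothetical and are not taken from the paper. It runs on a small in-memory graph rather than the live Web.

```python
import heapq

# Hypothetical topic keywords and weights; the abstract does not specify them.
KEYWORDS = {"crawler", "security", "content"}
W_ANCHOR, W_PARENT = 0.6, 0.4


def relevance(text, keywords=KEYWORDS):
    """Fraction of topic keywords that occur in the text (case-insensitive)."""
    words = set(text.lower().split())
    return len(words & keywords) / len(keywords)


def link_value(anchor_text, parent_text):
    """Assumed scoring: weighted sum of anchor-text and parent-page relevance."""
    return W_ANCHOR * relevance(anchor_text) + W_PARENT * relevance(parent_text)


def best_first_crawl(pages, seed, limit=10):
    """Visit pages in descending order of estimated link value.

    `pages` maps url -> (page_text, [(anchor_text, target_url), ...]).
    A link is scored once, when its target is first discovered.
    """
    frontier = [(-1.0, seed)]  # heapq is a min-heap, so store negated values
    seen = {seed}
    order = []
    while frontier and len(order) < limit:
        _, url = heapq.heappop(frontier)
        order.append(url)
        text, links = pages.get(url, ("", []))
        for anchor, target in links:
            if target not in seen:
                seen.add(target)
                heapq.heappush(frontier, (-link_value(anchor, text), target))
    return order


# Toy link graph standing in for fetched pages.
PAGES = {
    "seed": ("news portal", [("sports scores", "a"), ("content security report", "b")]),
    "a": ("football results", []),
    "b": ("web crawler and content security", [("crawler design", "c")]),
    "c": ("crawler internals", []),
}

if __name__ == "__main__":
    print(best_first_crawl(PAGES, "seed"))
```

On this toy graph the crawler visits `b` and then `c` before `a`, because their links score higher on the assumed factors. A priority queue is used so that the highest-valued unvisited link is always expanded next, which is one common way to realize value-guided (best-first) crawling.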
