摘要
针对页面中的大量动态链接,提出了模拟浏览器的解析方式进行页面链接的提取,并设计实现了基于JaveScript等多链接分析的主题爬虫系统.
In this article, for the large amount of dynamic linking in the page, the analytical simulation of the browser has been proposed to carry out the extraction of page links. It also designes and implementes topic crawler based on Jave.
出处
《许昌学院学报》
CAS
2010年第2期87-90,共4页
Journal of Xuchang University
关键词
主题爬虫
链接分析
相关度
topic crawler
link analysis
correlation