期刊文献+

Web搜索中的数据挖掘技术研究 被引量:4

A Research of Data Mining Technologies on Web Search
下载PDF
导出
摘要 WWW已经成为世界上是大的分布式信息系统,如何快速有效地搜索用户所需的资源一直是研究热点。Web挖掘也已经成为数据挖掘中相对成熟的一个分支。本文针对Web资源搜索中利用的相关Web挖掘技术做一个综述。文章首先对目前流行的Web内容挖掘方面的常用技术进行了研究分析,然后着重研究了Web结构挖掘技术,介绍并评价了多种算法模型。接着介绍了用户使用的挖掘,并提出了Web内容挖掘技术,结构挖掘技术和用户使用挖掘相结合,应用于开发智能型搜索引擎的趋势。 WWW is now the largest distributed information system in the world, and how to find useful information is always a hot topic for researchers. Web mining has become an important branch of data mining. This paper mainly discusses mining technologies used in Web searching. The paper begins with talking about popular technologies in Web content mining, and then focuses on algorithms and models on Web structure mining. Then Web usage mining is briefly discussed. In the end the author advances that the technologies in Web content mining, Web structure mining and Web usage mining will be combined to develop intelligent search engines.
出处 《计算机科学》 CSCD 北大核心 2005年第4期37-41,共5页 Computer Science
  • 相关文献

参考文献22

  • 1Arasu A, Novak J, Tomkins A, Tomlin J. PageRank Computation and the Structure of the Web: Experiments and Algorithms.In: 11th Intl. World Wide WEB Conf. 2002
  • 2Bharat B, Henzinger M R. Improved algorithms for topic distillation in a hyperlinked environment. In: ACM Conf. on Research and Develop. in Info. Retrieval(SIGIR'98), 1998
  • 3Bollen J, Heylighen F. A system to restructure hypertext networks into valid user models. In:The New Review of Hypermedia and Multimedia, 1998
  • 4Brin S, Page L. The anatomy of a large-scale hypertextual web search engine. In:Proc. of the 7th World-Wide Web Conf.(WWW7), 1998
  • 5Chakrabarti S. Data mining for hypertext: A tutorial survey.ACM SIGKDD, Jan 2000,1 (2)
  • 6Chakrabarti S, Dom B, Kumar S R, et al. Mning the Web's Link Structure. In 1999 IEEE, 1999. 60-67
  • 7Chakrabarti S, Dom B, Gibson D, et al. Mining the Link Structure of the World Wide Web. IEEE Computer, 1999
  • 8Chakrabarti S, Dom B, Gibson D, et al. Automatic resource compilation by analyzing hyperlink structure and associated text.Computer Network and ISDN Systems, 1998
  • 9Chekuri C, Goldwasser M H, Raghavan P, Upfal E. Web Search Using Automatic Classification. In:Proc. of WWW-96, 6th Intl.Conf. on the World Wide Web, 1996
  • 10Chen Z, Lin F, Liu H, et al. User Intention Modeling in Web Applications Using Data Mining. In:Internet and Web Information Systems, 2002.181~191

二级参考文献22

  • 1Page L, Brin S, Motwani R, Winograd T. The PageRank Citation Ranking : Bringing Order to the WEB. Jan 1998 and July 2001 at http://www. db. stanford. edu/-backub/PageRanksub. ps.
  • 2Brin S,Page L. The anatomy of a large-scale hypertextual WEB search engine, In: Proc of the Seventh Intl World Wide WEB Conf. 1998.
  • 3Richardson M,Domingos P. The Intelligent Surfer: Probabilistic Combination of Link and Content Information in PageRank, volume 14. MIT Press, Cambridge, MA, 2002.
  • 4Haveliwala T H. Topic-Sensitive PageRank. In:Proc of the Eleventh Intl World Wide WEB Conf. 2002.
  • 5Kleinberg J. Authoritative sources in a hyperlinked environmerit. In.. Proc 9th ACM-SIAM Symposium on Discrete Algorithms, 1998. Extended version in Journal of the ACM 46(1999). Also appears as IBM Research Report RJ 10076, May 1997.
  • 6Chakrabarti S,et al. Hypersearching the WEB. Scientific American. June 1999.
  • 7Henzinger M R,Bharat K. Improved algorithms for topic distillation in a hyperlinked environment. In:Proc of the 21'st Intl ACMSIGIR Conf on Research and Development in IR, Aug. 1998.
  • 8Lempel R,Moran S. The Stochastic Approach for Link-Structure Analysis (SALSA) and the TKC Effect. In:Porc 9 th Intl WorldWide WEB Conf. 2000.
  • 9Chakrabarti S, et al. Mining the WEB's link structure. IEEE Computer, Aug. 1999.
  • 10Chakrabarti S,et al. Automatic resource compilation by analyzing hyperlink structure and associated text. In:Proc 7th Intl WWW Conf. 1998.

共引文献19

同被引文献17

引证文献4

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部