期刊文献+

一种基于文本内容的HITS改进算法 被引量:5

An Improved HITS Based on Text
下载PDF
导出
摘要 HITS算法是WEB结构挖掘中一种经典的链接分析算法,其主要问题是容易发生主题漂移。针对这一问题,提出了一种基于文本内容和链接分析相结合的改进算法。实验证明改进后的算法提高了查询结果的相关度,减少了主题漂移的发生。
作者 郭鸿
出处 《计算机系统应用》 2009年第9期38-40,131,共4页 Computer Systems & Applications
基金 广西青年科学基金(桂科青0832101)
  • 相关文献

参考文献8

  • 1王晓宇,周傲英.万维网的链接结构分析及其应用综述[J].软件学报,2003,14(10):1768-1780. 被引量:61
  • 2倪现君.结构挖掘中web有向图模型的改进算法[J].微计算机信息,2007,23(36):163-165. 被引量:5
  • 3Chakrabarti S, Dom B, Raghavan P, et al. Automatic resource compilation by analyzing hyperlink structure and associated text. Compute Networksand ISDN Systems, April, 1998,30(1-7).
  • 4Gevrey J, Ruger S. Link-based approaches for text retrieval. Proceedings of TREC-10, NIST. NIST Special Publication, 2002.
  • 5Xingw, Ghorbania.Weighted PageRank Algorithm. Proceedings of the Second Conference on Commu- nication Networks and Services Research, 2004:305- 314.
  • 6Kosala R, Blockeel H. Web Mining Research: A Survey. ACMSIGKDD, 2007:40 - 43.
  • 7Mizuuchi Y. Finding Context Paths for Web Pages. Proc. ofACM Hypertext, 1999:13 - 22.
  • 8Borodin A, Roberts GO, Rosenthal JS, et al. Finding Authorities and Hubs Form Link Structures on the Word Wide Web. In Web, Hong Kong, China, May 2001.

二级参考文献47

  • 1彭曙蓉,王耀南.针对小文本的Web数据挖掘技术及其应用[J].微计算机信息,2006,22(07X):203-205. 被引量:10
  • 2Ding J, Gravano L, Shivakumar N. Computing geographical scopes of Web resources. In: Amr A, et al., eds. Proceedings of the 26th International Conference on Very Large Data Bases. Cairo: Morgan Kaufmann Publishers, 2000. 545-556.
  • 3Bar-Yossef Z. Approximating aggregate queries about Web pages via random walks. In: Amr A, et al., eds. Proceedings of the 26th International Conference on Very Large Data Bases. Cairo: Morgan Kanfmann Publishers, 2000. 535-544.
  • 4Larson R. Bibliometrics of the World Wide Web: An exploratory analysis of the intellectual stTucture of cyberspace. In: Hans-Peter F, et al., eds. Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. Zurich: ACM Press, 1996. 85-92.
  • 5Botafago A. Cluster analysis for hypertext systems. In: Robert K, et al., eds. Proceedings of the 16th Annual ACM SIGIR Conference on Research and Development in Information Retrieval. Pittsburgh: ACM Press, 1993. 116-125.
  • 6Mukherjea S. WTMS: A system for collecting and analyzing topic-specified web information. In: Albert V, et al., eds. Proceedings of the 9th ACM-WWW International Conference. Amsterdam: ACM Press, 2000. 457--471.
  • 7Kumar R, Raghavan P, Rajagopalan S, Sivakumar D, Tomkins A, Upfal E. The Web as a graph. In: Serge A, ed. Proceedings of the 18th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems. Pennsylvania: ACM Press, 1999.109-118.
  • 8Carriere J, Kazman R. WebQuery: Searching and visualizing the Web through connectivity. Computer Networks and ISDN Systems, 1997,29(8-13): 1257-1267.
  • 9Chakrabarti S, Dora B, Indyk P. Enhanced hypertext classification using hyperlinks. In: Laura H, ed. Proceedings of the ACM SIGMOD International Conference on Management of Data. Washington: ACM Press, 1998. 307-318.
  • 10Spertus E. ParaSite: Mining strctural information on the Web. Computer Networks and ISDN Systems, 1997,29(8-13):1205-1215.

共引文献63

同被引文献35

引证文献5

二级引证文献22

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部