Refining Web Authoritative Resource by Frequent Structures

Cited by: 1
Abstract: The Web is a rich collection of dynamic information that is useful in many disciplines. Much research has aimed at improving the quality of information retrieval on the Web, yet most of it still cannot satisfy the diverse demands of users. This paper exploits the hyperlinks of the Web and proposes a new approach, ESFP, to improve the quality of the search results returned by search engines. The essential idea is to mine the frequent link structures in a given Web topology. ESFP is built on the SFP algorithm, which is used to extract authoritative pages and communities from the complex Web topology. Several experiments demonstrate the approach and show that ESFP manages search results better than other known methods such as HITS.
Source: Journal of Computer Research and Development (indexed in EI, CSCD, and the Peking University Core Journals list), 2004, No. 10, pp. 1614-1620 (7 pages)
Funding: Key Program of the National Natural Science Foundation of China (69933010, 60303008); National High Technology Research and Development Program of China (863 Program) (2002AA4Z3430, 2002AA231041)
Keywords: authoritative community; frequent structure; frequent pattern; data mining; search engine

References (21)

  • 1. S Chakrabarti, et al. Mining the Web's link structure. IEEE Computer, 1999, 32(8): 60-67
  • 2. J Kleinberg, et al. Applications of linear algebra in information retrieval and hypertext analysis. In: Proc of PODS'99. New York: ACM Press, 1999. 185-193
  • 3. S Brin, et al. The anatomy of a large-scale hypertextual Web search engine. WWW'98, Brisbane, Australia, 1998
  • 4. J Kleinberg. Authoritative sources in a hyperlinked environment. Journal of the ACM, 1999, 46(5): 604-632
  • 5. Q Yuan, et al. Extract frequent pattern from simple graph data. In: Proc of WAIM'2002, LNCS 2419. Berlin: Springer, 2002. 158-169
  • 6. R Agrawal, et al. Fast algorithms for mining association rules in large databases. In: Proc of the 20th VLDB Conf. San Francisco: Morgan Kaufmann, 1994. 487-499
  • 7. D Florescu, et al. Database techniques for the World-Wide Web: A survey. ACM SIGMOD Record, 1998, 27(3): 59-74
  • 8. Lou Yubo, Ma Jian, Zhou Haofeng, Yuan Qingqing, Shi Baile. Mining Web authoritative resources based on frequent links [J]. Journal of Computer Research and Development, 2003, 40(7): 1095-1103. Cited by: 6
  • 9. D Gibson, et al. Inferring Web communities from link topology. The 9th ACM Conf on Hypertext and Hypermedia, Pittsburgh, 1998
  • 10. R Kumar, et al. Trawling the Web for emerging cyber-communities. WWW'99, Toronto, Canada, 1999

Second-level references (19)

  • 1. R Agrawal, R Srikant. Fast algorithms for mining association rules. The 1994 Int'l Conf on Very Large Data Bases (VLDB'94), Santiago, Chile, 1994
  • 2. S Brin, R Motwani, J D Ullman, S Tsur. Dynamic itemset counting and implication rules for market basket data. ACM SIGMOD Conf, Tucson, 1997
  • 3. J Han, J Pei, Y Yin. Mining frequent patterns without candidate generation. ACM SIGMOD Conf, Dallas, Texas, 2000
  • 4. G Salton. Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer. Reading, MA: Addison-Wesley, 1989
  • 5. E Voorhees, N Gupta, B Johnson-Laird. Learning collection fusion strategies. ACM SIGIR Conf, Seattle, 1995
  • 6. J Kleinberg. Authoritative sources in a hyperlinked environment. In: Proc of the 9th Annual ACM-SIAM Symposium on Discrete Algorithms. New York: ACM Press, 1998. 668-677
  • 7. P K Reddy, M Kitsuregawa. Inferring Web communities through relaxed cocitation and dense bipartite graphs. 2001. http://www.tkl.iis.u-tokyo.ac.jp/Kilab/Research/Paper/2001/reddy/6a6.pdf
  • 8. D Florescu, A Levy, A Mendelzon. Database techniques for the World-Wide Web: A survey. ACM SIGMOD Record, 1998, 27(3): 59-74
  • 9. J Cho, N Shivakumar, H Garcia-Molina. Finding replicated Web collections. The 2000 ACM SIGMOD Conf on Management of Data, Dallas, 2000
  • 10. K Wang, H Liu. Discovering typical structures of documents: A road map approach. The ACM SIGIR Conf on Research and Development in Information Retrieval, Melbourne, 1998

Co-citing literature (5)

Co-cited literature (4)

  • 1. G Almpanidis, C Kotropoulos, I Pitas. Combining text and link analysis for focused crawling: An application for vertical search engines. Information Systems, 2007, 32(6): 886-908
  • 2. M Najork, J Wiener. Breadth-first search crawling yields high-quality pages. In: Proc of the 10th International World Wide Web Conference, Hong Kong, May 2001
  • 3. T Tang, D Hawking, N Craswell, et al. Focused crawling for both topical relevance and quality of medical information [C]. Bremen: Proc of CIKM 2005, 2005: 582-586
  • 4. Ma Liang, Chen Qunxiu, Wang Jun, Xu Guowei. Design of IRobot, an intelligent Web system for collecting Chinese topical information [J]. Journal of Chinese Information Processing, 2002, 16(5): 23-29. Cited by: 7

Citing literature (1)

Second-level citing literature (2)
