使用频繁结构提炼网络权威资源被引量：1

Refining Web Authoritative Resource by Frequent Structures

下载PDF

导出

摘要在网络资源中有丰富的、对于许多应用领域有用的动态信息 ,已有许多的研究工作致力于提高网络中信息检索的质量 ,然而 ,这些工作中的大部分仍不能满足用户形形色色的请求利用网络中的超链接提出新的算法ESFP来改善从搜索引擎返回的搜索结果的质量运用SFP算法构造ESFP算法 ,完成从复杂的网络拓扑结构中提取权威的页面和社团通过运行若干个实验来描述所提出的算法 ,这些实验数据表明。 The web resource is a rich collection of the dynamic information that is useful in various disciplines There has also been much research work related to improving the quality of information searching in the web However, most of the work is still inadequate to satisfy a diversified demand from users In this paper, the hyperlinks in the web are exploited and a new approach called ESFP is proposed in order to improve the quality of research results obtain from search engines The essential idea of approach is to mine the frequent structures of links from a given web topology By using the SFP algorithm, the authoritative pages and communities are extracted from the complex web topology The approach proposed is demonstrated by running several experiments and it is shown that the functionalities of using the ESFP in managing search results are better than other known methods such as HITS

作者周敏子周皓峰王晨汪卫施伯乐

机构地区复旦大学计算机与信息技术系

出处《计算机研究与发展》 EI CSCD 北大核心 2004年第10期1614-1620,共7页 Journal of Computer Research and Development

基金国家自然科学基金重点项目 ( 6993 3 0 10 60 3 0 3 0 0 8) 国家"八六三"高技术研究发展计划基金项目 ( 2 0 0 2AA4Z3 43 0 2 0 0 2AA2 3 10 41)

关键词权威社团频繁结构频繁模式数据挖掘搜索引擎 authoritative community frequent structure frequent pattern data mining search engine

分类号 TP311.13 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献21

1S Chakrabarti, et al. Mining the Web' s link structure. IEEE Computer, 1999, 32(8): 60-67
2J Kleinberg, et al. Applications of linear algebra in information retrieval and hypertext analysis. In: Proc of PODS' 99. New York: ACM Pree, 1999. 185～193
3S Brin, et al. The anatomy of a large-scale hypertextual Web search engine. WWW'98, Brisbane, Australia, 1998
4J Kleinberg. Authoritative sources in a hyperlinked environment.Journal of the ACM, 1999, 46(5): 604～632
5Q Yuan, et al. Extract frequent pattern from simple graph data.In: Proc of WAIM'2002, LNCS 2419. Berlin: Springer, 2002.158～ 169
6R agrawal, et al. Fast algorithms for mining association rules in large databases. In: Proc of the 20th VLDB Conf. San Francisco:Morgan Kaufmann, 1994. 487～499
7D Florescu, et al. Database techniques for the World-Wide Web:A survey. ACM SIGMOD Record, 1998, 27(3): 59～74
8楼宇波,马坚,周皓峰,袁晴晴,施伯乐.基于频繁链接的Web权威资源挖掘[J].计算机研究与发展,2003,40(7):1095-1103. 被引量：6
9D Gibson, et al. Inferring Web communities from link topology.The 9th ACM Conf on Hypertext and Hypermedia, Pittsburgh,1998
10R Kumar, et al. Trawling the Web for emerging cybercommunities. WWW'99. Toronto, Canada, 1999

二级参考文献19

1R Agrawal, R Srikant. Fast algorithms for mining association rules. The 1994 Int'l Conf Very Large Data Bases(VLDB'94),Santiago, Chile, 1994.
2S Brin, R Motwani, J D Ullman, S Tsur. Dynamic itemset counting and implication rules for market basket data. ACM SIGMOD Conf, Tucson, 1997.
3J Han, J Pei, Y Yin. Mining frequent patterns without candidate generation. ACM SIGMOD Conf, Dallas, Texas, 2000.
4G Slaton. Automatic Text Processing: The Transformation,Analysis, and Retrieval of Information by Computer. Reading,MA: Addison Wesley, 1989.
5E Voorhees, N gupta, B Johnson-Laird. Learning collection fusion strategies. ACM SIGIR Conf, Seattle, 1995.
6J Kleinberg. Authoritative sources in a hyperlinked environment.In: Proc of the 9th Annual ACM-SIAM Symposium on Discrete Algorithms. New York: ACM Press, 1998. 668--677.
7P K Reddy, M Kitsuregawa. Inferring Web communities through relaxed cocitation and dense bipartite graphs. 2001. http: //www. tkl. iis. u-tokyo, ac. jp/Kilab/Research/Paper/2001/reddy/6a6.pdf.
8D Florescu, A Levy, A Mendelzon. Database techniques for the World-Wide Web: A survey. ACM SIGMOD Record, 1998, 27(3): 59--74.
9J Cho, N Shivakumar, H Garcia-Molina. Finding replicated Web collections. The 2000 ACM SIGMOD on Managenment of Data,Dallas, 2000.
10K Wang, H Liu. Discovering typical structures of documents: A road map approach. The ACM SIGIR Conf on Research and Development in Information Retrieval, Melbourrne, 1998.

共引文献5

1王艳辉,吴斌,王柏.频繁子图挖掘算法综述[J].计算机科学,2005,32(10):193-196. 被引量：12
2董德民,何钦铭.面向电子商务的Web挖掘技术及其应用研究[J].计算机工程与设计,2006,27(1):95-98. 被引量：3
3赵宝华.基于Web挖掘的远程教育课件访问模式分析系统[J].计算机应用与软件,2009,26(3):149-152. 被引量：2
4徐慧,陶宏.电子商务中的智能挖掘技术及其应用研究[J].漯河职业技术学院学报,2009,8(5):54-55.
5陆慧琳,黄博.基于双索引的子图查询算法[J].计算机工程,2015,41(1):44-48. 被引量：2

同被引文献4

1AlmPanidis,G,KotroPoulos,C.,and Pitas.I.Combining Text and Link Analysis for Focused Crawhng-an Application for Vertical Search Engines.Information System.Vol.32(6),2007,886-908.
2M.NajorkJ.Wiener.Breadth-First Search Crawling Yields High-Quality Pages.In Proceedings of the 1oht Interactional World Wide Web Conference,Hong Kong May 2001.
3Tang T,Hawking D,Craswell N,et al.Focused crawling for both Topical relevance and quality of medical information[C].Bremen:Proceedings of CIKM2005,2005:582-586.
4马亮,陈群秀,王俊,徐国伟.智能Web中文主题信息收集系统IRobot的设计[J].中文信息学报,2002,16(5):23-29. 被引量：7

引证文献1

1曾水香,罗林波.基于改进Hits算法的多主题爬虫研究与实现[J].福建电脑,2010,26(5):88-89. 被引量：2

二级引证文献2

1陈卓民.基于HITS算法改进的Web数据挖掘方法研究应用[J].自动化与仪器仪表,2016(7):255-257. 被引量：1
2陆海丹,曹春萍,臧劲松.移动垂直搜索引擎在移动医疗中的应用研究[J].计算机应用与软件,2013,30(5):20-21. 被引量：2

1赵晓蓉,周锦程,王丹.基于频繁结构的Deep Web查询接口集成[J].科学技术与工程,2014,22(18):81-88.
2ChenWang Ming-ShengHong WeiWang Bai-LeShi.Chopper：有效的树挖掘算法[J].Journal of Computer Science & Technology,2004,19(C00):73-73.
3杨厚群,何中市,雷景生.基于划分的XML文档聚类研究[J].计算机科学,2008,35(3):183-185. 被引量：4
4楼宇波,马坚,周皓峰,袁晴晴,施伯乐.Web权威资源挖掘的一种有效方法[J].计算机工程,2003,29(z1):50-51.
5楼宇波,马坚,周皓峰,袁晴晴,施伯乐.基于频繁链接的Web权威资源挖掘[J].计算机研究与发展,2003,40(7):1095-1103. 被引量：6
6傅珊珊,吴扬扬.基于频繁结构的XML文档聚类[J].计算机工程与应用,2008,44(9):135-138. 被引量：1
7欧姆龙STIF发布机械和过程安全指南[J].变频器世界,2011(1):30-30.
8任薇,周杨.FSM——基于子图同构和结构同构的频繁子图挖掘算法(英文)[J].西南大学学报（自然科学版）,2008,30(6):158-163. 被引量：2
9沙金,纪宁,陈立松.FSP:一种基于图论的频繁结构模式挖掘算法[J].微电子学与计算机,2007,24(2):93-95.
10李斌,谭立湘,邹谊,庄镇泉.量子概率编码遗传算法及其应用[J].电子与信息学报,2005,27(5):805-810. 被引量：19

计算机研究与发展

2004年第10期

浏览历史

内容加载中请稍等...

使用频繁结构提炼网络权威资源被引量：1

参考文献21

二级参考文献19

共引文献5

同被引文献4

引证文献1

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

使用频繁结构提炼网络权威资源 被引量：1

参考文献21

二级参考文献19

共引文献5

同被引文献4

引证文献1

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

使用频繁结构提炼网络权威资源被引量：1