Research on Deep Web Surfacing Based on Common Search Engines

Original title: 基于通用搜索引擎的深层网络表面化方法研究. Cited by: 1
Abstract: Building on existing related research, this paper analyzes the basic principle of deep Web surfacing based on general-purpose search engines. It then discusses several key issues in deep Web surfacing: determining the value ranges of form fields, query processing, and setting hyperlinks in query result pages.
Author: Guo Shaoyou (郭少友)
Source: New Technology of Library and Information Service (《现代图书情报技术》), indexed in CSSCI and the Peking University Core Journals list, 2010, No. 2, pp. 24-30 (7 pages)
Keywords: search engine; deep Web; surfacing; database

References (12)

  • 1. Bergman M K. White Paper: The Deep Web: Surfacing Hidden Value [EB/OL]. [2009-10-20]. http://www.press.umich.edu/jep/07-01/bergman.html.
  • 2. Liu Wei, Meng Xiaofeng, Meng Weiyi. A Survey of Deep Web Data Integration [J]. Chinese Journal of Computers, 2007, 30(9): 1475-1489. Cited by: 136
  • 3. Doan A H, Domingos P, Halevy A. Reconciling Schemas of Disparate Data Sources: A Machine Learning Approach [EB/OL]. [2009-10-12]. http://www.cs.washington.edu/homes/pedrod/papers/sigmod01.pdf.
  • 4. Raghavan S, Garcia-Molina H. Crawling the Hidden Web [EB/OL]. [2010-02-11]. http://www.dia.uniroma3.it/~vldbproc/017_129.pdf.
  • 5. Deep Query Manager [EB/OL]. [2009-10-20]. http://brightplanet.com/products/dqm.asp.
  • 6. Callan J, Connell M. Query-based Sampling of Text Databases [J]. ACM Transactions on Information Systems, 2001, 19(2): 97-130.
  • 7. Ipeirotis P, Gravano L. Distributed Search over the Hidden Web: Hierarchical Database Sampling and Selection [EB/OL]. [2009-10-22]. http://softbase.uwaterloo.ca/~tozsu/courses/cs856/W05/Presentations/HiddenWeb_Amr.pdf.
  • 8. Ntoulas A, Zerfos P, Cho J. Downloading Textual Hidden Web Content Through Keyword Queries [EB/OL]. [2009-10-12]. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.105.137&rep=rep1&type=pdf.
  • 9. Wu P, Wen J R, Liu H, et al. Query Selection Techniques for Efficient Crawling of Structured Web Sources [EB/OL]. [2009-10-12]. http://research.microsoft.com/en-us/um/people/jrwen/jrwen_files/publications/deepwebcrawling.pdf.
  • 10. Byers S, Freire J, Silva C. Efficient Acquisition of Web Data Through Restricted Query Interfaces [EB/OL]. [2009-10-15]. http://www10.org/cdrom/posters/1051.pdf.

Secondary references (60)

  • 1. [EB/OL]. http://www.cogsci.Princeton.edu.
  • 2. Fetterly D, Manasse M, Najork M, Wiener J L. A large-scale study of the evolution of Web pages // Proceedings of the 12th International World Wide Web Conference. Budapest, 2003: 669-678.
  • 3. Chang K C, He B, Li C, Patel M, Zhang Z. Structured databases on the Web: Observations and implications. SIGMOD Record, 2004, 33(3): 61-70.
  • 4. Cope J, Craswell N, Hawking D. Automated discovery of search interfaces on the Web // Proceedings of the 14th Australasian Database Conference (ADC 2003). Adelaide, 2003: 181-189.
  • 5. Zhang Z, He B, Chang K C. Understanding Web query interfaces: Best-effort parsing with hidden syntax // Proceedings of the 23rd ACM SIGMOD International Conference on Management of Data. Paris, 2004: 107-118.
  • 6. Arasu A, Garcia-Molina H. Extracting structured data from Web pages // Proceedings of the 22nd ACM SIGMOD International Conference on Management of Data. San Diego, 2003: 337-348.
  • 7. Crescenzi V, Mecca G, Merialdo P. RoadRunner: Towards automatic data extraction from large Web sites // Proceedings of the 27th International Conference on Very Large Data Bases. Italy, 2001: 109-118.
  • 8. Wittenburg K, Weitzman L. Visual grammars and incremental parsing for interface languages // Proceedings of the IEEE Symposium on Visual Languages (VL). Skokie, 1990: 111-118.
  • 9. He H, Meng W, Yu C T, Wu Z. WISE-integrator: An automatic integrator of Web search interfaces for e-commerce // Proceedings of the 29th International Conference on Very Large Data Bases. Berlin, 2003: 357-368.
  • 10. Peng Q, Meng W, He H, Yu C T. WISE-cluster: Clustering e-commerce search engines automatically // Proceedings of the 6th ACM International Workshop on Web Information and Data Management. Washington, 2004: 104-111.

Co-citing documents (135)

Co-cited documents (7)

Citing documents (1)

Secondary citing documents (1)
