期刊文献+

Deep Web查询接口的判定技术研究 被引量:1

Research on Deep Web Query Interface Determining Technology
下载PDF
导出
摘要 互联网的飞速发展,给人类带来了海量的可供访问信息,但是,现今搜索引擎索引的绝大部分是表层SurfaceWeb网的信息,限于一些技术原因,搜索引擎几乎无法索引到Deep Web网中的信息。由于查询接口是Deep Web的唯一入口,但并非所有的网页表单都是查询接口,为了能充分利用Deep Web后台数据库信息,首先要找到进入Deep Web后台数据库的入口,所以对查询接口的正确判定至关重要。文中介绍了利用决策树C4.5分类算法自动判定网页表单是否为DeepWeb查询接口的方法。 The rapid development of the Internet brought a mass of information, but the search engine indexed most of the Surface Web, limited to a number of technical reasons, the search engine was almost impossible to index Deep Web. The query interface was the only entrance to the Deep Web, but not all of the web forms were query interfaces. In this paper, using C4.5 decision tree classification algorithm automatic web form to determine whether the Deep Web query interface.
作者 李齐会
出处 《计算机与数字工程》 2009年第3期131-134,共4页 Computer & Digital Engineering
关键词 DEEP WEB 查询接口 网页表单 决策树C4.5分类算法 Deep Web, query interface, web form, C4.5 decision tree classification algorithm
  • 相关文献

参考文献8

  • 1刘伟,孟小峰,孟卫一.Deep Web数据集成研究综述[J].计算机学报,2007,30(9):1475-1489. 被引量:136
  • 2赵朋朋,高岭,崔志明.基于查询接口特征的Deep Web数据源自动分类[J].微电子学与计算机,2006,23(10):47-50. 被引量:11
  • 3SHERMAN C, PRICE G. The Invisible Web: Uncovering Information Sources Search Engines Can't See[M]. New York: Cyber Age Books, 2001
  • 4M. K. Bergman. The deep Web: Surfacing hidden value. White Paper, Bright Planet, 2001
  • 5CHANG K C , HE B , LI C , PATEL M , ZHANG Z. Structured databases on the Web .. Observa- tions and Implications[R]. SIGMOD Record , 2004 , 33 (3) : 61270
  • 6RAGHAVAN S, GARCIA-MOLINA H. Crawling the hidden Web. Proceedings of the 27th International Conference on Very Large Data Bases [C]. Italy: Rome, 2001 : 129--138
  • 7Http:// book. dangdany.com/01.41. htm#[EB/OL]
  • 8GHANEM T M, AREF W G. Databases Deepen the Web[J]. IEEE Computer, 2004, 73(1):116-117

二级参考文献66

  • 1.[EB/OL].http://www.cogsci.Princeton.edu,.
  • 2Michael K Bergman.The deep web:surfacing hidden value[J].In journal of electronic publishing,2002,7 (1):8912~8914
  • 3K C C Chang,B He,C Li,et al.Structured databases on the web:observations and implications[J].SIGMOD Record,2004,33(3):61~70
  • 4Panagiotis G Ipeirotis,Luis Gravano,Mehran Sahami.Probe,count,and classify:categorizing hidden web databases[C].In Proceedings of the 2001 ACM SIGMOD International Conference on Management of Data,2001:67~78
  • 5Yih-Ling Hedley,Muhammad Younas,Anne E James.The categorisation of hidden web databases through concept specificity and coverage[C].In proceedings of the 2005 international workshop on web and mobile information Systems,2005:371~376
  • 6B He,T Tao,K C C Chang.Organizing structured web sources by query schemas:a clustering approach[C].In Proceedings of the 13th Conference on Information and Knowledge Management,2004:22~31
  • 7Qian Peng,Weiyi Meng,Hai He,et al.WISE-Cluster:Clustering e-commerce search engines automatically[C].In 6th ACM International Workshop on Web Information and Data Management,2004:104~111
  • 8Fetterly D,Manasse M,Najork M,Wiener J L.A largescale study of the evolution of Web pages//Proceedings of the 12th International World Wide Web Conference.Budapest,2003:669-678
  • 9Chang K C,He B,Li C,Patel M,Zhang Z.Structured databases on the Web:Observations and Implications.SIGMOD Record,2004,33(3):61-70
  • 10Cope J,Craswell N,Hawking D.Automated discovery of search interfaces on the Web//Proceedings of the 14th Australasian Database Conference(ADC 2003).Adelaide,2003:181-189

共引文献143

同被引文献8

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部