期刊文献+

基于领域样本查询的Deep Web数据库分类 被引量:1

Classfication of Deep Web Databases Based on the Domain Sample Query
下载PDF
导出
摘要 提出了一种基于领域样本查询的方法以分类这类Web数据库.通过分析领域的高级查询接口自动获取领域主属性并使用领域知识为主属性构建查询样本,然后对查询接口提交试探查询,根据返回结果页面的结果模式和记录内容估计Web数据库与领域的相关程度.通过在多个领域的Web数据库上进行实验验证,说明该方法分类只提供简单查询接口的Web数据库是有效的,取得了较高的分类精确率,召回率和F-measure值. An approach based on the domain sample query is proposed in this paper to classify the web database, it obtains domain of the main attributes by analyzing descriptive attribute labels in the advanced query interfaces, the correllations of between web database with simple query interface and domain can be estimated by result schema and records of result pages,which obtained by submitting probing queries to simple query interface. The experiments on several domains have proved that this approach is effective and can achieve high classification precision, recall and F-measure values.
出处 《微电子学与计算机》 CSCD 北大核心 2010年第3期20-23,共4页 Microelectronics & Computer
基金 国家自然科学基金项目(60673092) 江苏省重大科技支撑与自主创新项目(BE2008044) 江苏省"六大人才高峰"项目(06-E-037) 江苏省研究生创新计划项目(CX08B_099z)
关键词 DEEP WEB WEB数据库 数据库分类 简单查询接口 deep Web Web database database classification simple query interface
  • 相关文献

参考文献7

  • 1Chang KCC, He B, Li CK, et al. Structured databases on the web: observations and implications [ J ]. SIGMOD Record, 2004, 33(3):61-70.
  • 2赵朋朋,高岭,崔志明.基于查询接口特征的Deep Web数据源自动分类[J].微电子学与计算机,2006,23(10):47-50. 被引量:11
  • 3马军,宋玲,韩晓晖,闫泼.基于网页上下文的Deep Web数据库分类[J].软件学报,2008,19(2):267-274. 被引量:31
  • 4刘伟,孟小峰,孟卫一.Deep Web数据集成研究综述[J].计算机学报,2007,30(9):1475-1489. 被引量:136
  • 5Ipeirotis P G, Gravano L, Sahami M. Probe, count, and classify: categorizing hidden web databases[ C]//Proc. of the 19th ACM SIGMOD International Conference on Management of Data. Santa Barbara, 2001 : 67 78.
  • 6Zhong Hua, Zhao Pengpeng, Gao Ling, et al. Vision- based deep web result schema automatic extraction [ J ] Computational Information Systems, 2007, 3(4) : 1515 - 1522.
  • 7Meng X F, Lu H J, Wang H Y, et al. SG- WRAP. A schema guided wrapper generator[C]//Proc. of ICDE. China: Beijing, 2002: 331-332.

二级参考文献84

  • 1.[EB/OL].http://www.cogsci.Princeton.edu,.
  • 2Michael K Bergman.The deep web:surfacing hidden value[J].In journal of electronic publishing,2002,7 (1):8912~8914
  • 3K C C Chang,B He,C Li,et al.Structured databases on the web:observations and implications[J].SIGMOD Record,2004,33(3):61~70
  • 4Panagiotis G Ipeirotis,Luis Gravano,Mehran Sahami.Probe,count,and classify:categorizing hidden web databases[C].In Proceedings of the 2001 ACM SIGMOD International Conference on Management of Data,2001:67~78
  • 5Yih-Ling Hedley,Muhammad Younas,Anne E James.The categorisation of hidden web databases through concept specificity and coverage[C].In proceedings of the 2005 international workshop on web and mobile information Systems,2005:371~376
  • 6B He,T Tao,K C C Chang.Organizing structured web sources by query schemas:a clustering approach[C].In Proceedings of the 13th Conference on Information and Knowledge Management,2004:22~31
  • 7Qian Peng,Weiyi Meng,Hai He,et al.WISE-Cluster:Clustering e-commerce search engines automatically[C].In 6th ACM International Workshop on Web Information and Data Management,2004:104~111
  • 8Fetterly D,Manasse M,Najork M,Wiener J L.A largescale study of the evolution of Web pages//Proceedings of the 12th International World Wide Web Conference.Budapest,2003:669-678
  • 9Chang K C,He B,Li C,Patel M,Zhang Z.Structured databases on the Web:Observations and Implications.SIGMOD Record,2004,33(3):61-70
  • 10Cope J,Craswell N,Hawking D.Automated discovery of search interfaces on the Web//Proceedings of the 14th Australasian Database Conference(ADC 2003).Adelaide,2003:181-189

共引文献160

同被引文献4

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部