期刊文献+

归纳学习XPATH Web信息提取规则 被引量:7

Inductively Learn XPATH Web Information Extraction Rules
下载PDF
导出
摘要 XPATH在Web信息提取中起重要作用,但是这些XPATH规则通常要人工生成。文中讨论了在XPATH与基于文本上下文规则的信息提取方法结合的系统中如何归纳学习XPATH规则。生成的XPATH规则结构简单,可以为基于文本上下文的信息提取系统提供较为准确的信息定位。 XPATH plays an important role in Web information extraction, but these XPATH rules usually generated by hand. Discusses about how to inductively learn XPATH rules used in an XPATH and text - context - based rules combined infomlation extraction system. The generated rules have simple structure, and they can support as an accurate locator for text- context- based informstation extraction system.
出处 《计算机技术与发展》 2007年第3期98-101,共4页 Computer Technology and Development
基金 江苏省高技术研究计划(G2004034)
关键词 信息提取系统 XPPATH 归纳 information extraction systems XPATH induction
  • 相关文献

参考文献10

  • 1Sahuguet A,Azavant F.Building Light-Weight Wrappers for Legacy Web Data-Sources Using W4F[C]∥Proceedings of the 25th International Conference on Very Large Data Bases VLDB '99.[s.l.]:Morgan Kaufmann Publishers Inc,1999:738-741.
  • 2Liu Ling,Pu Calton,Han Wei.XWRAP:An XML-enabled Wrapper Construction System for WEB Information Source[C]∥Data Engineering,2000.Proceedings.16th International Conference.[s.l.]:[s.n.],2000:611-621.
  • 3Bauamgartner R,Flesrs S,Gottlob G.Visual Web information Extraction with Lixto[C]∥Proceedings of the 27th International Conference on Very Large Data Bases VLDB'01.[s.1.]:Morgan Kaufmann Publishers Inc,2001:119-128.
  • 4Freitag D.Machine Learning for information extraction in informal domains[J].Machine Learning,2000,39 (2-3):169-202.
  • 5Califf M E,Mooney R J.Relational Learning of Pattern -Match Rules for Information Extraction[C]∥In:Proc.of the Sixteenth National Conf,on Artificial Intelligence and Eleventh Conference on Innovative Applications of Artificial Intelligence.Orlando,Florida:[s.n.],1999:328-334.
  • 6SoderLan S.Learning Informatin Extraction Rules for Semi -Structured and Free Text[J].Machine Learning,1999,34(1-3):233-272.
  • 7俞巍.XPath的两种解析技术[J].计算机时代,2006(1):49-51. 被引量:1
  • 8张昱,付雄.含XPath的表达式的解析与应用[J].小型微型计算机系统,2004,25(3):442-446. 被引量:2
  • 9王钊,耿蓉,王国仁.XPath的轴连接查询技术研究[J].小型微型计算机系统,2005,26(11):1942-1947. 被引量:2
  • 10王强,武港山.对XPath模式定位能力的扩充[J].计算机研究与发展,2001,38(6):674-678. 被引量:4

二级参考文献22

  • 1[1]Tim Bray, Jean Paoli, C M Sperberg-McQueen. Extensible Markup Language(XML), version 1.0, 1998. http://www.w3.org/TR/1998/REC-xml-19980210
  • 2[2]James Clark, Steve DeRose. XML Path Language(XPath), version 1.0, 1999. http://www.w3.org/TR/1999/REC-xpath-19991116
  • 3[3]DeRose, Steven J. XQuery: A unified syntax for linking and querying general XML documents. In: Proc of QL'98—The Query Languages Workshop. Boston: World Wide Web Consortium, 1998
  • 4[4]Derick Wood. Theory of Computation. New York: Harper & Row Publishers Inc, 1987
  • 5[5]Hartmut Liefke. Horizontal query optimization on ordered semistructured data. In: WebDB'99. 1999. http://citeseer.nj.nec.com/246796.html
  • 6[2]Aaron Skonnard.XML精要快速参考手册.人民邮电出版社,2002.
  • 7[3]http://java.sun.com/j2se/1.5.0/docs/api/javax/xml/xpath/package-summary.html.2004.
  • 8[4]http://www.jdom.org/docs/apidocs/org/jdom/xpath/XPath.html.2004.
  • 9XML Path Language (XPath) 2.0. November 2002. W3C Recommendation[EB/OL]. Available at http://www.w3.org/TR/2002/WD-xpath20-20021115.
  • 10Abiteboul S, Quass D, McHugh J et al. The lorel query language for semistructured data[J]. International Journal on Digital Libraries, 1997,1(1): 68-88.

共引文献5

同被引文献50

引证文献7

二级引证文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部