期刊文献+

Web-Based Information Extraction Technology

Web-Based Information Extraction Technology
下载PDF
导出
摘要 Information extraction techniques on the Web are the current research hotspot. Now many information extraction techniques based on different principles have appeared and have different capabilities. We classify the existing information extraction techniques by the principle of information extraction and analyze the methods and principles of semantic information adding, schema defining, rule expression, semantic items locating and object locating in the approaches. Based on the above survey and analysis, several open problems are discussed. Information extraction techniques on the Web are the current research hotspot. Now many information extraction techniques based on different principles have appeared and have different capabilities. We classify the existing information extraction techniques by the principle of information extraction and analyze the methods and principles of semantic information adding, schema defining, rule expression, semantic items locating and object locating in the approaches. Based on the above survey and analysis, several open problems are discussed.
出处 《Journal of Donghua University(English Edition)》 EI CAS 2007年第2期288-292,共5页 东华大学学报(英文版)
关键词 HTML XML RULE SEMANTIC information extraction Hidden Markov model HTML XML 信息提取 隐马尔可夫模型
  • 相关文献

参考文献5

  • 1Stephen Soderland.Learning Information Extraction Rules for Semi-Structured and Free Text[J].Machine Learning (-).1999(1-3)
  • 2A Laender,B Ribeiro-Neto,ASilva,et al.A Brief Survey of Web Data Extraction Tools[].SIGMOD Record.2002
  • 3C H Hsu,M T Dung.Generating Finite-Sate Transducers for Semistructured Data Extraction From the Web[].Infor mation Systems.1998
  • 4A Soderland,F Azavant.Building Intelligent Web Applications Using Lightweight Wrappers[].Data Mining and Knowledge Discovery.2001
  • 5V Crescenzi,G Mecca.Grammars Have Exceptions[].Infor mation Systems.1998

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部