
Research on Chinese Chemical Name Recognition Based on Heuristic Rules
Original title: 基于启发式规则的中文化学物质命名识别研究
Cited by: 12
Abstract: Existing named entity recognition methods handle the extraction of domain-specific names poorly. To address this, the paper proposes a method for recognizing domain-specific names based on heuristic rules. Taking chemical substance names in Chinese text as its object of study, it analyzes their domain features and statistical language features, and on that basis builds heuristic rules suited to name recognition in chemical literature, offering a new approach to named entity recognition in specialized domains. Comparative experiments show that the method effectively improves the efficiency of domain-specific name recognition.
Source: New Technology of Library and Information Service (《现代图书情报技术》), 2010, No. 5, pp. 13-17 (5 pages). Indexed in CSSCI and the Peking University Core Journals list.
Keywords: chemical name recognition; heuristic rule; domain feature; statistical language feature; IUPAC