期刊文献+

从客户评论中识别命名实体——基于最大熵模型的实现 被引量:2

Recognizing Named Entity from Free-text Customer Reviews——A Maximum Entropy Model-based Approach
原文传递
导出
摘要 介绍命名实体识别的基本概念,分析两种命名实体识别的基本方法:基于规则的命名实体识别方法和基于统计的命名实体识别方法,并以最大熵模型为理论基础,对中文菜名识别进行实证研究。根据中文命名实体的特点,设计6种特征模板。实验结果表明,在简单特征模板的基础上增加标注特征能有效提高命名实体的识别效果。对改进识别效果有用的特征依次为:标注特征、词性组合特征、后向词性依赖特征和词形特征。 This paper introduces the concept of Named Entity Recognition ( NER), analyzes two basic approaches, the rulebased approach and the statistical approach, and conducts an empirical study on Chinese dish name recognition based on the theory of Maximum Entropy Model (MEM). According to the characteristics of Chinese named entity, 6 fea- ture templates are designed. Experimental results show that adding tagging features to the basic simple feature template ean efficiently improve the performance of Named Entity Recognition. The features in order to improve recognition performance are as follow : tagging features, combination of POS features, forward POS dependency features and word form features.
出处 《现代图书情报技术》 CSSCI 北大核心 2011年第5期77-82,共6页 New Technology of Library and Information Service
基金 国家自然科学基金资助项目"Web2.0环境下基于本体学习的观点挖掘研究"(项目编号:70903047) 上海市重点学科建设项目"系统分析与集成"(项目编号:S30501)的研究成果之一
关键词 命名实体识别 最大熵模型 客户评论 文本挖掘 Named entity recognition Maximum entropy model User reviews Text mining
  • 相关文献

参考文献1

二级参考文献11

  • 1Grishman R, Sundhiem B. Design of the MUC -6 Evaluation[ C]. In : Proceedings of the 6th Message Understanding Conference. NJ : Association for Computational Linguistics, 1995 : 1 - 11.
  • 2Chen H H, Ding Y W, Tsai S C, et al. Description of the NTU System Used for MET - 2 [ C ]. In : Proceedings of the 7th Message Understanding Conference. 1998.
  • 3Black W J, Rinaldi F, Mowatt D. Facile: Description of the NE System Used For MUC - 7 [ C ]. In : Proceedings of the 7th Message Understanding Conference. 1998.
  • 4Sun J, Gao J F, Zhang L, et al. Chinese Named Entity Identification Using Class Based Language Model [ C ]. In : Proceedings of the 19th International Conference on Computational Linguistics. N J: Association for Computational Linguistics, 2002 : 1 - 7.
  • 5Zhou G D, Su J. Named Entity Recognition Using an HMM Based Chunk Tagger[ C ]. In: Proceedings of the 40th Annual Meeting of the ACL. NJ : Association for Computational Linguistics, 2002 : 473 - 480.
  • 6Ramaparkhi A. A Simple Introduction to Maximum Entropy Models for Natural Language Processing [ R ]. Institute for Research in Cognitive Science, University of Pennsylvania, 1997.
  • 7Krauthammer M, Rzhetsky A, Morozov P, et al. Using BLAST for Identifying Gene and Protein Names in Journal Articles [J]. Gene, 2000, 259( 1 ) :245 -252.
  • 8Klinger R, Kolarik C, Fluck J, et al. Detection of IUPAC and IUPAC - like Chemical Names [ J ]. Bioinformatics, 2008, 24 ( 13 ) : 268 - 276.
  • 9刘建华,张智雄,徐健,许雁冬.自动术语识别--对科技文献进行文本挖掘的重要技术方法[J].现代图书情报技术,2008(8):12-17. 被引量:12
  • 10赵军.命名实体识别、排歧和跨语言关联[J].中文信息学报,2009,23(2):3-17. 被引量:50

共引文献11

同被引文献19

  • 1姚天昉,聂青阳,李建超,李林琳,陈柯,付宁.一个用于汉语汽车评论的意见挖掘系统[C]//中文信息处理前沿进展-中国中文信息学会二十五周年学术会议论文集.北京:清华大学出版社,2006:260-281.
  • 2Kim Soo-Min,Eduard Hovy.Determining the Sentiment of Opinions[C] //COLING'04Proceedings of the20th international conference on Computational Linguistics.Stroudsburg,PA,USA:Association for Computational Linguistics,2004.
  • 3Hu Minqing,Liu Bing.Mining and summarizing customer reviews[C] //KDD'04Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining.New York,NY,USA:ACM,2004:168-177.
  • 4赫博一,夏云庆,郑方.PINAX:一个有效的产品属性挖掘系统[C] //第四届全国信息检索与内容安全学术会议论文集.北京:清华大学智能技术与系统国家重点实验室,2008:281-290.
  • 5Zhuang Li,Jing Feng,Zhu Xiao-Yan.Movie Review Mining and Summarization[C] //CIKM'06Proceedings of the15th ACM international conference on Information and knowledge management.New York,NY,USA:ACM,2006.
  • 6吴月萍,陈玉泉.基于Web的概念属性抽取的研究[J].中国管理信息化,2009,12(10):98-101. 被引量:7
  • 7余传明.从用户评论中挖掘产品属性——基于SOM的实现[J].现代图书情报技术,2009(5):61-66. 被引量:20
  • 8余传明.从产品评论中挖掘观点:原理与算法分析[J].情报理论与实践,2009,32(7):124-128. 被引量:15
  • 9宋晓雷,王素格,李红霞.面向特定领域的产品评价对象自动识别研究[J].中文信息学报,2010,24(1):89-93. 被引量:34
  • 10闫丹辉,毕玉德.基于规则的越南语命名实体识别研究[J].中文信息学报,2014,28(5):198-205. 被引量:15

引证文献2

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部