Chinese Maximal Noun Phrase Recognition Based on Mixed Strategy (基于混合策略的汉语最长名词短语识别)
Cited by: 7
Abstract: This paper proposes a classifier ensemble method based on linguistic knowledge evaluation. Using automatically acquired collocation resources and manually written evaluation rules, it fuses the maximal noun phrase (MNP) recognition results of support vector machines (SVMs) with the results of a reduction-based method built on cascaded conditional random fields (CRFs). Deterministic rules are then applied specifically to the special structures on which the classifiers are error-prone. The method improves the resolution of boundary ambiguities caused by consecutive verbs and prepositions and by consecutive nouns. Experiments achieve a precision of 89.30% and a recall of 89.62%, and the F1 score on multi-word MNPs improves by 0.75% over the reduction method.
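The abstract reports precision and recall but no overall F1 score. As a minimal sketch, the overall F1 can be derived from the two reported figures using the standard harmonic-mean definition. The function name `f1_score` is illustrative, and the resulting value is computed here rather than quoted from the paper.

```python
def f1_score(precision: float, recall: float) -> float:
    """Return the harmonic mean of precision and recall (the standard F1)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Figures reported in the abstract, expressed as fractions.
precision = 0.8930
recall = 0.8962

# Derived value, not a number stated in the paper.
overall_f1 = f1_score(precision, recall)
print(f"Overall F1: {overall_f1:.4f}")  # prints "Overall F1: 0.8946"
```

This derived figure covers all MNPs. The 0.75% gain mentioned in the abstract refers only to multi-word MNPs, so it cannot be reconstructed from these two numbers.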
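Authors: Qian Xiaofei (钱小飞), Hou Min (侯敏)
Source: Journal of Chinese Information Processing (《中文信息学报》), indexed in CSCD and the Peking University Core Journals list, 2013, No. 6, pp. 16-22 (7 pages)
Funding: Shanghai Philosophy and Social Science Planning Youth Project (2013EYY005); Research Project of the National Language Resources Monitoring and Research Center (YZYS08-04)
Keywords: maximal noun phrase recognition; linguistic knowledge evaluation; classifier ensemble; rules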
