Extraction of Open-Domain Event Trigger Words (开放域事件触发词抽取技术研究). Cited by: 1
Abstract Unlike the traditional definition of an event, an open-domain event is centered on an event trigger word from any domain and comprises structured data built from the elements associated with that trigger, such as time, location, participants, and quantity; such events are unpredictable. For open-domain trigger word extraction, this paper proposes a hybrid model that combines rules with binary classification (the R-Two model). The rule-based method relies on manually constructed rules; it extracts quickly and has strong representational power, but its rules are incomplete and it depends heavily on syntactic parsing. The binary classification method is more laborious to train, but it achieves high extraction accuracy and is little affected by syntactic parsing. The two methods are therefore fused, and experiments demonstrate the effectiveness of the fused approach.
Authors SU Xiao-dan (苏晓丹), ZHOU Gang (周刚), CHEN Hai-yong (陈海勇), DING Xuan-xuan (丁宣宣) (PLA Information Engineering University, Zhengzhou, Henan 450001, China; State Key Laboratory of Mathematical Engineering and Advanced Computing, Zhengzhou, Henan 450001, China)
Source Communications Technology (《通信技术》), 2017, No. 1, pp. 24-29 (6 pages)
Keywords open domain; trigger word; rule; binary classification (published in English as "two-element classification")
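The hybrid idea described in the abstract, a rule component fused with a binary classifier, can be illustrated with a minimal sketch. This is not the authors' R-Two implementation: the abstract does not specify the rules, the features, the classifier, or the fusion strategy. Everything below is an assumption made for illustration, including the toy lexicon rule, the part-of-speech features, the perceptron, the logical-OR fusion, and the English example sentences (the paper's own data is not shown in this record).

```python
from typing import List, Set, Tuple

Token = Tuple[str, str]  # (word, part-of-speech tag); the tag set is illustrative

# Hypothetical hand-built rule: a tiny lexicon of nouns that name events.
RULE_LEXICON: Set[str] = {"earthquake", "election"}


def rule_fires(tokens: List[Token], i: int) -> bool:
    """Rule component (assumed): the token is a listed event noun."""
    word, pos = tokens[i]
    return pos == "NOUN" and word in RULE_LEXICON


def features(tokens: List[Token], i: int) -> List[float]:
    """Toy feature vector (assumed): [bias, is_verb, is_noun]."""
    _, pos = tokens[i]
    return [1.0, float(pos == "VERB"), float(pos == "NOUN")]


def score(weights: List[float], x: List[float]) -> float:
    """Linear score of a feature vector under the given weights."""
    return sum(w * v for w, v in zip(weights, x))


def train_perceptron(sentences, epochs: int = 10) -> List[float]:
    """Binary classifier (assumed): a plain perceptron, trigger vs. non-trigger."""
    weights = [0.0, 0.0, 0.0]
    for _ in range(epochs):
        for tokens, gold in sentences:
            for i in range(len(tokens)):
                x = features(tokens, i)
                target = 1 if i in gold else 0
                predicted = 1 if score(weights, x) > 0 else 0
                # The update is zero whenever the prediction is already correct.
                for j, v in enumerate(x):
                    weights[j] += (target - predicted) * v
    return weights


def extract_triggers(tokens: List[Token], weights: List[float]) -> List[str]:
    """Fusion (assumed): accept a token if the rule OR the classifier accepts it."""
    return [
        word
        for i, (word, _) in enumerate(tokens)
        if rule_fires(tokens, i) or score(weights, features(tokens, i)) > 0
    ]


# Toy training data: each sentence is paired with the indices of its trigger words.
TRAIN = [
    ([("troops", "NOUN"), ("attacked", "VERB"), ("the", "DET"), ("city", "NOUN")], {1}),
    ([("she", "PRON"), ("resigned", "VERB")], {1}),
]
WEIGHTS = train_perceptron(TRAIN)

SENTENCE = [
    ("an", "DET"), ("earthquake", "NOUN"), ("struck", "VERB"),
    ("the", "DET"), ("coast", "NOUN"),
]
print(extract_triggers(SENTENCE, WEIGHTS))  # ['earthquake', 'struck']
```

In this toy setup the two components are complementary: the classifier, trained only on verbal triggers, finds "struck" but misses the nominal trigger "earthquake", while the lexicon rule finds "earthquake" but has no entry for "struck". A logical OR favors recall; other fusion schemes, such as using the classifier to filter rule candidates, are equally plausible readings of the abstract.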
