Intensive reading document-level event extraction based on multi-feature semantics
Abstract Document-level event extraction commonly depends on entity recognition, ignores prior semantics, and struggles with event arguments (parameters) dispersed across sentences. To address these problems, an intensive-reading extraction method that fuses multiple semantic features is proposed. Following the characteristics of "three-stage" reading, external semantic templates are built from event-role interactions, role types, and role interpretation features, and a window segmentation algorithm is proposed to split the document's semantics into windows. The external and window semantics are fused using the pre-trained model BERT. The document is read intensively over multiple rounds to avoid dependence on entities, and a memory network models the results of each reading round in order to locate arguments across sentences and expand event paths. Noise perturbation is introduced to prevent the pre-trained language model (PLM) from overfitting. Experimental results show that the model outperforms current mainstream methods, supporting its feasibility and effectiveness.
Authors 赵梦瑶 (ZHAO Meng-yao); 刘大明 (LIU Da-ming) (College of Computer Science and Technology, Shanghai University of Electric Power, Shanghai 201306, China)
Source Computer Engineering and Design (《计算机工程与设计》), PKU Core Journal, 2024, No. 6, pp. 1903-1909 (7 pages)
Funding Natural Science Foundation of Gansu Province (SKLLDJ032016021).
Keywords entity dependency; parameter dispersion; semantic feature fusion; window segmentation algorithm; pre-training model; multi-round intensive reading; memory network; noise disturbance
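
The abstract names a window segmentation algorithm but does not describe it. As a rough illustration of the general idea only, and not the authors' algorithm, the sketch below greedily packs consecutive sentences into windows under a token budget, carrying a small sentence overlap between neighbouring windows. The function name `split_into_windows`, the whitespace token counter, the 510-token default (BERT's 512-position limit minus two special tokens), and the overlap policy are all assumptions made for this example.

```python
from typing import Callable, List, Optional


def split_into_windows(
    sentences: List[str],
    max_tokens: int = 510,   # assumed budget: 512 positions minus [CLS] and [SEP]
    overlap: int = 1,        # assumed: sentences repeated between adjacent windows
    count_tokens: Optional[Callable[[str], int]] = None,
) -> List[List[str]]:
    """Greedily pack consecutive sentences into windows under a token budget.

    Hypothetical helper for illustration; the paper's actual algorithm
    is not given in the abstract.
    """
    if count_tokens is None:
        # Stand-in tokenizer: whitespace word count (an assumption).
        count_tokens = lambda s: len(s.split())

    windows: List[List[str]] = []
    current: List[str] = []
    current_len = 0

    for sent in sentences:
        n = count_tokens(sent)
        if current and current_len + n > max_tokens:
            # Close the full window and start a new one with some context.
            windows.append(current)
            current = current[-overlap:] if overlap > 0 else []
            current_len = sum(count_tokens(s) for s in current)
            # Drop the carried-over context if it leaves no room for `sent`.
            if current_len + n > max_tokens:
                current, current_len = [], 0
        # A single sentence longer than the budget still gets its own window.
        current.append(sent)
        current_len += n

    if current:
        windows.append(current)
    return windows


if __name__ == "__main__":
    doc = ["a b c", "d e", "f g h i", "j"]
    for window in split_into_windows(doc, max_tokens=6, overlap=1):
        print(window)
```

In a real pipeline the whitespace counter would be replaced by the tokenizer of the pre-trained model, so that the budget matches what the encoder actually sees.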