基于Fin-BERT的中文金融领域事件抽取方法

Fin-BERT-Based Event Extraction Method for Chinese Financial Domain

下载PDF

导出

摘要事件抽取旨在从海量的非结构化的事件相关文本中抽取出人类感兴趣的内容,目前现有的事件抽取方法大多数基于通用语料,很少考虑到领域内的先验知识,并且现有的方法大多数不能很好地处理同一文档包含多个事件的情况,面对存在较多负面样例的测试也表现不佳。针对上述问题提出了一种基于Fin-BERT(financial bidirectional encoder representation from Transformers)和PTPCG(pseudo-trigger-aware pruned complete graph)的模型FinPTPCG,该方法充分利用Fin-BERT预训练模型的表达能力,在编码阶段融入领域内的先验知识,并且在事件检测模块采用多个二元分类器叠加的方式,保证模型可以有效识别一篇文档内存在多事件的情况并筛除掉负面样例,抽取实体之后将实体连接成完全图并通过计算相似度矩阵进行剪枝,通过选择伪触发器解决无标注触发词的问题,最后接入事件分类器实现事件抽取。该方法在ChFinAnn和Duee-fin数据集上事件抽取任务的F1值相比于基线方法分别取得了0.7个百分点和3.7个百分点的提升。 Event extraction aims to extract human-interest information from massive amounts of unstructured text.Currently,most existing event extraction methods are based on general corpora and rarely consider domain-specific prior knowledge.Moreover,most methods cannot handle well the case where multiple events exist in the same document,and they perform poorly when faced with a large number of negative examples.To address these issues,this paper proposes a model called Fin-PTPCG based on Fin-BERT(financial bidirectional encoder representation from Transformers)and PTPCG(pseudo-trigger-aware pruned complete graph).This method fully utilizes the expression ability of the Fin-BERT pre-training model and incorporates domain-specific prior knowledge during the encoding stage.In the event detection module,multiple binary classifiers are stacked to ensure that the model can effectively identify the situation of multiple events in a document and screen out negative examples.Combined with the decoding module of the PTPCG model,entities are extracted and connected into a complete graph and pruned by calculating a similarity matrix.The problem of unlabeled triggers is solved by selecting pseudo-triggers.Finally,the event extraction is achieved by the event classifier.This method achieves a 0.7 and 3.7 percentage points improvement in F1 score compared to the baselines on the ChFinAnn and Duee-fin datasets for the event extraction task.

作者李熠耿朝阳杨丹 LI Yi;GENG Chaoyang;YANG Dan(School of Computer Science and Engineering,Xi’an Technological University,Xi’an 710021,China)

机构地区西安工业大学计算机科学与工程学院

出处《计算机工程与应用》 CSCD 北大核心 2024年第14期123-132,共10页 Computer Engineering and Applications

关键词事件抽取事件检测信息抽取自然语言处理 event extraction event detection information extraction natural language processing

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献4

1郭喜跃,何婷婷.信息抽取研究综述[J].计算机科学,2015,42(2):14-17. 被引量：84
2胡瑞娟,周会娟,刘海砚,李健.基于深度学习的篇章级事件抽取研究综述[J].计算机工程与应用,2022,58(24):47-60. 被引量：4
3陈星月,倪丽萍,倪志伟.基于ELECTRA模型与词性特征的金融事件抽取方法研究[J].数据分析与知识发现,2021,5(7):36-47. 被引量：7
4万齐智,万常选,胡蓉,刘德喜.基于句法语义依存分析的中文金融事件抽取[J].计算机学报,2021,44(3):508-530. 被引量：26

二级参考文献41

1李妮,关焕梅,杨飘,董文永.基于BERT-IDCNN-CRF的中文命名实体识别方法[J].山东大学学报（理学版）,2020,55(1):102-109. 被引量：54
2张晓艳,王挺,陈火旺.命名实体识别研究[J].计算机科学,2005,32(4):44-48. 被引量：66
3俞鸿魁,张华平,刘群,吕学强,施水才.基于层叠隐马尔可夫模型的中文命名实体识别[J].通信学报,2006,27(2):87-94. 被引量：157
4钱世凤.省略界定综述[J].语文学刊（高等教育版）,2007(1):119-122. 被引量：3
5Wikipedia:Message Understanding Conference[EB/OL].2013-12-27.http://en.wikipedia.org/wiki/Message_Understanding_Conference.
6Wikipedia:Named Entity Recognition[EB/OL].2013-12-28.http://en.wikipedia.org/wiki/Named_Entity_Recognition.
7Rizzo G,Troncy R.NERD:Evaluating Named Entity Recognition Toolsinthe Web of Data[J].Lecture Notesin Computer Science,2012(7295):39-55.
8Rizzo G,Troncy R.NERD:A Framework for Unifying Named Entity Recognition and Disam biguation Extraction Tools[C]∥13th Conference ofthe European Chapter of the Association for ComputationalL inguistics.2012:73-76.
9Li Chen-liang,Weng Jian-shu.TwiNER:Named Entity Recognition in Targeted Twitter Stream[C]∥SIGIR.2012:721-730.
10Liu Xiao-hua,Zhang Shao-dian,et al.Recognizing Named Entitiesin Tweets[C]∥ACL.2011:359-367.

共引文献113

1孔静静,于琦,李敬华,于彤,张竹绿,田野,祖雅琪.实体抽取综述及其在中医药领域的应用[J].世界科学技术-中医药现代化,2022,24(8):2957-2963. 被引量：4
2陈平,匡尧,陈婧.基于BERT-wwm-ext多特征文本表示的经济事件主体抽取方法研究[J].武汉电力职业技术学院学报,2020(2):45-50. 被引量：1
3张海瑜,陈庆龙,张斯静,张子怡,杨帆,李鑫星.基于语义知识图谱的农业知识智能检索方法[J].农业机械学报,2021,52(S01):156-163. 被引量：13
4王竹,谷松原.基于裁判文书争议焦点的民事案由逻辑图谱构建研究——以产品责任领域为例[J].民商法争鸣,2022(2):13-25.
5李春楠,王雷,孙媛媛,林鸿飞.基于BERT的盗窃罪法律文书命名实体识别方法[J].中文信息学报,2021,35(8):73-81. 被引量：19
6吴天昊,古丽拉·阿东别克.基于神经元块级别注意力机制的LSTM关系抽取[J].计算机应用研究,2020,37(S02):76-79. 被引量：6
7程乔,王映华,李冉,李友建.基于互联网+舆情数据发掘支撑网络优化新思路的研究[J].广西通信技术,2020(1):1-7.
8丁若尧.面向古汉语史料的信息抽取方法综述[J].中国科技纵横,2019,0(14):50-51. 被引量：1
9郭红转.基于信息增长模式的信息研究探讨[J].安徽工程大学学报,2015,30(5):86-90.
10王定桥,李卫华,杨春燕.从用户需求语句建立问题可拓模型的研究[J].智能系统学报,2015,10(6):865-871. 被引量：3

1吴叶辉,李汝嘉,季荣彪,李亚东,孙晓海,陈娇娇,杨建平.基于随机增强Swin-Tiny Transformer的玉米病害识别及应用[J].吉林大学学报（理学版）,2024,62(2):381-390.
2张惠鹃,黄钦阳,胡诗彦,杨青,张敬伟.完全图高阶关系驱动的链接预测[J].计算机研究与发展,2024,61(7):1825-1835. 被引量：1
3周军锋,王春花,杜明,陈子阳.SIHC:一种高效的时态图上k-core查询算法[J].计算机学报,2024,47(5):1045-1064.
4Sai Ji,Min Li,Mei Liang,Zhenning Zhang.Robust Correlation Clustering Problem with Locally Bounded Disagreements[J].Tsinghua Science and Technology,2024,29(1):66-75.
5Xiao-bing GUO,Si-nan HU,Yue-jian PENG.Ramsey Numbers of Trees Versus Multiple Copies of Books[J].Acta Mathematicae Applicatae Sinica,2024,40(3):600-612.
6Widad Elbakri,Maheyzah Md.Siraj,Bander Ali Saleh Al-rimy,Sultan Noman Qasem,Tawfik Al-Hadhrami.Adaptive Cloud Intrusion Detection System Based on Pruned Exact Linear Time Technique[J].Computers, Materials & Continua,2024,79(6):3725-3756.

计算机工程与应用

2024年第14期

浏览历史

内容加载中请稍等...

基于Fin-BERT的中文金融领域事件抽取方法

参考文献4

二级参考文献41

共引文献113

相关作者

相关机构

相关主题

浏览历史