
Key Technologies of Event Base Construction and Event Detection in the Field of Science and Technology
Cited by: 2
Abstract This study proposes a method for constructing a risk event resource base and an event detection model for the field of science and technology. By analyzing the text features of online news source data, it builds a meta-event resource base and a theme-event regenerated resource base model from crawled science and technology media news, and proposes a comprehensive evaluation model for event detection. For event detection, the study proposes a two-branch Transformer language model for risk events in the science and technology field, which extracts lexical features related to the degree of risk and weakens the interference that the text's own domain features cause in the risk event classification task, so that risk events can be identified. The results verify the effectiveness of the proposed risk event detection model and of the indicators used to judge the risk propensity of meta-events. The study can serve as a reference for constructing risk event resource bases in the field of science and technology, and the proposed language model offers a methodological and technical reference for research on risk detection.
Authors Liu Yao (刘耀); Fang Xiaowei (房小玮); Qin Xun (秦迅) (Institute of Scientific and Technical Information of China, Beijing 100038; School of Software & Microelectronics, Peking University, Beijing 100091)
Source Journal of the China Society for Scientific and Technical Information (《情报学报》), indexed in CSSCI, CSCD, and the Peking University Core Journals list, 2022, No. 11, pp. 1188-1198 (11 pages)
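Funding National Social Science Fund of China project "Research on Models and Methods for Knowledge Sharing and Knowledge Reuse of Digital Resources" (21BTQ011).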
Keywords risk event; meta-event extraction; event detection; event base
