通过检测语义分歧识别无答案问题(英文)

Unanswerable Questions Recognition by Semantic Discrepancy Detection

导出

摘要机器阅读理解中存在无法仅从给定文档中获取问题答案的特殊情况,为此,基于语义冲突检测的机器阅读理解网络(SCDNet)提出应通过检测问题与文档内容之间的语义分歧来识别这种情况.经分析发现,文档无法为问题提供答案的根本原因主要分为两类:一是文档中不包含问题所需的语义信息;二是二者包含的语义成分之间存在分歧.据此推断,可以通过检测文档语义信息是否全面涵盖问题所需的信息来识别问题是否可由文档信息给出回答.此外,通过在损失函数中加入答案文本长度惩罚项,网络优化目标函数更接近评测指标,系统性能得到提升.网络模型使用联合训练模型建模无答案的问题识别与答案抽取2个子任务,并使用端到端的方式训练.实验结果证明,其对无答案问题类别预测的正确率超过了性能先进的基线模型SAN2.0,在SQuAD2.0数据集上取得了72.43的F1值和76.96的无答案问题识别正确率. Machine reading comprehension(MRC)with unanswerable questions is challenging to the field of natural language processing research.Unlike previous work which ignores the mechanism of answerable and unanswerable,the semantic conflicts detection-based MRC network(SCDNet)was proposed aiming at detections of no-answer(NA)questions through semantic conflicts detection network.The basic idea is that if the given question is unanswerable,there exists semantic absence or conflicts between the question and the reference passages.Therefore,SCDNet predicts the NA probability by checking whether the passage covers the integral semantics of the question.Besides,in order to extract the exact answer from the passage,SCDNet is applied an answer length penalty in the loss function,which helps the learning objective to be more consistent with the evaluation metrics.SCDNet packs the NA question predictor and the answer extractor in a joint model and is trained in an end-to-end manner.Experiments show that SCDNet performs better than some strong baseline models,and achieve an F1 score of 72.43 and 76.96 NA accuracy on SQuAD 2.0 dataset.

作者刘咏彬王小捷袁彩霞易炼 LIU Yong-bin;WANG Xiao-jie;YUAN Cai-xia;YI Lian(School of Telecommunication Engineering,Beijing University of Posts and Telecommunications,Beijing 100876,China;Alibaba(Beijing)Software Services Company Limited,Beijing 100022,China)

机构地区北京邮电大学计算机院阿里巴巴(北京)软件服务有限公司

出处《北京邮电大学学报》 EI CAS CSCD 北大核心 2019年第6期126-133,141,共9页 Journal of Beijing University of Posts and Telecommunications

基金中央高校基本科研业务费专项资金项目(500419302).

关键词机器阅读理解问答系统无答案的问题 machine reading comprehension question answering unanswerable question

分类号 TP391.1 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

1李建新,李晓艳.建设现代农机维修服务中心方案及设想[J].农机使用与维修,2020,0(3):24-24. 被引量：1
2张明怀.新型全自动拆除工作面用液压支架装车平台的技术应用[J].科学技术创新,2020(4):148-149. 被引量：1
3张福学,李晓燕.现代农机维修支撑体系的研究与探讨[J].农机使用与维修,2020,0(3):13-14. 被引量：3
4何明明,彭建文.一种求解单调包含问题的惯性混合邻近外梯度算法[J].数学杂志,2019,39(6):931-945.
5李彬彬,李晓婵,陈佳伟.酒店实习生项目化实习(PBI)培养模式构建研究[J].中国职业技术教育,2020,36(2):90-96. 被引量：4
6杨琳琳.基于归零模式财务问题识别与治理机制[J].新会计,2020(3):50-53.
7高郭池,全敬泽,李保良,赵生,张慧中.Y12F飞机局方审定飞行试验研究[J].飞行力学,2020,38(1):84-89. 被引量：3
8黄佳佳,李鹏伟,彭敏,谢倩倩,徐超.基于深度学习的主题模型研究[J].计算机学报,2020,43(5):827-855. 被引量：46
9李睿凡,梁昊雨,冯方向,张光卫,王小捷.全卷积神经结构的段落式图像描述算法[J].北京邮电大学学报,2019,42(6):155-161. 被引量：2
10鲍琨.核电企业内容管理平台建设探讨[J].中文科技期刊数据库（全文版）图书情报,2020(3):42-44.

北京邮电大学学报

2019年第6期

浏览历史

内容加载中请稍等...

通过检测语义分歧识别无答案问题(英文)

相关作者

相关机构

相关主题

浏览历史