期刊文献+

基于半监督学习的小语种机器翻译算法 被引量:8

Machine translation algorithm of low-resource languages based on semi-supervised learning
下载PDF
导出
摘要 近年来,基于神经网络的机器翻译取得了快速发展,然而由于它需要大规模的平行语料库,所以对于资源稀缺的小语种的翻译往往显得效果不佳.在分析编码-解码框架和注意力机制的基础上,基于对偶学习的思想,提出了一种面向小语种翻译的半监督神经网络模型.该模型利用较大的单语语料库与少量平行语料库来实现小语种翻译.实验结果表明,当平行语料资源不足以训练一个普通神经网络模型时,使用半监督网络模型能够取得较好的结果,但所采用的半监督学习模型对单语语料库的数量要求非常高,要达到一定数量级才能达到良好效果. Recent years,neural machine translation has achieved great development.However,its requirement for large-scale parallel corpora,translating low-resource languages fluently becomes a big challenge.This paper first briefly introduces the encoder-decoder framework and attention mechanism.Next,we propose a semi-supervised neural network model based on dual-learning,which can translate low-resource languages using some monolingual corpora and small parallel corpora.Finally,results show that semi-supervised neural machine translation can achieve reasonable results with parallel corpora which are insufficient to train a common neural model.However,the semi-supervised model requires a large number of monolingual corpora to achieve great performance.
作者 陆雯洁 谭儒昕 刘功申 孙环荣 LU Wenjie;TAN Ruxin;LIU Gongshen;SUN Huanrong(Shanghai Jiao Tong University,School of Electronic Information and Electrical Engineering,Shanghai 200240,China;Shanghai Jiao Tong University-Shanghai Songheng Information Content Analysis Joint Lab,Shanghai 200240,China)
出处 《厦门大学学报(自然科学版)》 CAS CSCD 北大核心 2019年第2期200-208,共9页 Journal of Xiamen University:Natural Science
基金 国家自然科学基金(61772337 61472248)
关键词 半监督学习 小语种 机器翻译 semi-supervised learning low-resource language machine translation
  • 相关文献

参考文献8

二级参考文献99

  • 1宋金兰.汉藏语形态变体的分化[J].民族语文,2002(1):29-33. 被引量:5
  • 2才藏太,华关加.班智达汉藏公文翻译系统中基于二分法的句法分析方法研究[J].中文信息学报,2005,19(6):7-12. 被引量:10
  • 3刘挺,李维刚,张宇,李生.复述技术研究综述[J].中文信息学报,2006,20(4):25-32. 被引量:13
  • 4徐波,史晓东,刘群,宗成庆,庞薇,陈振标,杨振东,魏玮,杜金华,陈毅东,刘洋,熊德意,侯宏旭,何中军.2005统计机器翻译研讨班研究报告[J].中文信息学报,2006,20(5):1-9. 被引量:10
  • 5何婷婷,徐超,李晶,赵君喆.基于种子自扩展的命名实体关系抽取方法[J].计算机工程,2006,32(21):183-184. 被引量:25
  • 6Peter F Brown, Stephen A Delia Pietra, Vincent J Della Pietra, et al. The mathematics of statistical ma- chine translation: parameter estimation[-J]. Computa- tional Linguistics. 1993, 19 (2): 263-311.
  • 7Philipp Koehn, Franz Josef Och, Daniel Mareu. Sta- tistical phrase-based translation [-C//Proeeedings of HLT-NAACL. Edmonton, Canada, 2003: 48-54.
  • 8Philipp Koehn, Hieu Hoang, Alexandra Birch, et al. Moses. open source toolkit for statistical machine translation I-C//Proceedings of ACL of demo and poster sessions. Prague, Czech Republic, 2007: 177- 180.
  • 9David Chiang. A hierarchical phrase-based model for statistical machine translation [- C]//Proceedings of ACL05. Ann Arbor, MI, 2005: 263-270.
  • 10Kenji Yamada, Kevin Knight. A syntax-based statisti- cal translation model [C]//Proceedings of ACL- EACL01. Toulouse, France, 2001:523 530.

共引文献128

同被引文献112

引证文献8

二级引证文献18

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部