期刊文献+

How Do Pronouns Affect Word Embedding

How Do Pronouns Affect Word Embedding
原文传递
导出
摘要 Word embedding has drawn a lot of attention due to its usefulness in many NLP tasks. So far a handful of neural-network based word embedding algorithms have been proposed without considering the effects of pronouns in the training corpus. In this paper, we propose using co-reference resolution to improve the word embedding by extracting better context. We evaluate four word embeddings with considerations of co-reference resolution and compare the quality of word embedding on the task of word analogy and word similarity on multiple data sets.Experiments show that by using co-reference resolution, the word embedding performance in the word analogy task can be improved by around 1.88%. We find that the words that are names of countries are affected the most,which is as expected. Word embedding has drawn a lot of attention due to its usefulness in many NLP tasks. So far a handful of neural-network based word embedding algorithms have been proposed without considering the effects of pronouns in the training corpus. In this paper, we propose using co-reference resolution to improve the word embedding by extracting better context. We evaluate four word embeddings with considerations of co-reference resolution and compare the quality of word embedding on the task of word analogy and word similarity on multiple data sets.Experiments show that by using co-reference resolution, the word embedding performance in the word analogy task can be improved by around 1.88%. We find that the words that are names of countries are affected the most,which is as expected.
出处 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2017年第6期586-594,共9页 清华大学学报(自然科学版(英文版)
基金 supported by the National HighTech Research and Development(863)Program(No.2015AA015401) the National Natural Science Foundation of China(Nos.61533018 and 61402220) the State Scholarship Fund of CSC(No.201608430240) the Philosophy and Social Science Foundation of Hunan Province(No.16YBA323) the Scientific Research Fund of Hunan Provincial Education Department(Nos.16C1378 and 14B153)
关键词 word embedding co-reference resolution representation learning word embedding co-reference resolution representation learning
  • 相关文献

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部