
An Improved New Word Synthesis Algorithm Based on Multi-Word Mutual Information and Branch Entropy (cited by: 5)
Abstract On microblogs, new words are formed by rules that are diverse, complex, and constantly changing. New word discovery methods based on internal cohesion and boundary freedom tend to yield candidate words whose internal cohesion is low. To address this problem, the paper improves a new word synthesis algorithm that combines multi-word mutual information with left and right branch entropy (adjacency entropy). Multi-word mutual information raises the internal cohesion of the candidate words, which in turn improves the precision of new word recognition. Experimental results show that the improved method effectively improves new word recognition on microblog text.
Author WANG Xin (王欣), College of Computer and Information Science, Chongqing Normal University, Chongqing 401331
Source Modern Computer (《现代计算机(中旬刊)》), 2018, No. 4, pp. 7-11 (5 pages)
Keywords multi-word mutual information; branch entropy (adjacency entropy); new word synthesis algorithm
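The abstract names two scores without giving their formulas: an internal cohesion score from multi-word mutual information and a boundary freedom score from left and right branch entropy. The Python sketch below illustrates one common way to compute such scores at the character level. It is not the paper's algorithm. In particular, taking the minimum pointwise mutual information over all binary splits of a candidate is an assumed generalization to strings longer than two characters, and the function names `ngram_counts`, `multi_char_pmi`, and `branch_entropy` are hypothetical.

```python
import math
from collections import Counter


def ngram_counts(text, max_len):
    """Count every character n-gram of length 1..max_len in text."""
    counts = Counter()
    for n in range(1, max_len + 1):
        for i in range(len(text) - n + 1):
            counts[text[i:i + n]] += 1
    return counts


def multi_char_pmi(word, counts, total_chars):
    """Internal cohesion: minimum PMI over all binary splits of word.

    Assumption: probabilities are approximated as count / total_chars,
    and the weakest split decides the score. The paper may define its
    multi-word mutual information differently.
    """
    if len(word) < 2:
        raise ValueError("candidate must have at least two characters")
    if counts[word] == 0:
        raise ValueError("candidate does not occur in the corpus")
    p_word = counts[word] / total_chars
    scores = []
    for k in range(1, len(word)):
        p_left = counts[word[:k]] / total_chars
        p_right = counts[word[k:]] / total_chars
        scores.append(math.log2(p_word / (p_left * p_right)))
    return min(scores)


def branch_entropy(word, text, side):
    """Boundary freedom: entropy of the characters adjacent to word.

    side is "left" or "right". Occurrences at the edge of the text
    have no neighbour on that side and are skipped.
    """
    neighbours = Counter()
    start = text.find(word)
    while start != -1:
        idx = start - 1 if side == "left" else start + len(word)
        if 0 <= idx < len(text):
            neighbours[text[idx]] += 1
        start = text.find(word, start + 1)
    total = sum(neighbours.values())
    if total == 0:
        return 0.0
    return -sum(c / total * math.log2(c / total) for c in neighbours.values())


# Toy corpus: "ab" occurs three times with varied neighbours.
text = "xabyxabzwabq"
counts = ngram_counts(text, max_len=2)
cohesion = multi_char_pmi("ab", counts, len(text))
freedom = min(branch_entropy("ab", text, "left"),
              branch_entropy("ab", text, "right"))
print(cohesion, freedom)
```

A candidate would typically be accepted as a new word only when both scores exceed chosen thresholds. The abstract does not state threshold values, so any thresholds would have to be tuned on the target corpus.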