摘要
提出了一种综合多特征的句子相似度计算方法,该方法分别从句子的句法、词汇语义、词形三个方面来度量句子的相似度,最后将这三个方面加权整合计算得到句子的相似度。本方法综合考虑了句子的深层和表层信息,并对句子进行了词汇扩展,从而使句子相似度计算更加准确。
A method for sentence similarity computation by integrating multi-features was proposed.According to the syntax feature,semantic feature and word feature of the sentences,the similarity was measured,respectively.Then,this paper combined the sentence similarity by endowing the above three features with different weights.Comparatively,this kind of estimation of sentence similarity is more accurate than the previous because both the deep and surface information of the sentences were taken into accounted,and the vocabulary of sentences was also extended in the process of calculation.
出处
《计算机系统应用》
2010年第11期110-114,共5页
Computer Systems & Applications
关键词
句子相似度计算
多特征
树核
权值
sentence similarity computation
multi-features
tree kernel
weight