摘要
通过分析传统的基于向量空间模型(VSM)文本相似度计算算法存在的不足,提出一种改进的文本相似度计算算法。改进算法充分考虑到了文本间相同特征词对文本相似度的影响,有效减少了相似度低的文本干扰。仿真实验和系统运行结果验证了改进算法的有效性和准确性。
Aiming at the shortcoming of traditional VSM-based text similarity algorithm,an improved algorithm of text similarity is proposed in this paper.It fully takes into account the effect of same feature words between texts on the similarity of text,therefore effectively reduces the interference of the texts with lower similarity.Simulative experiment and system running results have attested the new algorithm in its effectiveness and accuracy.
出处
《计算机应用与软件》
CSCD
北大核心
2012年第2期282-284,共3页
Computer Applications and Software