Abstract
This paper introduces Winnowing, an algorithm for detecting document plagiarism. The algorithm partitions the text into substrings, hashes each substring, and then applies a selection strategy to choose a subset of the hash values as the document's fingerprint. Comparing the fingerprints of different texts gives a measure of their similarity, which is used to judge whether plagiarism exists between documents.
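The steps summarized in the abstract can be sketched in Python as follows. This is a minimal illustration, not the paper's implementation: the abstract does not spell out the selection strategy, so the sketch assumes the rule commonly associated with Winnowing (take the minimum hash in each sliding window, choosing the rightmost one on ties). The function names, the CRC32 hash, the text normalisation, the parameters `k` and `w`, and the Jaccard-style similarity score are all assumptions made for this example.

```python
import zlib


def kgram_hashes(text, k):
    """Hash every k-character substring (k-gram) of the normalised text."""
    # Assumption: lowercase the text and drop all whitespace before hashing.
    s = "".join(text.lower().split())
    # Assumption: CRC32 stands in for whatever hash function is actually used.
    return [zlib.crc32(s[i:i + k].encode("utf-8")) for i in range(len(s) - k + 1)]


def winnow(hashes, w):
    """Return the set of (hash, position) fingerprints chosen by winnowing.

    Each window of w consecutive hashes contributes its minimum value;
    when the minimum occurs more than once, the rightmost one is taken.
    """
    if not hashes:
        return set()
    # If there are fewer than w hashes, treat the whole list as one window.
    size = min(w, len(hashes))
    fingerprints = set()
    for start in range(len(hashes) - size + 1):
        window = hashes[start:start + size]
        smallest = min(window)
        # Index of the rightmost occurrence of the minimum inside the window.
        offset = size - 1 - window[::-1].index(smallest)
        fingerprints.add((smallest, start + offset))
    return fingerprints


def similarity(text_a, text_b, k=5, w=4):
    """Jaccard overlap of the two fingerprint hash sets (positions ignored)."""
    fa = {h for h, _ in winnow(kgram_hashes(text_a, k), w)}
    fb = {h for h, _ in winnow(kgram_hashes(text_b, k), w)}
    if not fa or not fb:
        return 0.0
    return len(fa & fb) / len(fa | fb)
```

Calling `similarity(doc_a, doc_b)` returns a value between 0 and 1; how high a score must be before two documents are flagged is a threshold that would have to be chosen separately. Larger `k` makes matches on short common phrases less likely, while larger `w` produces fewer fingerprints per document.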
Source
Journal of Anhui Science and Technology University (《安徽科技学院学报》)
2013, Issue 4, pp. 42-45 (4 pages)
Funding
Anhui Science and Technology University research project (ZRC2011273)