Application of the Winnowing Algorithm in Assignment Plagiarism Detection
Abstract: This paper introduces Winnowing, an algorithm for detecting document plagiarism. The algorithm splits a document into substrings, hashes each substring, and then applies a selection strategy to choose some of the hash values as the document's fingerprint. Comparing the fingerprints of different texts gives their similarity and indicates whether plagiarism exists between the documents.
Authors: 李香云 (Li Xiangyun), 葛华 (Ge Hua)
Source: Journal of Anhui Science and Technology University (《安徽科技学院学报》), 2013, No. 4, pp. 42-45 (4 pages)
Funding: Anhui Science and Technology University research project (ZRC2011273)
Keywords: plagiarism detection; Winnowing algorithm; document fingerprint
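
The pipeline summarized in the abstract (split into substrings, hash each one, select a subset as the fingerprint, compare fingerprints) can be illustrated with a minimal Python sketch. Several details here are assumptions, not taken from the abstract: the selection strategy is the minimum-hash-per-window rule from the original Winnowing paper by Schleimer, Wilkerson and Aiken; CRC32 stands in for whatever hash the authors used; the similarity measure is Jaccard overlap; and the function names and the parameters `k` (substring length) and `w` (window size) are hypothetical.

```python
import zlib


def normalize(text: str) -> str:
    """Lowercase the text and keep only letters and digits."""
    return "".join(ch.lower() for ch in text if ch.isalnum())


def kgram_hashes(text: str, k: int) -> list[int]:
    """Hash every k-character substring (k-gram) of the text.

    CRC32 is an assumption; any deterministic hash would do.
    """
    return [zlib.crc32(text[i:i + k].encode("utf-8"))
            for i in range(len(text) - k + 1)]


def winnow(hashes: list[int], w: int) -> set[tuple[int, int]]:
    """Select the minimum hash from each window of w consecutive hashes.

    Ties are broken by taking the rightmost minimum. Each selected
    (hash, position) pair is recorded only once.
    """
    selected = set()
    for start in range(max(len(hashes) - w + 1, 1)):
        window = hashes[start:start + w]
        if not window:
            break
        smallest = min(window)
        # Rightmost position of the minimum within this window.
        offset = len(window) - 1 - window[::-1].index(smallest)
        selected.add((smallest, start + offset))
    return selected


def fingerprint(text: str, k: int = 5, w: int = 4) -> set[int]:
    """Return the set of selected hash values for a document."""
    hashes = kgram_hashes(normalize(text), k)
    return {h for h, _ in winnow(hashes, w)}


def similarity(a: str, b: str, k: int = 5, w: int = 4) -> float:
    """Jaccard similarity of two fingerprints (an assumed measure)."""
    fa, fb = fingerprint(a, k, w), fingerprint(b, k, w)
    if not fa and not fb:
        return 0.0
    return len(fa & fb) / len(fa | fb)
```

Calling `similarity(doc_a, doc_b)` returns a value between 0 and 1, and two texts that differ only in case, spacing, or punctuation score 1.0 because normalization removes those differences. In practice `k` and `w` would be tuned to the kind of assignment being checked.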