Source code plagiarism is a common occurrence in undergraduate computer science education. In order to identify such cases, many source code plagiarism detection tools have been proposed. A source code plagiarism detection tool evaluates pairs of assignment submissions to detect indications of plagiarism. However, a plagiarising student will commonly apply plagiarism-hiding modifications to source code in an attempt to evade detection. Subsequently, prior work has implied that currently available source code plagiarism detection tools are not robust to the application of pervasive plagiarism-hiding modifications. In this article, 11 source code plagiarism detection tools are evaluated for robustness against plagiarism-hiding modifications. The tools are evaluated with data sets of simulated undergraduate plagiarism, constructed with source code modifications representative of undergraduate students. The results of the performed evaluations indicate that currently available source code plagiarism detection tools are not robust against modifications which apply fine-grained transformations to the source code structure. Of the evaluated tools, JPlag and Plaggie demonstrates the greatest robustness to different types of plagiarism-hiding modifications. However, the results also indicate that graph-based tools (specifically those that compare programs as program dependence graphs) show potentially greater robustness to pervasive plagiarism-hiding modifications.
翻译:在本科本科计算机科学教育中常见的就是源代码图案。为了识别此类情况,提出了许多源代码图案图案检测工具。源代码图案检测工具评估了成对任务提交书的稳健性,以发现有污损性迹象。然而,一个受挫学生通常会对源代码进行有污损的修改,以图逃避检测。随后,先前的工作表明,现有源代码图案检测工具对应用普遍存在的损害性图案的修改并不健全。在本篇文章中,对11个源代码图案检测工具进行了评估,以稳健性防止受挫性图案影响的变化。这些工具是用模拟本科生模型图案模型图案数据集进行的,该模型图案将代表本科生学生的源代码修改。所完成的评估结果显示,目前可用的源代码图案检测工具对源代码图案结构进行精确的修改并不健全。在经过评估的工具中,JBlagi和Plaggie图案显示了最稳健的精确性,这些图案图案的对比性也显示了更稳健的图状图型模型。