• Jul 03, 2017 News!Good News! Since 2017, IJMLC has been indexed by Scopus!
  • Jul 06, 2017 News!Vol.7, No.2 has been published with online version.   [Click]
  • Jul 01, 2017 News!Vol.7, No.1 has been published with online version.   [Click]
Search
General Information
Editor-in-chief
Dr. Lin Huang
Metropolitan State University of Denver, USA
It's my honor to take on the position of editor in chief of IJMLC. We encourage authors to submit papers concerning any branch of machine learning and computing.
IJMLC 2011 Vol.1(4): 337-343 ISSN: 2010-3700
DOI: 10.7763/IJMLC.2011.V1.50

A Machine Learning Based Tool for Source Code Plagiarism Detection

Upul Bandara and Gamini Wijayarathna

Abstract—Source code plagiarism is a severe problem in academia. In academia programming assignments are used to evaluate students in programming courses. Therefore checking programming assignments for plagiarism is essential. If a course consists of a large number of students, it is impractical to check each assignment by a human inspector. Therefore it is essential to have automated tools in order to assist detection of plagiarism in programming assignments. Majority of the current source code plagiarism detection tools are based on structured methods. Structural properties of a plagiarized program and the original program differ significantly. Therefore it is hard to detect plagiarized programs when plagiarism level is 4 or above by using tools which are based on structural methods. This paper presents a new plagiarism detection method, which is based on machine learning techniques. We have trained and tested three machine learning algorithms for detecting source code plagiarism. Furthermore, we have utilized a meta-learning algorithm in order to improve the accuracy of our system.

Index Terms—k-nearest neighbor, machine learning, naïve bayes classifier, plagiarism detection, source code

U Bandara is with the Virtusa Corporation, Sri Lanka (e-mail: upulbandara@ gmail.com).
G. Wijayarathna is with the Faculty of Science, University of Kelaniya, Sri Lanka(e-mailgamini@kln.ac.lk).

[PDF]

Cite: Upul Bandara and Gamini Wijayarathna, "A Machine Learning Based Tool for Source Code Plagiarism Detection," International Journal of Machine Learning and Computing vol. 1, no. 4, pp. 337-343 , 2011.

Copyright © 2008-2015. International Journal of Machine Learning and Computing. All rights reserved.
E-mail: ijmlc@ejournal.net