Text Documents Plagiarism Detection using Rabin-Karp and Jaro-Winkler Distance Algorithms

Plagiarism is an act that is considered by the university as a fraud by taking someone ideas or writings without mentioning the references and claimed as his own. Plagiarism detection system is generally implement string matching algorithm in a text document to search for common words between docume...

Full description

Saved in:
Bibliographic Details
Main Authors: Leonardo, Brinardi (Author), Hansun, Seng (Author)
Format: EJournal Article
Published: Institute of Advanced Engineering and Science, 2017-02-01.
Subjects:
Online Access:Get fulltext
Tags: Add Tag
No Tags, Be the first to tag this record!
LEADER 02320 am a22003013u 4500
001 ijeecs6097_6128
042 |a dc 
100 1 0 |a Leonardo, Brinardi  |e author 
100 1 0 |e contributor 
700 1 0 |a Hansun, Seng  |e author 
245 0 0 |a Text Documents Plagiarism Detection using Rabin-Karp and Jaro-Winkler Distance Algorithms 
260 |b Institute of Advanced Engineering and Science,   |c 2017-02-01. 
500 |a https://ijeecs.iaescore.com/index.php/IJEECS/article/view/6097 
520 |a Plagiarism is an act that is considered by the university as a fraud by taking someone ideas or writings without mentioning the references and claimed as his own. Plagiarism detection system is generally implement string matching algorithm in a text document to search for common words between documents. There are some algorithms used for string matching, two of them are Rabin-Karp and Jaro-Winkler Distance algorithms. Rabin-Karp algorithm is one of compatible algorithms to solve the problem of multiple string patterns, while, Jaro-Winkler Distance algorithm has advantages in terms of time. A plagiarism detection application is developed and tested on different types of documents, i.e. doc, docx, pdf and txt. From the experimental results, we obtained that both of these algorithms can be used to perform plagiarism detection of those documents, but in terms of their effectiveness, Rabin-Karp algorithm is much more effective and faster in the process of detecting the document with the size more than 1000 KB. 
540 |a Copyright (c) 2017 Indonesian Journal of Electrical Engineering and Computer Science 
540 |a http://creativecommons.org/licenses/by-nc-nd/4.0 
546 |a eng 
690 |a technology; computer science 
690 |a Jaro-Winkler distance; Plagiarism; Rabin-Karp; String matching 
655 7 |a info:eu-repo/semantics/article  |2 local 
655 7 |a info:eu-repo/semantics/publishedVersion  |2 local 
655 7 |2 local 
786 0 |n Indonesian Journal of Electrical Engineering and Computer Science; Vol 5, No 2: February 2017; 462-471 
786 0 |n 2502-4760 
786 0 |n 2502-4752 
786 0 |n 10.11591/ijeecs.v5.i2 
787 0 |n https://ijeecs.iaescore.com/index.php/IJEECS/article/view/6097/6128 
856 4 1 |u https://ijeecs.iaescore.com/index.php/IJEECS/article/view/6097/6128  |z Get fulltext