Text Documents Plagiarism Detection using Rabin-Karp and Jaro-Winkler Distance Algorithms
Plagiarism is an act that is considered by the university as a fraud by taking someone ideas or writings without mentioning the references and claimed as his own. Plagiarism detection system is generally implement string matching algorithm in a text document to search for common words between docume...
Saved in:
Main Authors: | , |
---|---|
Format: | EJournal Article |
Published: |
Institute of Advanced Engineering and Science,
2017-02-01.
|
Subjects: | |
Online Access: | Get fulltext |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
LEADER | 02320 am a22003013u 4500 | ||
---|---|---|---|
001 | ijeecs6097_6128 | ||
042 | |a dc | ||
100 | 1 | 0 | |a Leonardo, Brinardi |e author |
100 | 1 | 0 | |e contributor |
700 | 1 | 0 | |a Hansun, Seng |e author |
245 | 0 | 0 | |a Text Documents Plagiarism Detection using Rabin-Karp and Jaro-Winkler Distance Algorithms |
260 | |b Institute of Advanced Engineering and Science, |c 2017-02-01. | ||
500 | |a https://ijeecs.iaescore.com/index.php/IJEECS/article/view/6097 | ||
520 | |a Plagiarism is an act that is considered by the university as a fraud by taking someone ideas or writings without mentioning the references and claimed as his own. Plagiarism detection system is generally implement string matching algorithm in a text document to search for common words between documents. There are some algorithms used for string matching, two of them are Rabin-Karp and Jaro-Winkler Distance algorithms. Rabin-Karp algorithm is one of compatible algorithms to solve the problem of multiple string patterns, while, Jaro-Winkler Distance algorithm has advantages in terms of time. A plagiarism detection application is developed and tested on different types of documents, i.e. doc, docx, pdf and txt. From the experimental results, we obtained that both of these algorithms can be used to perform plagiarism detection of those documents, but in terms of their effectiveness, Rabin-Karp algorithm is much more effective and faster in the process of detecting the document with the size more than 1000 KB. | ||
540 | |a Copyright (c) 2017 Indonesian Journal of Electrical Engineering and Computer Science | ||
540 | |a http://creativecommons.org/licenses/by-nc-nd/4.0 | ||
546 | |a eng | ||
690 | |a technology; computer science | ||
690 | |a Jaro-Winkler distance; Plagiarism; Rabin-Karp; String matching | ||
655 | 7 | |a info:eu-repo/semantics/article |2 local | |
655 | 7 | |a info:eu-repo/semantics/publishedVersion |2 local | |
655 | 7 | |2 local | |
786 | 0 | |n Indonesian Journal of Electrical Engineering and Computer Science; Vol 5, No 2: February 2017; 462-471 | |
786 | 0 | |n 2502-4760 | |
786 | 0 | |n 2502-4752 | |
786 | 0 | |n 10.11591/ijeecs.v5.i2 | |
787 | 0 | |n https://ijeecs.iaescore.com/index.php/IJEECS/article/view/6097/6128 | |
856 | 4 | 1 | |u https://ijeecs.iaescore.com/index.php/IJEECS/article/view/6097/6128 |z Get fulltext |