skip to main content

Perbandingan Hasil Deteksi Plagiarisme Dokumen dengan Metode Jaro-Winkler Distance dan Metode Latent Semantic Analysis

Comparison of Document Plagiarism Detection Results by Jaro-Winkler Distance and Latent Semantic Analysis Methods

1Department of Informatics Management, Akademi Manajemen dan Informatika MDP Palembang, Indonesia

2Department of Information System, STMIK Global Informatika MDP Palembang, Indonesia

Received: 27 Nov 2017; Published: 31 Jan 2018.
Open Access Copyright (c) 2018 Jurnal Teknologi dan Sistem Komputer under http://creativecommons.org/licenses/by-sa/4.0.

Citation Format:
Abstract

Various methods are applied in the application of plagiarism detection to help check the similarity of a document. Jaro-Winkler Distance can measure the distance between two strings. However, this method basically depends on the position of the word. Latent Semantic Analysis emphasizes the words contained in the document regardless of its linguistic character. This study compares the results of plagiarism detection using the Jaro-Winkler Distance and the Latent Semantic Analysis method. From comparing results of  Jaro-Winkler Distance method and Latent Semantic Analysis method, Jaro-Winkler Distance method is better than Latent Semantic Analysis method if using the same test data. Jaro-Winkler Distance method will give plagiarism result 100% and Latent Semantic Analysis method will give plagiarism result 97,14%.

Keywords: Plagiarism; undergraduated thesis; Jaro-Winkler distance; Latent Semantic Analysis

Article Metrics:

  1. S. Sastroasmoro, "Beberapa Catatan tentang Plagiarisme," Majalah Kedokteran Indonesia, vol. 56, no. 1, pp. 1-6, 2006
  2. Z. Zulkarnain, “Plagiarisme Dalam Menghasilkan Karya Tulis Ilmiah,” April 2013. [Online]. Available: http://www.unja.ac.id/2013/04/10/prof-dr-ir-h-zulkarnain-mhortsc/. [Diakses: Nov, 15, 2017]
  3. A. Kornain, F. Yansen, and T. Tinaliah, “Penerapan Algoritma Jaro-Winkler Distance Untuk Sistem Deteksi Plagiarisme pada Dokumen Teks Berbahasa Indonesia,” Skripsi, STMIK MDP, Oktober 2014
  4. A. Kurniawati, S. Puspitodjati, and S. Rahman, "Implementasi Algoritma Jaro-Winkler Distance untuk Membandingkan Kesamaan Dokumen Berbahasa Indonesia", Skripsi Program Studi Sistem Informasi, Universitas Gunadarma, 2010
  5. Y. Faranika, H. Kurniawan, and N. Nikentari, "Sistem Pengukur Kemiripan Dokumen Menggunakan Algoritma Jaro-Winkler Distance," Skripsi, Universitas Maritim Raja Ali Haji, 2014
  6. D. A. Perkasa, E. Saputra, and M. Fronita, “Sistem Ujian Online Essay dengan Penilaian Menggunakan Metode Latent Semantic Analysis (LSA), “ Jurnal Rekayasa dan Manajemen Sistem Informasi, vol. 1, no. 1, Februari 2015, pp. 1-9
  7. D. W. Wicaksono, M. I. Irawan, and A. M. Rukmi, “Sistem Deteksi Kemiripan Antar Dokumen Teks Menggunakan Model Bayesian pada Term Latent Semantic Analysis (LSA),” Jurnal Sains dan Seni POMITS, vol. 3, no. 2, 2014, pp.41-46
  8. N. Khairunnisa, D. S. Sihabudin, and A. Wibowo, “Aplikasi Pendeteksi Plagiat dengan Menggunakan Metode Latent Semantic Analysis (Studi Kasus : Laporan TA PCR),“ Jurnal Aksara Komputer Terapan, vol. 1, no. 2, 2012
  9. T. Tudesman, E. Oktalina, T. Tinaliah, Y. Yoannita, “Sistem Deteksi Plagiarisme Dokumen Bahasa Indonesia Menggunakan Metode Vector Space Model,” Skripsi, STMIK MDP, Oktober 2014
  10. S. Soleman, and A. Purwirianti, “Experiments on the Indonesian Plagiarism Detection using Latent Semantic Analysis,” in 2014 2nd International Conference on Information and Communication Technology (IcoICT), 28-30 May 2014, Bandung, Indonesia
  11. T. Tinaliah, “Ringkasan Multi-Dokumen Berbahasa Indonesia Secara Otomatis Menggunakan Metode Latent Semantic Analysis dan Centroid-Based Summarization,” Tesis, Universitas Indonesia, 2013
  12. B. Leonardo, and S. Hansun, "Text Documents Plagiarism Detection using Rabin-Karp and Jaro-Winkler Distance Algorithms," Indonesian Journal of Electrical Engineering and Computer Science, vol. 5, no. 2, pp. 462-471, 2017
  13. W. E. Winkler, "String Comparator Metrics and Enhanced Decision Rules in the Fellegi-Sunter Model of Record Linkage," in Proceedings of the Section on Survey Methods, American Statistical Association, 1990, pp. 354-359
  14. S. T. Dumais, "Latent Semantic Analysis," Annual Review of Information Science and Technology, vol. 38, no. 1, pp. 188-230, 2004

Last update:

  1. Comparison between Jaro-Winkler Distance Algorithm and Winnowing Algorithm in detecting word similarities in Indonesian documents

    Inte Christinawati Bu'ulolo, Melani Isabella Siregar, Clara Fellysa Simanjuntak. THE 2ND INTERNATIONAL CONFERENCE OF SCIENCE AND INFORMATION TECHNOLOGY IN SMART ADMINISTRATION (ICSINTESA 2021), 2658 , 2022. doi: 10.1063/5.0111181
  2. Comparison of document similarity measurements in scientific writing using Jaro-Winkler Distance method and Paragraph Vector method

    S C Cahyono. IOP Conference Series: Materials Science and Engineering, 662 (5), 2019. doi: 10.1088/1757-899X/662/5/052016
  3. The Hybrid of Jaro-Winkler and Rabin-Karp Algorithm in Detecting Indonesian Text Similarity

    Muhamad Arief Yulianto, Nurhasanah Nurhasanah. Jurnal Online Informatika, 6 (1), 2021. doi: 10.15575/join.v6i1.640

Last update: 2024-11-22 05:40:17

No citation recorded.