Perbandingan Hasil Deteksi Plagiarisme Dokumen dengan Metode Jaro-Winkler Distance dan Metode Latent Semantic Analysis

Comparison of Document Plagiarism Detection Results by Jaro-Winkler Distance and Latent Semantic Analysis Methods

Tinaliah Tinaliah - Department of Informatics Management, Akademi Manajemen dan Informatika MDP Palembang, Indonesia
Triana Elizabeth - Department of Information System, STMIK Global Informatika MDP Palembang, Indonesia
Open Access Copyright (c) 2018 Jurnal Teknologi dan Sistem Komputer

Plagiarism detection applications use various methods to check how similar one document is to another. Jaro-Winkler Distance measures the distance between two strings, but the method fundamentally depends on the positions of words. Latent Semantic Analysis emphasizes the words contained in a document regardless of its linguistic characteristics. This study compares the plagiarism detection results obtained with the Jaro-Winkler Distance method and the Latent Semantic Analysis method. On the same test data, the Jaro-Winkler Distance method performed better than the Latent Semantic Analysis method: Jaro-Winkler Distance gave a plagiarism result of 100%, whereas Latent Semantic Analysis gave 97.14%.
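
To make the position dependence of Jaro-Winkler concrete, the following is a minimal Python sketch of the standard Jaro-Winkler similarity for two strings. It is not the authors' implementation: the function names, the prefix scale of 0.1, and the prefix cap of 4 are the commonly used defaults and are assumptions here, and the abstract does not say how the study preprocesses documents or aggregates scores.

```python
def jaro(s1: str, s2: str) -> float:
    """Jaro similarity in [0, 1]; 1.0 means the strings are identical."""
    if s1 == s2:
        return 1.0
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0

    # Characters count as matching only if they are equal and lie
    # within this distance of each other.
    window = max(max(len1, len2) // 2 - 1, 0)
    matched1 = [False] * len1
    matched2 = [False] * len2
    matches = 0
    for i, ch in enumerate(s1):
        lo = max(0, i - window)
        hi = min(len2, i + window + 1)
        for j in range(lo, hi):
            if not matched2[j] and s2[j] == ch:
                matched1[i] = matched2[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0

    # Walk the matched characters of both strings in order and count
    # positions where they disagree; each transposition is counted twice.
    k = 0
    half_transpositions = 0
    for i in range(len1):
        if matched1[i]:
            while not matched2[k]:
                k += 1
            if s1[i] != s2[k]:
                half_transpositions += 1
            k += 1
    t = half_transpositions / 2

    return (matches / len1 + matches / len2 + (matches - t) / matches) / 3


def jaro_winkler(s1: str, s2: str,
                 prefix_scale: float = 0.1,  # assumed default, not from the paper
                 max_prefix: int = 4) -> float:
    """Jaro similarity boosted by the length of the common prefix."""
    base = jaro(s1, s2)
    prefix = 0
    for a, b in zip(s1, s2):
        if a != b or prefix == max_prefix:
            break
        prefix += 1
    return base + prefix * prefix_scale * (1.0 - base)


if __name__ == "__main__":
    print(jaro_winkler("MARTHA", "MARHTA"))
    # Same words in a different order: the score drops below 1.0.
    print(jaro_winkler("plagiarism detection system",
                       "system detection plagiarism"))
```

Because matching is restricted to a sliding window and mismatched order is penalized as transpositions, two texts containing exactly the same words in a different order score below 1.0. A bag-of-words representation such as the term-document matrix that Latent Semantic Analysis starts from would treat those two texts as identical.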

Keywords
Plagiarism; undergraduate thesis; Jaro-Winkler distance; Latent Semantic Analysis

Article Info
Submitted: 2017-11-27
Published: 2018-01-31
Section: Articles
Language: ID