Optimasi MWMOTE pada data tidak seimbang menggunakan complete linkage

MWMOTE optimization for imbalanced data using complete linkage

Department of Informatics, Institut Teknologi Sumatera, Jl. Ryacudu, Lampung Selatan 35365, Indonesia

Received: 11 May 2020; Revised: 7 Dec 2020; Accepted: 18 Jan 2021; Published: 30 Apr 2021; Available online: 20 Apr 2021.
Imbalanced data can result in classification errors and can decrease the performance and accuracy of oversampling methods such as MWMOTE. The clustering step in MWMOTE can be optimized to improve synthetic data generation and, in turn, MWMOTE performance. This study aims to optimize the clustering step that MWMOTE uses when generating synthetic data by applying complete linkage (CL). The datasets used had a variety of class ratios. A decision tree classifier was used to measure the performance of MWMOTE and CL-MWMOTE oversampling. In the evaluation, CL-MWMOTE performed better than MWMOTE, increasing precision, recall, F-measure, and accuracy by 0.53%, 0.67%, 0.66%, and 0.67%, respectively.

Keywords: imbalanced data; clustering; complete linkage; optimization; oversampling
Funding: Institut Teknologi Sumatera, Indonesia

