Prediksi pembatalan pemesanan hotel menggunakan optimalisasi hiperparameter pada algoritme Random Forest

Yufis Azhar; Galang Aji Mahesa; Moch. Chamdani Mustaqim

doi:10.14710/jtsiskom.2020.13790

DOI: https://doi.org/10.14710/jtsiskom.2020.13790

Prediksi pembatalan pemesanan hotel menggunakan optimalisasi hiperparameter pada algoritme Random Forest

Prediction of hotel bookings cancellation using hyperparameter optimization on Random Forest algorithm

Yufis Azhar, Galang Aji Mahesa, Moch. Chamdani Mustaqim

Department of Informatics, Universitas Muhammadiyah Malang. Jl. Raya Tlogomas No.246, Malang, Jawa Timur 65144, Indonesia

Received: 14 Jun 2020; Revised: 16 Nov 2020; Accepted: 27 Nov 2020; Available online: 7 Dec 2020; Published: 31 Jan 2021.

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Citation Format:

Abstract

Cancellation of hotel bookings by customers greatly influences hotel managerial decision making. To minimize losses by this problem, the hotel management made a fairly rigid policy that could damage the reputation and business performance. Therefore, this study focuses on solving these problems using machine learning algorithms. To get the best model performance, hyperparameter optimization is applied to the random forest algorithm. It aims to obtain the best combination of model parameters in predicting hotel booking cancellations. The proposed model is proven to have the best performance with the highest accuracy results of 87 %. This study's results can be used as a model component in hotel managerial decision-making systems related to future bookings' cancellation.

Fulltext View|Download Email colleagues

Keywords: classification; hyperparameter optimization; random forest

Funding: Universitas Muhammadiyah Malang, Indonesia

Article Metrics:

Article Info

Section: Original Research Articles

Language : ID

In Volume 9, Issue 1, Year 2021 (January 2021)

PSS Tuning on Power Generator System using Flower Pollination Algorithm Segmentation and analysis of Pap smear microscopic images using the K-means and J48 algorithms Face recognition system with PCA-GA algorithm for smart home door security using Rasberry Pi Perbandingan Metode Ensemble Machine Learning untuk Klasifikasi Tenaga Kerja di Indonesia dengan Random Forest, XGBoost, dan CatBoost Data scaling performance on various machine learning algorithms to identify abalone sex More related articles

Most cited articles

Implementasi Algoritma Kriptografi RSA untuk Enkripsi dan Dekripsi Email Design of wireless sensor networks (WSN) to monitor temperature and humidity using nrf24l01 Android Application of Expert System for Gastroenteritis Detection Pengembangan Sistem Pakar Untuk Diagnosis Penyakit Hepatitis Berbasis Web Menggunakan Metode Certainty Factor Web Monitoring System of pH Level, Temperature and Color on River Water using Wireless Sensor Network More cited articles

S. Kitamori, H. Sakai, and H. Sakaji, “Extraction of sentences concerning business performance forecast and economic forecast from summaries of financial statements by deep learning,” in IEEE Symposium Series on Computational Intelligence, Honolulu, USA, Dec. 2017, pp. 1-7. doi: 10.1109/SSCI.2017.8285335
N. Antonio, A. de Almeida, and L. Nunes, “Predicting hotel bookings cancellation with a machine learning classification model,” in IEEE International Conference on Machine Learning and Applications, Cancun, Mexico, Dec. 2017, pp. 1049–1054. doi: 10.1109/ICMLA.2017.00-11
N. Antonio, A. de Almeida, and L. Nunes, “Predicting hotel booking cancellations to decrease uncertainty and increase revenue,” Tourism & Management Studies, vol. 13, no. 2, pp. 25–39, 2017. doi: 10.18089/tms.2017.13203
L. Rokach, “Decision forest: Twenty years of research,” Information Fusion, vol. 27, pp. 111–125, 2016. doi: 10.1016/j.inffus.2015.06.005
P. Fernandez-Gonzalez, C. Bielza, and P. Larranaga, “Random forests for regression as a weighted sum of k-potential nearest neighbors,” IEEE Access, vol. 7, pp. 25660–25672, 2019. doi: 10.1109/ACCESS.2019.2900755
M. C. M. Oo and T. Thein, “Hyperparameters optimization in scalable random forest for big data analytics,” in 4th International Conference on Computer and Communication Systems, Singapore, Singapore, Feb. 2019, pp. 125-129. doi: 10.1109/CCOMS.2019.8821752
B. H. Shekar and G. Dagnew, “Grid search-based hyperparameter tuning and classification of microarray cancer data,” in International Conference on Advanced Computational and Communication Paradigms, Gantok, India, Feb. 2019, pp. 1-8. doi: 10.1109/ICACCP.2019.8882943
T. Wang et al., “Random forest-bayesian optimization for product quality prediction with large-scale dimensions in process industrial cyber-physical systems,” IEEE Internet Things Journal, vol. 7, no. 9, pp. 8641-8653, 2020. doi: 10.1109/JIOT.2020.2992811
E. Hazan, A. Klivans, and Y. Yuan, “Hyperparameter optimization: A spectral approach,” 2017, arXiv:1706.00764
J. Bergstra and Y. Bengio, “Random search for hyper-parameter optimization,” Journal of Machine Learning Research, vol. 13, pp. 281-305, 2012
N. Antonio, A. de Almeida, and L. Nunes, “Hotel booking demand datasets,” Data in Brief, vol. 22, pp. 41–49, 2019. doi: 10.1016/j.dib.2018.11.126
C. Seger, “An investigation of categorical variable encoding techniques in machine learning: binary versus one-hot and feature hashing,” thesis, KTH Royal Insitute of Technology, Stockholm, Sweden, 2018
J. S. Lee, “AUC4.5: AUC-based C4.5 decision tree algorithm for imbalanced data classification,” IEEE Access, vol. 7, pp. 106034 – 106042, 2019, doi: 10.1109/ACCESS.2019.2931865
J. Li, S. Ma, T. Le, L. Liu, and J. Liu, “Causal decision trees,” IEEE Transactions on Knowledge and Data Engineering, vol. 29, no. 2, pp. 257–271, 2017. doi: 10.1109/TKDE.2016.2619350
T. Chen and C. Guestrin, “XGBoost: A scalable tree boosting system,” in International Conference on Knowledge Discovery and Data Mining, New York, USA, Aug. 2016, pp. 785-794. doi: 10.1145/2939672.2939785
M. Chen, Q. Liu, S. Chen, Y. Liu, C. H. Zhang, and R. Liu, “XGBoost-Based algorithm interpretation and application on post-fault transient stability status prediction of power system,” IEEE Access, vol. 7, pp. 13149-13158, 2019. doi: 10.1109/ACCESS.2019.2893448
N. Li, B. Li, and L. Gao, “Transient stability assessment of power system based on XGBoost and factorization machine,” IEEE Access, vol. 8, pp. 28403-28414,2020. doi: 10.1109/ACCESS.2020.2969446
S. Georganos, T. Grippa, S. Vanhuysse, M. Lennert, M. Shimoni, and E. Wolff, “Very high resolution object-based land use-land cover urban classification using extreme gradient boosting,” IEEE Geoscience and Remote Sensing Letter, vol. 15, no. 4, pp. 607-611, 2018. doi: 10.1109/LGRS.2018.2803259
D. Zhang, L. Qian, B. Mao, C. Huang, B. Huang, and Y. Si, “A data-driven design for fault detection of wind turbines using Random Forests and XGboost,” IEEE Access, vol. 6, pp. 21020–21031, 2018. doi: 10.1109/ACCESS.2018.2818678
L. Breiman, “Random forests,” Machine Learning, vol. 45, pp. 5–32, 2001. doi: 10.1023/A:1010933404324
S. Liu, H. Li, Y. Zhang, B. Zou, and J. Zhao, “Random forest-based track initiation method,” Journal of Engineering, vol. 2019, no. 19, pp. 6175-6179, 2019. doi: 10.1049/joe.2019.0180
A. Primajaya and B. N. Sari, “Random Forest algorithm for prediction of precipitation,” Indonesian Journal of Artificial Intelligence and Data Mining, vol. 1, no. 1, pp. 27, 2018. doi: 10.24014/ijaidm.v1i1.4903
D. Marinov and D. Karapetyan, “Hyperparameter optimisation with early termination of poor performers,” in Computer Science and Electronic Engineering, Colchester, UK, Sept. 2019, pp. 160–163. doi: 10.1109/CEEC47804.2019.8974317
B. Nakisa, M. N. Rastgoo, A. Rakotonirainy, F. Maire, and V. Chandran, “Long short term memory hyperparameter optimization for a neural network based emotion recognition framework,” IEEE Access, vol. 6, pp. 49325–49338, 2018. doi: 10.1109/ACCESS.2018.2868361
M. Feurer and F. Hutter, Hyperparameter Optimization. Springer, 2019. doi: 10.1007/978-3-030-05318-5_1

Last update:

Proceedings of the 2023 3rd International Conference on Public Management and Intelligent Society (PMIS 2023)
Ying Wang. Atlantis Highlights in Intelligent Systems, 8 , 2023. doi: 10.2991/978-94-6463-200-2_41
Classification Analysis of Star Hotels in Banyumas Using the Random Forest Method
Annisaa Utami, Dimas Fanny Hebrasianto Permadi, Bita Parga Zen, Faisal Dharma Adhinata, Wanvy Arifha Saputra. 2024 International Conference on Information Technology and Computing (ICITCOM), 2024. doi: 10.1109/ICITCOM62788.2024.10762547
Hotel Booking Cancelation Prediction using ML algorithms
M. Venkata Rakesh, S. Prasanna Kumar, Yogitha, R Aishwarya.. 2022 Second International Conference on Artificial Intelligence and Smart Energy (ICAIS), 2022. doi: 10.1109/ICAIS53314.2022.9742843
Rapid Identification of Tobacco Mildew Based on Random Forest Algorithm
Zhimin Jiang, Wenjun Zhang, Haixia Huang, Zhengguang Zhai, Dairong chen, Yongfeng Ai, Bo Li, Xiaoxiang Chen, Lianhui Li. Scientific Programming, 2022 , 2022. doi: 10.1155/2022/1818398
Dataset Outlier Detection Method Based on Random Forest Algorithm
Yingzi Zheng. 2022 4th International Conference on Artificial Intelligence and Advanced Manufacturing (AIAM), 2022. doi: 10.1109/AIAM57466.2022.00111
Online credit default prediction model based on fusion of Random Forest and XGBoost algorithm
Yanhua Shen, Han Bao, Yanhui Feng, Yonghui Dai. 2023 IEEE 3rd International Conference on Information Technology, Big Data and Artificial Intelligence (ICIBA), 2023. doi: 10.1109/ICIBA56860.2023.10165530
Equipment Damage Measurement Method of Wartime Based on FCE-PCA-RF
Mingyu Li, Lu Gao, Hongwei Xu, Kai Li, Yisong Huang. Journal of Systems Engineering and Electronics, 35 (3), 2024. doi: 10.23919/JSEE.2024.000065
Clinical Nursing Strategies for Pulmonary Disease Patients Based on Random Forest Algorithm
Dai Yu, Ding Luping, He Lingmei, Wang Junfeng, Cui Dehua. 2023 International Conference on Electronics and Devices, Computational Science (ICEDCS), 2023. doi: 10.1109/ICEDCS60513.2023.00114

Last update: 2025-07-06 16:52:11

No citation recorded.

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Starting from 2021, the author(s) whose article is published in the JTSiskom journal attain the copyright for their article and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. By submitting the manuscript to JTSiskom, the author(s) agree with this policy. No special document approval is required.

The author(s) guarantee that:

their article is original, written by the mentioned author(s),
has never been published before,
does not contain statements that violate the law, and
does not violate the rights of others, is subject to copyright held exclusively by the author(s), is free from the rights of third parties, and the necessary written permission to quote from other sources has been obtained by the author(s).

The author(s) retain all rights to the published work, such as (but not limited to) the following rights:

Copyright and other proprietary rights related to the article, such as patents,
The right to use the substance of the article in its own future works, including lectures and books,
The right to reproduce the article for its own purposes,
The right to archive all versions of the article in any repository, and
The right to enter into separate additional contractual arrangements for the non-exclusive distribution of published versions of the article (for example, posting them to institutional repositories or publishing them in a book), acknowledging its initial publication in this journal (Jurnal Teknologi dan Sistem Komputer).

Suppose the article was prepared jointly by more than one author. Each author submitting the manuscript warrants that all co-authors have given their permission to agree to copyright and license notices (agreements) on their behalf and notify co-authors of the terms of this policy. JTSiskom will not be held responsible for anything arising because of the writer's internal dispute. JTSiskom will only communicate with correspondence authors.

Authors should also understand that their articles (and any additional files, including data sets and analysis/computation data) will become publicly available once published. The license of published articles (and additional data) will be governed by a Creative Commons Attribution-ShareAlike 4.0 International License. JTSiskom allows users to copy, distribute, display and perform work under license. Users need to attribute the author(s) and JTSiskom to distribute works in journals and other publication media. Unless otherwise stated, the author(s) is a public entity as soon as the article is published.

Prediksi pembatalan pemesanan hotel menggunakan optimalisasi hiperparameter pada algoritme Random Forest

Prediction of hotel bookings cancellation using hyperparameter optimization on Random Forest algorithm

EDITORIAL OFFICE OF JURNAL TEKNOLOGI DAN SISTEM KOMPUTER