Prediksi Perpindahan Pelanggan Pada Toko Online Menggunakan Metode Tree-Based Gradient Boosted Models (Customer Churn Prediction in Online Stores Using Tree-Based Gradient Boosted Models)
Abstract
Customers are a critical asset to a company's success, and ensuring their satisfaction is paramount. Sustained churn reduces the value a company derives from its customers and can jeopardize its competitive advantage. Customer churn, in which consumers switch to products from other brands, is influenced by factors such as promotion, price, product availability, and customer satisfaction. Most churn prediction research concentrates on the telecommunications, retail, and banking industries; only a few studies address online stores. This research applies data mining, focusing on machine learning algorithms and specifically on tree-based gradient boosted models (XGBoost, LightGBM, and CatBoost), to predict customer churn in online stores. The methodology comprises data collection, data pre-processing, model selection and training, model evaluation, and analysis of the results. The implementation uses libraries such as pandas, NumPy, and matplotlib. The results show that the XGBoost model achieved the highest accuracy in predicting customer churn, with an ROC AUC of 0.66 and an accuracy of 0.80032. The feature importance analysis identifies the gender variable as an important factor in model performance. This research contributes to improving customer service, minimizing churn, and ultimately increasing company profitability in the online store sector. Suggestions for future research include expanding the data sources, testing with more evaluation metrics, exploring additional churn factors, and comparing the results against other prediction methods for validation.
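To make the pipeline described in the abstract more concrete, the following is a minimal sketch of tree-based gradient boosting and of the two reported metrics (accuracy and the area under the ROC curve). It is not the authors' code and does not use XGBoost, LightGBM, or CatBoost: it boosts one-split decision stumps on the log loss in plain Python so that it runs without third-party packages. The data are synthetic, and the two features (`orders`, `days_since_last_purchase`) as well as every function name are assumptions made purely for illustration.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def fit_stump(X, residuals):
    """Least-squares fit of a one-split regression tree (a stump) to the residuals."""
    best = None
    for j in range(len(X[0])):
        values = sorted(set(row[j] for row in X))
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2.0  # candidate threshold between two distinct values
            left = [r for row, r in zip(X, residuals) if row[j] <= t]
            right = [r for row, r in zip(X, residuals) if row[j] > t]
            lm, rm = sum(left) / len(left), sum(right) / len(right)
            sse = sum((r - lm) ** 2 for r in left) + sum((r - rm) ** 2 for r in right)
            if best is None or sse < best[0]:
                best = (sse, j, t, lm, rm)
    return best[1:]  # (feature index, threshold, left value, right value)


def fit_boosting(X, y, n_rounds=20, learning_rate=0.3):
    """Gradient boosting on the log loss with stumps as base learners."""
    p = sum(y) / len(y)
    base = math.log(p / (1.0 - p))  # start from the log-odds of the churn rate
    scores = [base] * len(X)
    stumps = []
    for _ in range(n_rounds):
        # Negative gradient of the log loss with respect to the score.
        residuals = [yi - sigmoid(s) for yi, s in zip(y, scores)]
        j, t, lv, rv = fit_stump(X, residuals)
        stumps.append((j, t, lv, rv))
        scores = [s + learning_rate * (lv if row[j] <= t else rv)
                  for row, s in zip(X, scores)]
    return base, stumps, learning_rate


def predict_proba(model, X):
    base, stumps, lr = model
    return [sigmoid(base + sum(lr * (lv if row[j] <= t else rv)
                               for j, t, lv, rv in stumps))
            for row in X]


def accuracy(y_true, proba, threshold=0.5):
    return sum((p >= threshold) == bool(yt) for yt, p in zip(y_true, proba)) / len(y_true)


def roc_auc(y_true, proba):
    """Chance that a random churner is scored above a random non-churner (ties count 0.5)."""
    pos = [p for yt, p in zip(y_true, proba) if yt == 1]
    neg = [p for yt, p in zip(y_true, proba) if yt == 0]
    wins = sum((a > b) + 0.5 * (a == b) for a in pos for b in neg)
    return wins / (len(pos) * len(neg))


def make_data(n, seed):
    """Synthetic customers; both features and the churn rule are invented for this sketch."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        orders = rng.uniform(0, 10)
        days_since_last_purchase = rng.uniform(0, 90)
        p_churn = sigmoid(0.05 * days_since_last_purchase - 0.4 * orders - 1.0)
        X.append([orders, days_since_last_purchase])
        y.append(1 if rng.random() < p_churn else 0)
    return X, y


X_train, y_train = make_data(200, seed=0)  # train/test split via separate draws
X_test, y_test = make_data(100, seed=1)
model = fit_boosting(X_train, y_train)
proba = predict_proba(model, X_test)
print("accuracy:", round(accuracy(y_test, proba), 3))
print("ROC AUC:", round(roc_auc(y_test, proba), 3))
```

Accuracy depends on the chosen decision threshold, whereas the ROC AUC measures how well the scores rank churners above non-churners, so the two can diverge noticeably, particularly when one class dominates. The production libraries named in the abstract add regularization, deeper trees, and much faster split finding on top of this basic scheme.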
Pages: 605-614
Copyright (c) 2024 Selfia Hafidatus Sholeha, Mochammad Faid, Moh. Ainol Yaqin

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).






















