Prediksi Prediksi Perpindahan Pelanggan Pada Toko Online Menggunakan Metode Tree-Based Gradient Boosted Models


  • Selfia Hafidatus Sholeha * Mail Universitas Nurul Jadid, Probolinggo, Indonesia
  • Mochammad Faid Universitas Nurul Jadid, Probolinggo, Indonesia
  • Moh. Ainol Yaqin Universitas Nurul Jadid, Probolinggo, Indonesia
  • (*) Corresponding Author
Keywords: Customer Churn; Online Stores; Data Mining; Tree-Based Gradient Boosted Models

Abstract

Customers are a critical asset to a company's success and ensuring their satisfaction is paramount. However, continuous churn can lead to reduced value flowing from customers, potentially jeopardizing a company's competitive advantage. Customer churn, where consumers choose products from other brands, is influenced by various factors such as promotion, price, product availability, and customer satisfaction levels. While much of the research on churn prediction is concentrated in the telecommunications, retail, and banking industries and only a few have conducted churn prediction research on online stores. This research aims to utilize data mining with a focus on machine learning algorithms, especially the tree-based gradient boosted models method that applies XGBoost, LightGBM, and CatBoost models, to predict customer churn in online stores. The research methodology involves data collection, data pre-processing, model selection and training, model evaluation, analysis and results. This research uses several libraries such as pandas library, numpy, matplotlib, and so on. The results of this study show that the XGBoost model achieved the highest accuracy in predicting customer churn, with an ROC curve of 0.66 and an accuracy value of 0.80032. The feature importance analysis highlights the gender variable as an important factor in model performance. This research contributes to improving customer service, minimizing churn, and ultimately increasing company profitability in the online store sector. Suggestions for future research include expanding data sources, testing with more evaluation metrics, exploring additional churn factors and comparing with other prediction methods for validation.

Downloads

Download data is not yet available.

References

V. R. R. Raj and R. A. Azad .V, “Customer Churn Prediction in Telecommunication Industry Having Data Certainty,” Int. J. Sci. Res. Sci. Eng. Technol., vol. 4099, pp. 113–122, 2020, doi: 10.32628/ijsrset207427.

C. A. License, Q. Zeng, M. Chang, Q. Tong, and J. Su, “Retracted: A Prediction Model of Customer Churn considering Customer Value: An Empirical Research of Telecom Industry in China,” Discret. Dyn. Nat. Soc., vol. 2023, pp. 1–1, 2023, doi: 10.1155/2023/9876034.

D. V. Hanifah and Y. P. Astuti, “Analisis Perpindahan Pelanggan Dan Strategi Persaingan Restoran Dengan Metode Markov Chain Dan Game Theory,” MATHunesa J. Ilm. Mat., vol. 11, no. 3, pp. 310–317, 2023, doi: 10.26740/mathunesa.v11n3.p310-317.

M. Rizki Kurniawan, P. Nurul Sabrina, and R. Ilyas, “Prediksi Customer Churn Pada Perusahaan Telekomunikasi Menggunakan Algoritma C4.5 Berbasis Particle Swarm Optimization,” JATI (Jurnal Mhs. Tek. Inform., vol. 7, no. 5, pp. 3369–3375, 2024, doi: 10.36040/jati.v7i5.7476.

A. Khattak, Z. Mehak, H. Ahmad, M. U. Asghar, M. Z. Asghar, and A. Khan, “Customer churn prediction using composite deep learning technique,” Sci. Rep., vol. 13, no. 1, pp. 1–17, 2023, doi: 10.1038/s41598-023-44396-w.

D. Ika Sugiarti and R. Iskandar, “Pengaruh Consumer Review Terhadap Keputusan Pembeli Terhadap Toko Online Shopee,” J. Sos. Teknol., vol. 1, no. 9, pp. 954–962, 2021, doi: 10.59188/jurnalsostech.v1i9.195.

E. T. Oktaria, Y. Yuniarthe, H. Hairudin, and ..., “Sarana Publikasi Dan Media Promosi Produk Kreatifitas Siswa Menggunakan E-Commerce Pada Smk Gading Rejo Kabupaten …,” J. Pengabdi. …, vol. 2, pp. 78–83, 2023, [Online]. Available: https://www.jpu.ubl.ac.id/index.php/jpu/article/view/34%0Ahttps://www.jpu.ubl.ac.id/index.php/jpu/article/download/34/32

L. Dwi, “Perbandingan Performa Model Prediksi Customer Churn Berbasis Machine Learning Pada Fashion E-Commerce,” 2023.

Z. Kedah, “Use of E-Commerce in The World of Business,” Startupreneur Bus. Digit. (SABDA Journal), vol. 2, no. 1, pp. 51–60, 2023, doi: 10.33050/sabda.v2i1.273.

X. Xiahou and Y. Harada, “B2C E-Commerce Customer Churn Prediction Based on K-Means and SVM,” J. Theor. Appl. Electron. Commer. Res., vol. 17, no. 2, pp. 458–475, 2022, doi: 10.3390/jtaer17020024.

A. Mauludin, N. Aziz, A. Mauliddin, V. A. Sintalana, D. Hafiz, and A. A. Rismayadi, “Prediksi Customer Churn Menggunakan Logistic Regression dan Decission Tree,” vol. 4, no. 1, pp. 11–19, 2023.

Y. Yudiana, A. Yulia Agustina, and dan Nur Khofifah, “Prediksi Customer Churn Menggunakan Metode CRISP-DM Pada Industri Telekomunikasi Sebagai Implementasi Mempertahankan Pelanggan,” Indones. J. Islam. Econ. Bus., vol. 8, no. 1, pp. 01–20, 2023, [Online]. Available: http://e-journal.lp2m.uinjambi.ac.id/ojp/index.php/ijoieb

R. Alfarez and V. Purwayoga, “PENERAPAN NAÏVE BAYES UNTUK PREDIKSI CUSTOMER CHURN ( STUDI KASUS : PT HUTCHISON 3 INDONESIA ),” vol. 05, no. 02, pp. 301–307, 2024.

A. F. Azmi and A. Voutama, “KOMPUTA : Jurnal Ilmiah Komputer dan Informatika PREDIKSI CHURN NASABAH BANK MENGGUNAKAN KLASIFIKASI RANDOM FOREST DAN DECISION TREE DENGAN EVALUASI CONFUSION MATRIX KOMPUTA : Jurnal Ilmiah Komputer dan Informatika,” vol. 13, no. 1, 2024.

T. Verdonck, B. Baesens, M. Óskarsdóttir, and S. vanden Broucke, “Special issue on feature engineering editorial,” Mach. Learn., no. 0123456789, 2021, doi: 10.1007/s10994-021-06042-2.

N. Subramani, S. V. Easwaramoorthy, P. Mohan, M. Subramanian, and V. Sambath, “A Gradient Boosted Decision Tree-Based Influencer Prediction in Social Network Analysis,” Big Data Cogn. Comput., vol. 7, no. 1, 2023, doi: 10.3390/bdcc7010006.

J. Brownlee, “Train-Test Split for Evaluating Machine Learning Algorithms,” machine learning mastery. Tanggal akses 20 Februari 2024 [Online]. Available: https://machinelearningmastery.com/train-test-split-for-evaluating-machine-learning-algorithms/

H. Azis, P. Purnawansyah, F. Fattah, and I. P. Putri, “Performa Klasifikasi K-NN dan Cross Validation Pada Data Pasien Pengidap Penyakit Jantung,” Ilk. J. Ilm., vol. 12, no. 2, pp. 81–86, 2020, doi: 10.33096/ilkom.v12i2.507.81-86.

A. P. Windarto, S. Defit, and A. Wanto, “Optimalisasi Parameter dengan Cross Validation dan Neural Back-propagation Pada Model Prediksi Pertumbuhan Industri Mikro dan Kecil,” J. Sist. Inf. Bisnis, vol. 11, no. 1, pp. 34–42, 2021, doi: 10.21456/vol11iss1pp34-42.

V. V. Putri, A. Tholib, and C. Novia, “Deteksi Kaggle Bot Account Menggunakan Deep Neural Networks,” NJCA (Nusantara J. Comput. Its Appl., vol. 8, no. 1, p. 13, 2023, doi: 10.36564/njca.v8i1.304.


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Prediksi Prediksi Perpindahan Pelanggan Pada Toko Online Menggunakan Metode Tree-Based Gradient Boosted Models

Dimensions Badge
Article History
Submitted: 2024-05-22
Published: 2024-05-30
Abstract View: 744 times
PDF Download: 837 times
Issue
Section
Articles