Perbandingan Model Naïve Bayes, Logistic Regression, SVM, XGBoost, dan SVM-XGBoost untuk Analisis Sentimen Tunaiku


  • Yabes Aryanto Melapa * Mail Universitas PGRI Semarang, Semarang, Indonesia
  • Setyoningsih Wibowo Universitas PGRI Semarang, Semarang, Indonesia
  • Nur Latifah Dwi Mutiara Sari Universitas PGRI Semarang, Semarang, Indonesia
  • (*) Corresponding Author
Keywords: Sentiment Analysis; Classification; SVM–XGBoost; Machine Learning

Abstract

Sentiment analysis is used to explore user perceptions of fintech services such as Tunaiku through the evaluation of customer reviews. This study specifically aims to compare the performance of several sentiment classification algorithms to determine the most optimal model for classifying Tunaiku app user reviews. The dataset used in this study is a collection of Tunaiku app user reviews obtained from the Google Play Store, with a total of 18,458 reviews. This study compares the performance of five classification algorithms, namely Naïve Bayes, Logistic Regression, Support Vector Machine (SVM), XGBoost, and a hybrid SVM-XGBoost model. The research stages include text preprocessing, feature extraction using TF-IDF, and the application of a validated classification model using the cross-validation method. Model performance evaluation is carried out based on accuracy, precision, recall, and F1-score metrics. The test results showed that Naïve Bayes (91.96%), Logistic Regression (92.81%), SVM (92.56%), and XGBoost (92.52%) provided good performance, while the hybrid SVM-XGBoost model produced the best performance with the highest accuracy of 93.05%. These findings indicate that the hybrid approach is more effective in analyzing user review sentiment and has the potential to be a basis for decision-making in improving Tunaiku's service quality according to user needs.

Downloads

Download data is not yet available.

References

A. Daza, N. Saboya, J. I. Necochea-Chamorro, K. Zavaleta Ramos, and Y. D. R. Vásquez Valencia, “Systematic review of machine learning techniques to predict anxiety and stress in college students,” Inform. Med. Unlocked, vol. 43, p. 101391, 2023, doi: 10.1016/j.imu.2023.101391.

M. Mustaqim, A. Gunawan, Y. B. Pratama, and I. Zaliman, “Pengembangan Chatbot Layanan Publik Menggunakan Machine Learning Dan Natural Languange Processing,” J. Inf. Technol. Soc., vol. 1, no. 1, pp. 1–4, Jun. 2023, doi: 10.35438/jits.v1i1.16.

S. Yadav and N. Saleena, “Sentiment Analysis Of Reviews Using an Augmented Dictionary Approach,” in 2020 5th International Conference on Computing, Communication and Security (ICCCS), Patna, India: IEEE, Oct. 2020, pp. 1–5. doi: 10.1109/ICCCS49678.2020.9277094.

M. Melda, S. P. Cipta, N. Nurhaeni, M. Mambang, and M. H. Adini, “Analisis Sentimen pada Aplikasi Pinjaman Online Easycash Menggunakan Algoritma Naïve Bayes di Media Sosial Twitter,” J. Nas. Komputasi Dan Teknol. Inf. JNKTI, vol. 7, no. 4, pp. 981–988, Aug. 2024, doi: 10.32672/jnkti.v7i4.7918.

X. Wu, Y. Linghu, T. Wang, and Y. Fan, “Sentiment Analysis of Weak-RuleText Based on the Combination of Sentiment Lexicon and Neural Network,” in 2021 IEEE 6th International Conference on Cloud Computing and Big Data Analytics (ICCCBDA), Chengdu, China: IEEE, Apr. 2021, pp. 205–209. doi: 10.1109/ICCCBDA51879.2021.9442593.

M. Iqbal, M. Afdal, and R. Novita, “Implementasi Algoritma Support Vector Machine Untuk Analisa Sentimen Data Ulasan Aplikasi Pinjaman Online di Google Play Store,” MALCOM Indones. J. Mach. Learn. Comput. Sci., vol. 4, no. 4, pp. 1244–1252, Jul. 2024, doi: 10.57152/malcom.v4i4.1435.

Alfandi Safira and F. N. Hasan, “Analisis Sentimen Masyarakat terhadap Paylater Menggunakan Metode Naive Bayes Classifier,” ZONAsi J. Sist. Inf., vol. 5, no. 1, pp. 59–70, Jan. 2023, doi: 10.31849/zn.v5i1.12856.

B. Satya, M. H. S J, M. Rahardi, and F. F. Abdulloh, “Sentiment Analysis of Review Sestyc Using Support Vector Machine, Naive Bayes, and Logistic Regression Algorithm,” in 2022 5th International Conference on Information and Communications Technology (ICOIACT), Yogyakarta, Indonesia: IEEE, Aug. 2022, pp. 188–193. doi: 10.1109/ICOIACT55506.2022.9972046.

J. Liu, X. Zhu, and Y. Zhang, “Application of DE-GWO-SVM Algorithm in Business Order Prediction Model,” in 2020 IEEE 11th International Conference on Software Engineering and Service Science (ICSESS), Beijing, China: IEEE, Oct. 2020, pp. 432–435. doi: 10.1109/ICSESS49938.2020.9237714.

M. R. Kurniawanda and F. A. T. Tobing, “Analysis Sentiment Cyberbullying In Instagram Comments with XGBoost Method,” IJNMT Int. J. New Media Technol., vol. 9, no. 1, pp. 28–34, Jul. 2022, doi: 10.31937/ijnmt.v9i1.2670.

C. Shuran and L. Yian, “Breast cancer diagnosis and prediction model based on improved PSO-SVM based on gray relational analysis,” in 2020 19th International Symposium on Distributed Computing and Applications for Business Engineering and Science (DCABES), Xuzhou, China: IEEE, Oct. 2020, pp. 231–234. doi: 10.1109/DCABES50732.2020.00067.

S. Poria, D. Hazarika, N. Majumder, and R. Mihalcea, “Beneath the Tip of the Iceberg: Current Challenges and New Directions in Sentiment Analysis Research,” IEEE Trans. Affect. Comput., vol. 14, no. 1, pp. 108–132, Jan. 2023, doi: 10.1109/TAFFC.2020.3038167.

M. Ladjal, M. A. Ouali, and M. D. Lass, “optimization of SVM parameters with hybrid PCA-PSO methods for water quality monitoring,” in 2020 International Conference on Electrical Engineering (ICEE), Istanbul, Turkey: IEEE, Sep. 2020, pp. 1–6. doi: 10.1109/ICEE49691.2020.9249881.

M. Kasri, M. Birjali, M. Nabil, A. Beni-Hssane, A. El-Ansari, and M. El Fissaoui, “Refining Word Embeddings with Sentiment Information for Sentiment Analysis,” J. ICT Stand., Aug. 2022, doi: 10.13052/jicts2245-800X.1031.

Rizky Rizaldi, M Ridho, Arraihan Tahta Ainullah, Lusiana Efrizoni, Rahmaddeni Rahmaddeni, and M Fahrel Dea Putra, “Penerapan Algoritma Support Vector Machine dan XGBoost Dalam Mengklasifikasikan Sentimen Opini Publik Terhadap Aplikasi Uber,” J. Inform. Dan Tekonologi Komput. JITEK, vol. 5, no. 1, pp. 01–09, Apr. 2025, doi: 10.55606/jitek.v5i1.5735.

N. P. Husain and A. F. Syam, “Analisis Sentimen Ulasan Pengguna Tiktok pada Google Play Store Berbasis TF-IDF dan Support Vector Machine,” vol. 5, no. 1, 2024.

“Implementasi Machine Learning untuk Prediksi Harga Mobil Bekas dengan Algoritma Regresi Linear berbasis Web,” J. Ilm. Komputasi, vol. 21, no. 4, Dec. 2022, doi: 10.32409/jikstik.21.4.3327.

F. T. Saputra, S. H. Wijaya, Y. Nurhadryani, and Defina, “Lexicon Addition Effect on Lexicon-Based of Indonesian Sentiment Analysis on Twitter,” in 2020 International Conference on Informatics, Multimedia, Cyber and Information System (ICIMCIS), Jakarta, Indonesia: IEEE, Nov. 2020, pp. 136–141. doi: 10.1109/ICIMCIS51567.2020.9354269.

H. Hermanto, R. Fahlapi, A. Y. Kuntoro, and T. Asra, “Perbandingan Algoritma Klasifikasi Analisis Sentimen Pengguna Aplikasi Getcontact Dalam Pencegahan Penipuan Online,” J-INTECH, vol. 12, no. 1, pp. 158–167, Jul. 2024, doi: 10.32664/j-intech.v12i1.1262.

R. Hassan and Md. R. Islam, “Impact of Sentiment Analysis in Fake Online Review Detection,” in 2021 International Conference on Information and Communication Technology for Sustainable Development (ICICT4SD), Dhaka, Bangladesh: IEEE, Feb. 2021, pp. 21–24. doi: 10.1109/ICICT4SD50815.2021.9396899.

H. T. Phan, V. C. Tran, N. T. Nguyen, and D. Hwang, “Improving the Performance of Sentiment Analysis of Tweets Containing Fuzzy Sentiment Using the Feature Ensemble Model,” IEEE Access, vol. 8, pp. 14630–14641, 2020, doi: 10.1109/ACCESS.2019.2963702.

B. Gunawan, H. S. Pratiwi, and E. E. Pratama, “Sistem Analisis Sentimen pada Ulasan Produk Menggunakan Metode Naive Bayes,” J. Edukasi Dan Penelit. Inform. JEPIN, vol. 4, no. 2, p. 113, Dec. 2018, doi: 10.26418/jp.v4i2.27526.

M. Choirul Rahmadan, A. Nizar Hidayanto, D. Swadani Ekasari, B. Purwandari, and Theresiawati, “Sentiment Analysis and Topic Modelling Using the LDA Method related to the Flood Disaster in Jakarta on Twitter,” in 2020 International Conference on Informatics, Multimedia, Cyber and Information System (ICIMCIS), Jakarta, Indonesia: IEEE, Nov. 2020, pp. 126–130. doi: 10.1109/ICIMCIS51567.2020.9354320.

V. A. Riyanto and D. B. Santoso, “Penerapan Model Support Vector Machine Pada Klasifikasi Sentimen Ulasan Aplikasi Lazada,” J. Ris. Sist. Inf. Dan Tek. Inform. JURASIK, vol. 9, pp. 178–184, 2024.


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Perbandingan Model Naïve Bayes, Logistic Regression, SVM, XGBoost, dan SVM-XGBoost untuk Analisis Sentimen Tunaiku

Dimensions Badge
Article History
Submitted: 2025-12-10
Published: 2025-12-26
Abstract View: 428 times
PDF Download: 304 times
How to Cite
Melapa, Y., Wibowo, S., & Sari, N. (2025). Perbandingan Model Naïve Bayes, Logistic Regression, SVM, XGBoost, dan SVM-XGBoost untuk Analisis Sentimen Tunaiku. Building of Informatics, Technology and Science (BITS), 7(3), 1986-1995. https://doi.org/10.47065/bits.v7i3.8914
Issue
Section
Articles