Performance Evaluation of the Naïve Bayes, SVM, and IndoBERT Algorithms for Sentiment Analysis of Gojek User Reviews Based on Text Mining

I Wayan Aries Agetia; Ni Luh Eka Armoni; I Putu Ari Utama Irawan

doi:10.47065/tin.v7i1.9617

I Wayan Aries Agetia * Politeknik Negeri Bali, Badung, Indonesia
Ni Luh Eka Armoni Politeknik Negeri Bali, Badung, Indonesia
I Putu Ari Utama Irawan Politeknik Negeri Bali, Badung, Indonesia

(*) Corresponding Author

Keywords: Sentiment Analysis; Naïve Bayes; Support Vector Machine; IndoBERT; Text Mining

Abstract

This study evaluates and compares the performance of the Naïve Bayes, Support Vector Machine (SVM), and IndoBERT algorithms for sentiment classification of user reviews of the Gojek application. Data were collected by web scraping the Google Play Store and then labeled into three sentiment categories: negative, neutral, and positive. A quantitative approach with a descriptive-comparative design was employed. The research procedure consisted of data collection, text preprocessing, splitting the dataset into training and testing sets, model development, and evaluation using accuracy, precision, recall, and F1-score together with a confusion matrix. The results indicate that IndoBERT achieved the best performance, with an accuracy of 92.46%, outperforming Naïve Bayes (87.94%) and SVM (86.43%). IndoBERT was also more consistent in precision, recall, and F1-score across all sentiment categories. In contrast, Naïve Bayes tended to misclassify certain classes, while SVM performed relatively stably but fell short of the best results. These findings suggest that transformer-based approaches are more effective at capturing the contextual complexity of the Indonesian language. The study contributes a comparative analysis of classical and transformer-based methods for Indonesian sentiment classification and offers empirical evidence that transformer-based approaches better capture contextual nuances in user reviews of digital applications.
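
The evaluation metrics named in the abstract can all be derived from a single confusion matrix. The following is a minimal sketch of that derivation for the three sentiment classes, not the authors' implementation. The function names and the counts in `toy_cm` are hypothetical and chosen only for illustration; they are not results from the study.

```python
from typing import Dict, List

# The three sentiment categories used in the study.
LABELS = ["negative", "neutral", "positive"]


def accuracy(cm: List[List[int]]) -> float:
    """Share of all reviews whose predicted class equals the true class."""
    total = sum(sum(row) for row in cm)
    correct = sum(cm[k][k] for k in range(len(cm)))
    return correct / total if total else 0.0


def per_class_metrics(
    cm: List[List[int]], labels: List[str]
) -> Dict[str, Dict[str, float]]:
    """Precision, recall, and F1-score per class.

    Assumed layout: cm[i][j] counts reviews with true class i
    that the model predicted as class j.
    """
    metrics: Dict[str, Dict[str, float]] = {}
    for k, label in enumerate(labels):
        tp = cm[k][k]
        # Column k holds everything predicted as class k.
        fp = sum(cm[i][k] for i in range(len(labels))) - tp
        # Row k holds everything that truly belongs to class k.
        fn = sum(cm[k]) - tp
        precision = tp / (tp + fp) if (tp + fp) else 0.0
        recall = tp / (tp + fn) if (tp + fn) else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        metrics[label] = {"precision": precision, "recall": recall, "f1": f1}
    return metrics


# Hypothetical counts for illustration only (rows: true, columns: predicted).
toy_cm = [
    [8, 1, 1],  # true negative reviews
    [2, 5, 3],  # true neutral reviews
    [0, 2, 8],  # true positive reviews
]

if __name__ == "__main__":
    print(f"accuracy: {accuracy(toy_cm):.4f}")
    for label, scores in per_class_metrics(toy_cm, LABELS).items():
        print(label, {name: round(value, 4) for name, value in scores.items()})
```

Reporting the per-class scores alongside accuracy is what makes it possible to see whether a model is consistent across the negative, neutral, and positive categories or concentrates its errors in one of them.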
