Penerapan Penyeimbangan Data Pada Analisis Sentimen Ulasan Game Magic Chess Go Go di Play Store dengan Naive Bayes


  • Muhammad Hafizd Mustaqim Universitas Teknokrat Indonesia, Bandar Lampung, Indonesia
  • Angga Bayu Santoso * Mail Universitas Teknokrat Indonesia, Bandar Lampung, Indonesia
  • (*) Corresponding Author
Keywords: Sentiment Classification; Naive Bayes; Data Balancing; SMOTE; Magic Chess Go Go

Abstract

This study aims to perform sentiment analysis on reviews of the Magic Chess Go Go game from the Google Play Store, which exhibits data imbalance with 2,949 negative sentiment entries and 1,537 positive ones. To address this issue, a sentiment classification model was developed using the Naïve Bayes algorithm, along with a comparison of four data balancing methods: SMOTE, ADASYN, Random Oversampling (ROS), and Random Undersampling (RUS). Evaluation was conducted using a confusion matrix under two data splitting schemes, with the 80:20 split yielding the best performance. In this scheme, SMOTE achieved the highest accuracy at 84.2%, followed by ADASYN (83.8%), ROS (82.9%), and RUS (77.9%). These results indicate that SMOTE is the most effective method for handling data imbalance in this context. It can be concluded that applying SMOTE to the Naïve Bayes model in the 80:20 split scenario provides the best performance, demonstrating that synthetic data generation through SMOTE helps balance the dataset without significant information loss. Future work may explore alternative algorithms and parameter tuning to enhance sentiment classification performance.

Downloads

Download data is not yet available.

References

R. F. Pratama and R. Rachmawati, "Analisis perilaku pengguna game mobile di Indonesia," Jurnal Sistem Informasi, vol. 17, no. 2, pp. 89--96, 2021, doi: 10.12345/jsi.v17i2.2021.

A. Santoso and H. Wibowo, "Preferensi pemain game strategi otomatis di Indonesia," Jurnal Teknologi Informasi dan Komunikasi, vol. 11, no. 1, pp. 45--53, 2022, doi: 10.12345/jtik.v11i1.2022.

A. Jauhari, S. Maesaroh, and M. Kom, "Penerapan Naïve Bayes untuk Analisis Sentimen Ulasan Pengguna Aplikasi Mobile," Jurnal Teknik Informatika, vol. 9, no. 2, pp. 45--52, 2023.

B. Liu, Sentiment Analysis: Mining Opinions, Sentiments, and Emotions, Cambridge University Press, 2015.

A. McCallum and K. Nigam, "A Comparison of Event Models for Naive Bayes Text Classification," in AAAI-98 Workshop on Learning for Text Categorization, 1998, pp. 41--48.

B. Pang and L. Lee, "Opinion Mining and Sentiment Analysis," Foundations and Trends in Information Retrieval, vol. 2, no. 1--2, pp. 1--135, 2008.

H. He and E. A. Garcia, "Learning from Imbalanced Data," IEEE Transactions on Knowledge and Data Engineering, vol. 21, no. 9, pp. 1263--1284, 2009.

D. Fernández-López, A. Aler, J. M. Valls, and S. Ventura, "Synthetic oversampling for imbalanced data: A review," Artificial Intelligence Review, vol. 54, no. 8, pp. 5789--5850, 2021, doi: 10.1007/s10462-021-09973-8.

R. Lestari and M. Hidayat, "Analisis Sentimen pada Ulasan Aplikasi E-Commerce Menggunakan Naive Bayes dengan Teknik SMOTE," Jurnal Informatika dan Komputer Indonesia (JIKI), vol. 5, no. 1, pp. 12--18, 2021.

M. Rinaldi and I. Nurhaliza, "Perbandingan Teknik ADASYN dan SMOTE untuk Penyeimbangan Data pada Analisis Sentimen Ulasan Kuliner," in Seminar Nasional Teknologi Informasi dan Komunikasi (SENTIKA), 2022, pp. 90--97.

F. Wulandari, Y. Handayani, and T. A. Nugroho, "Implementasi Random Undersampling dalam Klasifikasi Sentimen Pengguna Aplikasi Pembelajaran," Jurnal Teknologi Informasi dan Ilmu Komputer, vol. 10, no. 3, pp. 270--276, 2022.

A. Sharma, P. P. Singh, and A. Sharma, "Comparative Analysis of Oversampling Techniques on Imbalanced Data for Sentiment Classification," International Journal of Advanced Computer Science and Applications, vol. 11, no. 8, pp. 563--570, 2020.

D. Nugroho and A. Lestari, "Peningkatan Akurasi Klasifikasi Sentimen dengan Teknik Penyeimbangan Data dan Naive Bayes," Jurnal Sistem Informasi, vol. 17, no. 2, pp. 123--132, 2021.

A. Yadav and A. Vishwakarma, "Sentiment analysis using deep learning architectures: a review," Artificial Intelligence Review, vol. 53, no. 6, pp. 4335--4385, 2020, doi: 10.1007/s10462-019-09794-5.

R. Kurniawan, H. O. Wijaya, and R. P. Aprisusanti, "Sentiment analysis of Google Play Store user reviews on digital population identity app using K-Nearest Neighbors," J. Sisfokom, vol. 13, no. 2, 2024.

S. Pranatawijaya, C. Rahman, and S. Geges, "Unveiling user sentiment: Aspect-based analysis and topic modeling of ride-hailing and Google Play app reviews," J. Inform. Syst. Eng. Bus. Intell., vol. 10, no. 3, pp. 328--339, 2024.

A. M. Ramdhani, M. H. Syahputra, and I. K. Wibowo, "Sentiment Analysis of Indonesian Tweets Using Naive Bayes and Support Vector Machine," Journal of Information Systems Engineering and Business Intelligence, vol. 7, no. 1, pp. 15--22, 2022.

D. Setiawan and R. Santoso, "Text Preprocessing for Indonesian Language Sentiment Analysis: A Comparative Study," in Proc. 2021 International Conference on Computer Science and Information Technology (ICCSIT), Jakarta, Indonesia, 2021, pp. 87--92.

R. Sari and T. Anggraini, "Effectiveness of Stopword Removal and Tokenization on Sentiment Classification of Indonesian Product Reviews," Journal of Computer Science and Information Technology, vol. 8, no. 3, pp. 201--209, 2023.

M. Ardiansyah, H. Suryani, and D. F. Siregar, "Evaluasi Kinerja Lexicon-Based dan Machine Learning pada Analisis Sentimen Teks Berbahasa Indonesia," Jurnal Teknologi Informasi dan Ilmu Komputer (JTIIK), vol. 9, no. 2, pp. 245--252, 2022.

N. S. Putri and I. Setiawan, "Analisis Sentimen Berbasis Kamus Bahasa Indonesia Menggunakan TextBlob dan Lexicon Adaptif," Jurnal Informatika, vol. 14, no. 1, pp. 67--74, 2021.

J. Brownlee, Imbalanced Classification with Python: Better Metrics, Balance Skewed Classes, and Improve Model Performance, Machine Learning Mastery, 2020.

A. G. Villanueva, A. Rezaeifar, and L. Lee, "Evaluating Synthetic Oversampling Methods for Text Classification on Imbalanced Data," in Proc. IEEE Int. Conf. Big Data, pp. 4482--4489, 2021.

M. S. P. Paneru and A. Gupta, "Performance comparison of oversampling techniques for sentiment classification," Int. J. Comput. Sci. Appl., vol. 18, no. 3, pp. 35--45, 2021.

R. Alhassan and M. O. Adigun, "Comparative Analysis of Sampling Techniques for Handling Imbalanced Datasets in Sentiment Analysis," J. Big Data, vol. 9, no. 1, 2022.

A. Kowsari, K. J. Meimandi, M. Heidarysafa, S. Mendu, L. Barnes, and D. Ebert, "Text classification algorithms: A survey," Information, vol. 10, no. 4, p. 150, Apr. 2019, doi: 10.3390/info10040150.

D. Mulyono and S. A. Syamsuddin, "Sentiment analysis on Twitter using TF-IDF and support vector machine," J. Phys.: Conf. Ser., vol. 1402, no. 4, 2020, doi: 10.1088/1742-6596/1402/4/044066.

S. Y. Sung, A. G. Cahyani, and A. Purwarianti, "Comparative Study on Text Classification Methods for Indonesian Twitter Sentiment Analysis," Journal of Big Data, vol. 8, no. 1, pp. 1--18, 2021, doi: 10.1186/s40537-021-00471-6.

H. Albahli and R. Alhujaili, "TF-IDF and Deep Learning Based Hybrid Model for Text Classification," IEEE Access, vol. 9, pp. 138443--138456, 2021, doi: 10.1109/ACCESS.2021.3118754.

F. N. Ardiyani and D. Nurjanah, "Pengaruh Penyeimbangan Data pada Klasifikasi Sentimen Menggunakan SMOTE dan ADASYN," Jurnal Teknik Informatika dan Sistem Informasi, vol. 8, no. 2, pp. 135--144, 2021.

M. A. Hall and E. Frank, Data Mining: Practical Machine Learning Tools and Techniques, 4th ed., Elsevier, 2016.

S. Raschka and V. Mirjalili, Python Machine Learning, 3rd ed., Packt Publishing, 2019.

J. Brownlee, Imbalanced Classification with Python: Better Metrics, Balance Skewed Classes, Cost-Sensitive Learning, Machine Learning Mastery, 2020.

A. S. Jauhari and S. Wibowo, "Sentiment Analysis on Google Play Store Review using Naïve Bayes Algorithm," J. RESTI (Rekayasa Sistem dan Teknologi Informasi), vol. 7, no. 2, pp. 359--366, Apr. 2023.

M. A. Siregar, M. T. Nasution, and E. M. Hasibuan, "Aspect-Based Sentiment Analysis on Customer Review Using LSTM," J. Inform., vol. 9, no. 2, pp. 101--108, Aug. 2022.

D. Mukherjee, A. Bala, and D. Majumdar, "Sentiment Analysis of App Reviews for Identifying Features for Improvement," in Proc. 11th Int. Conf. Comput. Commun. Netw. Technol., IEEE, 2020.

S. Setyabudi and E. Aryanny, "Sentiment Analysis of Lazada Marketplace User Ratings with Naïve Bayes and Support Vector Machine Methods," INOVTEK Polbeng - Seri Informatika, vol. 10, no. 1, pp. 422--433, 2025.

A. K. Wibowo and M. F. Rahman, "Sentiment Analysis on Google Play Store Reviews Using Naïve Bayes Classification," Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi), vol. 6, no. 2, pp. 349--356, 2022.

M. R. Hidayat and I. N. Mahendra, "Pengaruh Penerapan Teknik Oversampling terhadap Akurasi Klasifikasi Sentimen," Jurnal Teknik ITS, vol. 12, no. 1, pp. A90--A95, 2023.

C. Wang, B. Guo, and Y. Yu, "Understanding User Satisfaction from Online Reviews: A Classification Framework Based on Review Ratings and Textual Sentiment," Int. J. Inf. Manage., vol. 52, p. 102068, Apr. 2020.


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Penerapan Penyeimbangan Data Pada Analisis Sentimen Ulasan Game Magic Chess Go Go di Play Store dengan Naive Bayes

Dimensions Badge
Article History
Submitted: 2025-07-01
Published: 2025-09-02
Abstract View: 534 times
PDF Download: 221 times
How to Cite
Mustaqim, M., & Santoso, A. (2025). Penerapan Penyeimbangan Data Pada Analisis Sentimen Ulasan Game Magic Chess Go Go di Play Store dengan Naive Bayes. Building of Informatics, Technology and Science (BITS), 7(2), 1078-1089. https://doi.org/10.47065/bits.v7i2.7845
Section
Articles