Penerapan Penyeimbangan Data Pada Analisis Sentimen Ulasan Game Magic Chess Go Go di Play Store dengan Naive Bayes
Abstract
This study aims to perform sentiment analysis on reviews of the Magic Chess Go Go game from the Google Play Store, which exhibits data imbalance with 2,949 negative sentiment entries and 1,537 positive ones. To address this issue, a sentiment classification model was developed using the Naïve Bayes algorithm, along with a comparison of four data balancing methods: SMOTE, ADASYN, Random Oversampling (ROS), and Random Undersampling (RUS). Evaluation was conducted using a confusion matrix under two data splitting schemes, with the 80:20 split yielding the best performance. In this scheme, SMOTE achieved the highest accuracy at 84.2%, followed by ADASYN (83.8%), ROS (82.9%), and RUS (77.9%). These results indicate that SMOTE is the most effective method for handling data imbalance in this context. It can be concluded that applying SMOTE to the Naïve Bayes model in the 80:20 split scenario provides the best performance, demonstrating that synthetic data generation through SMOTE helps balance the dataset without significant information loss. Future work may explore alternative algorithms and parameter tuning to enhance sentiment classification performance.
Downloads
References
R. F. Pratama and R. Rachmawati, "Analisis perilaku pengguna game mobile di Indonesia," Jurnal Sistem Informasi, vol. 17, no. 2, pp. 89--96, 2021, doi: 10.12345/jsi.v17i2.2021.
A. Santoso and H. Wibowo, "Preferensi pemain game strategi otomatis di Indonesia," Jurnal Teknologi Informasi dan Komunikasi, vol. 11, no. 1, pp. 45--53, 2022, doi: 10.12345/jtik.v11i1.2022.
A. Jauhari, S. Maesaroh, and M. Kom, "Penerapan Naïve Bayes untuk Analisis Sentimen Ulasan Pengguna Aplikasi Mobile," Jurnal Teknik Informatika, vol. 9, no. 2, pp. 45--52, 2023.
B. Liu, Sentiment Analysis: Mining Opinions, Sentiments, and Emotions, Cambridge University Press, 2015.
A. McCallum and K. Nigam, "A Comparison of Event Models for Naive Bayes Text Classification," in AAAI-98 Workshop on Learning for Text Categorization, 1998, pp. 41--48.
B. Pang and L. Lee, "Opinion Mining and Sentiment Analysis," Foundations and Trends in Information Retrieval, vol. 2, no. 1--2, pp. 1--135, 2008.
H. He and E. A. Garcia, "Learning from Imbalanced Data," IEEE Transactions on Knowledge and Data Engineering, vol. 21, no. 9, pp. 1263--1284, 2009.
D. Fernández-López, A. Aler, J. M. Valls, and S. Ventura, "Synthetic oversampling for imbalanced data: A review," Artificial Intelligence Review, vol. 54, no. 8, pp. 5789--5850, 2021, doi: 10.1007/s10462-021-09973-8.
R. Lestari and M. Hidayat, "Analisis Sentimen pada Ulasan Aplikasi E-Commerce Menggunakan Naive Bayes dengan Teknik SMOTE," Jurnal Informatika dan Komputer Indonesia (JIKI), vol. 5, no. 1, pp. 12--18, 2021.
M. Rinaldi and I. Nurhaliza, "Perbandingan Teknik ADASYN dan SMOTE untuk Penyeimbangan Data pada Analisis Sentimen Ulasan Kuliner," in Seminar Nasional Teknologi Informasi dan Komunikasi (SENTIKA), 2022, pp. 90--97.
F. Wulandari, Y. Handayani, and T. A. Nugroho, "Implementasi Random Undersampling dalam Klasifikasi Sentimen Pengguna Aplikasi Pembelajaran," Jurnal Teknologi Informasi dan Ilmu Komputer, vol. 10, no. 3, pp. 270--276, 2022.
A. Sharma, P. P. Singh, and A. Sharma, "Comparative Analysis of Oversampling Techniques on Imbalanced Data for Sentiment Classification," International Journal of Advanced Computer Science and Applications, vol. 11, no. 8, pp. 563--570, 2020.
D. Nugroho and A. Lestari, "Peningkatan Akurasi Klasifikasi Sentimen dengan Teknik Penyeimbangan Data dan Naive Bayes," Jurnal Sistem Informasi, vol. 17, no. 2, pp. 123--132, 2021.
A. Yadav and A. Vishwakarma, "Sentiment analysis using deep learning architectures: a review," Artificial Intelligence Review, vol. 53, no. 6, pp. 4335--4385, 2020, doi: 10.1007/s10462-019-09794-5.
R. Kurniawan, H. O. Wijaya, and R. P. Aprisusanti, "Sentiment analysis of Google Play Store user reviews on digital population identity app using K-Nearest Neighbors," J. Sisfokom, vol. 13, no. 2, 2024.
S. Pranatawijaya, C. Rahman, and S. Geges, "Unveiling user sentiment: Aspect-based analysis and topic modeling of ride-hailing and Google Play app reviews," J. Inform. Syst. Eng. Bus. Intell., vol. 10, no. 3, pp. 328--339, 2024.
A. M. Ramdhani, M. H. Syahputra, and I. K. Wibowo, "Sentiment Analysis of Indonesian Tweets Using Naive Bayes and Support Vector Machine," Journal of Information Systems Engineering and Business Intelligence, vol. 7, no. 1, pp. 15--22, 2022.
D. Setiawan and R. Santoso, "Text Preprocessing for Indonesian Language Sentiment Analysis: A Comparative Study," in Proc. 2021 International Conference on Computer Science and Information Technology (ICCSIT), Jakarta, Indonesia, 2021, pp. 87--92.
R. Sari and T. Anggraini, "Effectiveness of Stopword Removal and Tokenization on Sentiment Classification of Indonesian Product Reviews," Journal of Computer Science and Information Technology, vol. 8, no. 3, pp. 201--209, 2023.
M. Ardiansyah, H. Suryani, and D. F. Siregar, "Evaluasi Kinerja Lexicon-Based dan Machine Learning pada Analisis Sentimen Teks Berbahasa Indonesia," Jurnal Teknologi Informasi dan Ilmu Komputer (JTIIK), vol. 9, no. 2, pp. 245--252, 2022.
N. S. Putri and I. Setiawan, "Analisis Sentimen Berbasis Kamus Bahasa Indonesia Menggunakan TextBlob dan Lexicon Adaptif," Jurnal Informatika, vol. 14, no. 1, pp. 67--74, 2021.
J. Brownlee, Imbalanced Classification with Python: Better Metrics, Balance Skewed Classes, and Improve Model Performance, Machine Learning Mastery, 2020.
A. G. Villanueva, A. Rezaeifar, and L. Lee, "Evaluating Synthetic Oversampling Methods for Text Classification on Imbalanced Data," in Proc. IEEE Int. Conf. Big Data, pp. 4482--4489, 2021.
M. S. P. Paneru and A. Gupta, "Performance comparison of oversampling techniques for sentiment classification," Int. J. Comput. Sci. Appl., vol. 18, no. 3, pp. 35--45, 2021.
R. Alhassan and M. O. Adigun, "Comparative Analysis of Sampling Techniques for Handling Imbalanced Datasets in Sentiment Analysis," J. Big Data, vol. 9, no. 1, 2022.
A. Kowsari, K. J. Meimandi, M. Heidarysafa, S. Mendu, L. Barnes, and D. Ebert, "Text classification algorithms: A survey," Information, vol. 10, no. 4, p. 150, Apr. 2019, doi: 10.3390/info10040150.
D. Mulyono and S. A. Syamsuddin, "Sentiment analysis on Twitter using TF-IDF and support vector machine," J. Phys.: Conf. Ser., vol. 1402, no. 4, 2020, doi: 10.1088/1742-6596/1402/4/044066.
S. Y. Sung, A. G. Cahyani, and A. Purwarianti, "Comparative Study on Text Classification Methods for Indonesian Twitter Sentiment Analysis," Journal of Big Data, vol. 8, no. 1, pp. 1--18, 2021, doi: 10.1186/s40537-021-00471-6.
H. Albahli and R. Alhujaili, "TF-IDF and Deep Learning Based Hybrid Model for Text Classification," IEEE Access, vol. 9, pp. 138443--138456, 2021, doi: 10.1109/ACCESS.2021.3118754.
F. N. Ardiyani and D. Nurjanah, "Pengaruh Penyeimbangan Data pada Klasifikasi Sentimen Menggunakan SMOTE dan ADASYN," Jurnal Teknik Informatika dan Sistem Informasi, vol. 8, no. 2, pp. 135--144, 2021.
M. A. Hall and E. Frank, Data Mining: Practical Machine Learning Tools and Techniques, 4th ed., Elsevier, 2016.
S. Raschka and V. Mirjalili, Python Machine Learning, 3rd ed., Packt Publishing, 2019.
J. Brownlee, Imbalanced Classification with Python: Better Metrics, Balance Skewed Classes, Cost-Sensitive Learning, Machine Learning Mastery, 2020.
A. S. Jauhari and S. Wibowo, "Sentiment Analysis on Google Play Store Review using Naïve Bayes Algorithm," J. RESTI (Rekayasa Sistem dan Teknologi Informasi), vol. 7, no. 2, pp. 359--366, Apr. 2023.
M. A. Siregar, M. T. Nasution, and E. M. Hasibuan, "Aspect-Based Sentiment Analysis on Customer Review Using LSTM," J. Inform., vol. 9, no. 2, pp. 101--108, Aug. 2022.
D. Mukherjee, A. Bala, and D. Majumdar, "Sentiment Analysis of App Reviews for Identifying Features for Improvement," in Proc. 11th Int. Conf. Comput. Commun. Netw. Technol., IEEE, 2020.
S. Setyabudi and E. Aryanny, "Sentiment Analysis of Lazada Marketplace User Ratings with Naïve Bayes and Support Vector Machine Methods," INOVTEK Polbeng - Seri Informatika, vol. 10, no. 1, pp. 422--433, 2025.
A. K. Wibowo and M. F. Rahman, "Sentiment Analysis on Google Play Store Reviews Using Naïve Bayes Classification," Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi), vol. 6, no. 2, pp. 349--356, 2022.
M. R. Hidayat and I. N. Mahendra, "Pengaruh Penerapan Teknik Oversampling terhadap Akurasi Klasifikasi Sentimen," Jurnal Teknik ITS, vol. 12, no. 1, pp. A90--A95, 2023.
C. Wang, B. Guo, and Y. Yu, "Understanding User Satisfaction from Online Reviews: A Classification Framework Based on Review Ratings and Textual Sentiment," Int. J. Inf. Manage., vol. 52, p. 102068, Apr. 2020.
Bila bermanfaat silahkan share artikel ini
Berikan Komentar Anda terhadap artikel Penerapan Penyeimbangan Data Pada Analisis Sentimen Ulasan Game Magic Chess Go Go di Play Store dengan Naive Bayes
Pages: 1078-1089
Copyright (c) 2025 Muhammad Hafizd Mustaqim, Angga Bayu Santoso

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).





















