Analisis Sentimen Timnas Indonesia pada Data Tidak Seimbang Menggunakan Perbandingan Naïve Bayes dan IndoBERT


  • Maharani Navila Salsa Bela * Mail Universitas Mercu Buana Yogyakarta, Yogyakarta, Indonesia
  • Putry Wahyu Setyaningsih Universitas Mercu Buana Yogyakarta, Yogyakarta, Indonesia
  • (*) Corresponding Author
Keywords: Sentimen Analysis; Social Media X; Naïve Bayes; IndoBERT; Indonesian National Team; Data Imbalance

Abstract

Social media platform X is widely used by the public to express opinions on the performance of the Indonesian National Team, especially in the fourth round of the 2026 World Cup Qualifiers. In this phase, the Indonesian National Team suffered two consecutive defeats, namely 2–3 to Saudi Arabia and 0–1 to Iraq, which triggered an increase in emotional responses and public criticism on social media. This condition makes sentiment analysis important to understand public perception more objectively. This study aims to analyze the sentiment of social media users X and compare the performance of the Naïve Bayes and IndoBERT models in imbalanced data conditions. The research data amounted to 1,268 tweets that were processed through a pre-processing stage, then automatically labeled using a lexicon-based approach as an initial labeling into two classes, namely positive and negative. The dataset was divided into training data and test data with a ratio of 70:30. The data distribution shows the dominance of negative sentiment at 84.1% and positive at 15.9%. Classification was performed using TF-IDF-based Naïve Bayes and IndoBERT-base-p1, with data imbalance management using random oversampling and class weighting. The results show that Naïve Bayes without treatment achieved 84% accuracy but failed to recognize the positive class. After oversampling, the positive class recall increased to 45%. IndoBERT achieved 85% accuracy, with positive recall increasing from 35% to 43% and the positive class F1-score increasing by 47% after applying class weighting. Despite the relatively high accuracy, the evaluation shows the importance of considering performance on minority classes. Overall, IndoBERT with class weighting provided more balanced results. However, the use of lexicon-based automatic labeling is a limitation of this study.

Downloads

Download data is not yet available.

References

I. Apryani, A. Fauzi, D. S. Kusumaningrum, and H. H. Handayani, “Analisis Sentimen Performa Timnas Sepak Bola Indonesia pada Kolom Komentar Aplikasi TikTok Menggunakan Algoritma Machine Learning,” J. Account. Inf. Syst., vol. 8, no. 1, pp. 76–89, 2025, doi: 10.32627/aims.v8i1.1160.

A. D. Pradiptha, F. Siddiq, A. Farisi, M. F. Pratama, and D. Aprianti, “Analisis Sentimen Suporter terhadap Performa Tim Nasional Sepakbola Indonesia pada Turnamen Sea Games 2023 dengan Metode Naive Bayes,” J. Sains dan Inform., vol. 11, pp. 30–39, 2025, doi: 10.34128/jsi.v11i1.775.

Nouvan, “Negara Dengan Pengguna X (twitter) terbanyak Juli 2025, Indonesia urutan keempat,” Dataloka.id, https://dataloka.id/humaniora/4627/negara-dengan-pengguna-x-twitter-terbanyak-juli-2025-indonesia-urutan-keempat/ (accessed Apr. 22, 2026).

W. Alfiyani, D. Abdul Fatah, and F. Irhamni, “Penerapan algoritma naïve Bayes Untuk Analisis Sentimen Pada media sosial X Terhadap Performa tim nasional sepak bola indonesia di era Kepemimpinan Shin Tae-yong,” JATI (Jurnal Mahasiswa Teknik Informatika), vol. 9, no. 3, pp. 3969–3977, 2025, doi:10.36040/jati.v9i3.13451.

F. F. Wati, Suleman, and A. E. Widodo, “Analisis Sentimen Ulasan Pengguna Aplikasi Deepseek Menggunakan Algoritma Random Forest Dan Naive Bayes,” CONTEN Comput. Netw. Technol., vol. 5, no. 1, pp. 8–15, 2025, doi: 10.31294/hqpha267.

R. Lasepa, S. Riyadi, S. Ramadhan, and D. D. Saputra, “Sentiment Analysis Terhadap Perspektif Warganet Atas Tragedi Kanjuruhan Malang di Twitter Menggunakan Naïve Bayes Classifier,” J. Inform., vol. 10, no. 1, pp. 45–53, 2023, doi: 10.31294/inf.v10i1.14546.

R. H. Nufus and U. Surapati, “Analisis Sentimen Persepsi Masyarakat Terhadap Timnas Indonesia U-23 dalam AFC-23 Asian Cup 2024 Pada Media Sosial X Menggunakan Metode Naïve Bayes Classifier,” J. Indones. Manaj. Inform. dan Komun., vol. 5, no. 3, pp. 2647–2657, 2024, doi: 10.35870/jimik.v5i3.964.

H. S. Anggraheni et al., “Deteksi Spam Berbahasa Indonesia Berbasis Teks Menggunakan Model Bert,” J. Teknol. Inf. dan Ilmu Komput., vol. 11, no. 6, pp. 1291–1301, 2024, doi: 10.25126/jtiik.2024118121.

C. J. Tobing, IGN Lanang Wijayakusuma, and Luh Putu Ida Harini, “Perbandingan Kinerja Indobert dan Mbert Untuk Deteksi Berita Hoaks Politik Dalam Bahasa Indonesia,” JST (Jurnal Sains dan Teknologi), vol. 14, no. 1, pp. 114–123, 2025, doi:10.23887/jstundiksha.v14i1.92126.

W. Widyananda, Maskur, and A. Fauzi, “Machine Learning and Transformer-based Model for Sentiment Analysis of Indonesian E-Commerce Reviews,” Indones. J. Comput. Sci., vol. 14, no. 4, pp. 6262–6271, 2025, doi: 10.33022/ijcs.v14i4.4980.

M. I. Abidin and E. W. Pamungkas, “Analisis Sentimen terhadap Tim Nasional Indonesia pada Piala Asia 2023 Menggunakan Model Transformer Bahasa Indonesia,” Rabit : Jurnal Teknologi dan Sistem Informasi Univrab, vol. 10, no. 2, pp. 482–496, 2025, doi:10.36341/rabit.v10i2.6142.

C. Ramadhan, V. Atina, and H. Permatasari, “Analisis Perbandingan Model CNN dan IndoBERT Dalam Sentimen Berita Politik Indonesia,” Pros. Semin. Nas. Teknol. Inf. dan Bisnis, pp. 110–118, 2025, doi: 10.47701/v1r9ka69.

A. Agustin, S. Andrean, S. Susanti, R. Rahmiati, and H. Hamdani, “Review Aplikasi Kredivo Menggunakan Analisis Sentimen dengan Algoritma Support Vector Machine,” Rabit : Jurnal Teknologi dan Sistem Informasi Univrab, vol. 9, no. 1, pp. 39–49, 2023, doi: 10.36341/rabit.v9i1.4107.

N. N. Aini Aryanti and O. Suria, “Analisis Sentimen terhadap Pemutusan Hubungan Kerja di Indonesia: Komparasi IndoBERT dengan SVM, Random Forest, dan Decision Tree dengan Optimasi TF-IDF,” Rabit : Jurnal Teknologi dan Sistem Informasi Univrab, vol. 10, no. 2, pp. 1158–1176, 2025, doi:10.36341/rabit.v10i2.6364.

F. Alvin, N. Anisa, and S. Winarsih, “Perbandingan Kinerja Model IndoBERT , IndoBERTweet , dan Algoritma Klasik pada Analisis Sentimen Isu Indonesia Gelap,” Build. Informatics, Technol. Sci., vol. 7, no. 3, pp. 1601–1613, 2025, doi: 10.47065/bits.v7i3.8636.

A. A. Rohman and G. A. Trisnapradika, “Perbandingan Algoritma NBC , SVM , Logistic Regression untuk Analisis Sentimen Terhadap Wacana KaburAjaDulu di Media Sosial X,” Build. Informatics, Technol. Sci., vol. 7, no. 1, pp. 169–178, 2025, doi: 10.47065/bits.v7i1.7261.

E. A. Winanto, S. M. Z. Ali Difyah, P. A. Jusia, and Sharipuddin, “Analisis Sentimen Terhadap Tagar Kabur Aja Dulu Di Twitter Menggunakan Metode Lexicon-Based,” Jurnal PROCESSOR, vol. 20, no. 2, 2025, doi:10.33998/processor.2025.20.2.2542.

A. A. Qolbu, N. Fitriyati, and N. Inayah, “Performa Naïve Bayes , SVM , dan IndoBERT pada Analisis Sentimen Twitter IndiHome dengan Strategi Penanganan Data Tidak Seimbang,” J. FOURIER, vol. 814, no. 1, pp. 29–44, 2025, doi: 10.14421/fourier.2025.141.29-44.

A. D. M. Putri, N. Sulistianingsih, and R. Rismayati, “Pengaruh Teknik Representasi Teks Bag-of-Words dan TF-IDF terhadap Akurasi Klasifikasi Sentimen Teks Multi-Domain,” JTIM J. Teknol. Inf. dan Multimed., vol. 7, no. 4, pp. 675–688, 2025, doi: 10.35746/jtim.v7i4.756.

N. M. Damayanti, I. D. Ariningtyas, M. Izuddin, and A. Icham, “Analisis Sentimen Tagar #BTSComeback di Platform X Menggunakan IndoBERTweet,” JITET (Jurnal Inform. dan Tek. Elektro Ter., vol. 13, no. 3, 2025, doi: 10.23960/jitet.v13i3.7176.

T. Juniardi and C. A. Sugianto, “Analisis Sentimen Terhadap Tim Nasional Sepak Bola Indonesia pada Piala Dunia U-17 Menggunakan Algoritma Naïve Bayes di Platform X,” Jurnal Informatika dan Teknik Elektro Terapan, vol. 12, no. 3S1, 2024, doi:10.23960/jitet.v12i3s1.5188.


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Analisis Sentimen Timnas Indonesia pada Data Tidak Seimbang Menggunakan Perbandingan Naïve Bayes dan IndoBERT

Dimensions Badge
Article History
Submitted: 2026-04-07
Published: 2026-05-26
Abstract View: 34 times
PDF Download: 38 times
Issue
Section
Articles