Deteksi Komentar dan Analisis Sentimen Promosi Judi Online pada Youtube Menggunakan IndoBERT dan XGBoost
Abstract
YouTube, as a highly interactive platform, has become a medium for online gambling promotions, raising legal issues under the Electronic Information and Transactions (ITE) Law and social risks, particularly for adolescents. This study aims to analyse public responses to gambling-related comments and to develop an automatic detection system using Natural Language Processing (NLP). The research follows the Knowledge Discovery in Databases (KDD) stages, including web scraping, preprocessing, text transformation, model training, and evaluation. Sentiment analysis was performed on 999 comments labelled positive, negative, and neutral. Detection of promotional content was tested using IndoBERT and TF-IDF-based XGBoost, with 587 training samples and 885 external testing samples at an 80:20 ratio. The results show that the majority of comments (52.65%) are positive with a fairly high average confidence score (0.914), indicating public support for the eradication of online gambling. Meanwhile, negative comments (24.72%) with a confidence score of 0.888 generally contained criticism of the rampant practice of gambling promotion or YouTube's weak moderation system. For automatic detection, IndoBERT achieved superior performance with 0.94 accuracy and F1-score and only 10 misclassifications, significantly outperforming XGBoost, which reached 0.73 accuracy with 47 errors. This study highlights the effectiveness of transformer-based models in detecting gambling promotions while also indicating strong public support for eradication efforts. These findings provide an empirical foundation for advancing research on adaptive automated moderation systems capable of identifying concealed patterns of illicit content in digital platforms, particularly in the detection of online gambling promotional comments within the YouTube ecosystem.
Downloads
References
J. P. Suharsono and D. Nurahman, “Pemanfaatan Youtube Sebagai Media Peningkatan Pelayanan Dan Informasi,” Ganaya: Jurnal Ilmu Sosial dan Humaniora, vol. 7, no. 1, pp. 298–304, 2024, doi: https://doi.org/10.37329/ganaya.v7i1.3157.
F. N. F. Firmansyah, “Analisis Sentimen dan Jaringan Sosial pada Komen Video ‘Youtube Tangan Kotor Influencer’ dalam Judi Online Milik Ferri Irwandi,” Soedirman Accounting, Auditing and Public Sector Journal (SAAP), vol. 3, pp. 1–10, doi: https://doi.org/10.32424/1.saap.2024.3.1.12267.
M. D. Angelo, R. W. Harwenda, I. Budi, A. B. Santoso, and P. K. Putra, “Sentiment Analysis and Topic Modeling of Public Opinion on Indonesia New Capital City Development Policies,” Eduvest-Journal of Universal Studies, vol. 5, no. 5, pp. 5951–5969, 2025, doi: https://doi.org/10.59188/eduvest.v5i5.51234.
H. Cokronegoro and H. Prasetyo, “Optimizing Law Enforcement in the Case of Online Gambling Promoters/Endorsements.,” Media Iuris, vol. 8, no. 1, 2025, doi: https://doi.org/10.20473/mi.v8i1.59348.
I. Rodhiyah, I. P. Hapsari, and H. I. Iskandar, “Pertanggung Jawaban Pelaku Tindak Pidana Perjudian Online di Indonesia,” Al-Manhaj: Jurnal Hukum Dan Pranata Sosial Islam, vol. 4, no. 2, pp. 591–600, 2022, doi: https://doi.org/10.37680/almanhaj.v4i2.1986.
D. D. N. Cahyo, R. Handayani, V. B. Lestari, and S. Febriani, “Sentiment Analysis of Public Opinion on Online Gambling Through Social Media Using Convolutional Neural Network,” Journal Of Informatics And Telecommunication Engineering, vol. 9, no. 1, pp. 99–115, 2025, doi: https://doi.org/10.31289/jite.v9i1.15024.
A. F. Al Farizi and Y. Sibaroni, “Implementation of BiLSTM and IndoBERT for Sentiment Analysis of TikTok Reviews,” JIPI (Jurnal Ilmiah Penelitian dan Pembelajaran Informatika), vol. 10, no. 1, pp. 96–106, 2025, doi: https://doi.org/10.29100/jipi.v10i1.5815.
B. P. Aji, C. Sri, and K. Aditya, “Klasifikasi Sentimen Ulasan Produk pada Platform E-Commerce di Indonesia dengan Menggunakan Model Pre-Trained IndoBERT,” Technology and Science (BITS), vol. 6, no. 4, 2025, doi: 10.47065/bits.v6i4.6968, doi: https://doi.org/10.47065/bits.v6i4.6968.
F. M. Apriansyah, T. I. Ramadhan, C. R. Hidayat, and A. K. Wijaya, “Perbandingan IndoBERT dan IndoRoBERTa Untuk Analisis Sentimen Pada Film Dokumenter Dirty Vote,” Jurnal Informatika: Jurnal Pengembangan IT, vol. 10, no. 3, pp. 593–605, 2025, doi: https://doi.org/10.30591/jpit.v10i3.8607.
A. Y. Pratama, G. A. Sanjaya, N. K. Lubis, M. R. Aditya, and Y. Yennimar, “Analisis Sentimen Publik Terkait Danantara Menggunakan Algoritma IndoBERT pada Platform Media Sosial,” METIK Jurnal, vol. 9, no. 1, pp. 92–100, 2025, doi: https://doi.org/10.47002/metik.v9i1.1055.
G. F. Situmorang and R. Purba, “Deteksi Potensi Depresi dari Unggahan Media Sosial X Menggunakan IndoBERT,” Building of Informatics, Technology and Science (BITS), vol. 6, no. 2, pp. 649–661, Sep. 2024, doi: https://doi.org/10.47065/bits.v6i2.5496.
M. D. Maulana and C. S. K. Aditya, “Perbandingan IndoBERT dan Bi-LSTM Dalam Mendeteksi Pelanggaran Undang-Undang ITE,” SINTECH (Science and Information Technology) Journal, vol. 8, no. 1, pp. 52–59, 2025, doi: https://doi.org/10.31598/sintechjournal.v8i1.1846.
M. I. Rizaldi, D. R. Chandranegara, and D. R. Akbi, “Comparison Of Machine Learning Techniques For Classification Of Distributed Denial Of Service Attacks Based On Feature Engineering In Sdn-Based Networks,” JIPI (Jurnal Ilmiah Penelitian dan Pembelajaran Informatika), vol. 9, no. 3, pp. 1180–1197, 2024, doi: https://doi.org/10.29100/jipi.v9i3.5262.
I. A. Latifah, F. A. Rafrastara, J. Bintoro, W. Ghozi, and W. M. Osman, “Comparative Analysis of Feature Selection Methods with XGBoost for Malware Detection on the Drebin Dataset,” Jurnal Sisfokom (Sistem Informasi dan Komputer), vol. 13, no. 3, pp. 403–409, 2024, doi: https://doi.org/10.32736/sisfokom.v13i3.2294.
S. Rosyada, F. A. Rafrastara, A. Ramadhani, W. Ghozi, and W. Yassin, “Enhancing XGBoost Performance in Malware Detection through Chi-Squared Feature Selection,” Jurnal Sisfokom (Sistem Informasi dan Komputer), vol. 13, no. 3, pp. 396–402, 2024, doi: https://doi.org/10.32736/sisfokom.v13i3.2293.
S. B. Bulkisah, R. Astuti, and A. Bahtiar, “Implementasi Data Mining Algoritma Decision Tree Untuk Klasifikasi Status Gizi Balita Di Kecamatan Ciledug,” Jurnal Ilmiah Informatika Komputer, vol. 29, no. 1, pp. 1–12, 2024, doi: https://doi.org/10.35760/ik.2024.v29i1.10346.
A. S. Aribowo, H. Basiron, N. F. Abd Yusof, and S. Khomsah, “Cross-domain sentiment analysis model on Indonesian YouTube comment.,” International Journal of Advances in Intelligent Informatics, vol. 7, no. 1, 2021, doi: https://doi.org/10.26555/ijain.v7i1.554.
S. Khairunnisa, A. Adiwijaya, and S. Al Faraby, “Pengaruh Text Preprocessing terhadap Analisis Sentimen Komentar Masyarakat pada Media Sosial Twitter (Studi Kasus Pandemi COVID-19),” J. Media Inform. Budidarma, vol. 5, no. 2, p. 406, 2021, doi: https://doi.org/10.30865/mib.v5i2.2835.
L. R. Krosuri and R. S. Aravapalli, “Novel heuristic-based hybrid ResNeXt with recurrent neural network to handle multi class classification of sentiment analysis,” Mach Learn Sci Technol, vol. 4, no. 1, p. 015033, 2023, doi: https://doi.org/10.1088/2632-2153/acc0d5.
F. R. Andhika, W. Witanti, and P. N. Sabrina, “Analisis Sentimen Menggunakan Metode IndoBERT Pada Ulasan Aplikasi Zoom Menggunakan Fitur Ekstrasi GloVe,” METIK Jurnal, vol. 9, no. 2, pp. 439–448, 2025, doi: https://doi.org/10.47002/g38fxb78.
A. K. Saputra, M. R. Handayani, N. C. H. Wibowo, and K. Umam, “Sentiment Analysis of User Reviews on the Game GTA V Using Support Vector Machine,” Jurnal Sisfokom (Sistem Informasi dan Komputer), vol. 14, no. 3, pp. 284–290, 2025, doi: https://doi.org/10.32736/sisfokom.v14i3.2368.
Y. Yulistiani and S. Styawati, “Analisis Sentimen Terhadap Calon Presiden Indonesia 2024 dengan Metode Extreme Gradient Boosting (XGBoost),” Jurnal Informatika: Jurnal Pengembangan IT, vol. 9, no. 3, pp. 322–328, 2024, doi: https://doi.org/10.30591/jpit.v9i3.6127.
A. Rácz, D. Bajusz, and K. Héberger, “Effect of dataset size and train/test split ratios in QSAR/QSPR multiclass classification,” Molecules, vol. 26, no. 4, p. 1111, 2021, doi: https://doi.org/10.3390/molecules26041111.
A. Kunaefi, Z. Abidin, and R. Kusumawati, “Klasifikasi Berita Hoaks Bahasa Indonesia Menggunakan Indobert Fine-Tuning Dengan Pendeka-Tan Focal Loss Pada Data Tidak Seimbang,” JIPI (Jurnal Ilmiah Penelitian dan Pembelajaran Informatika), vol. 10, no. 2, pp. 1706–1714, May 2025, doi: https://doi/org/10.29100/jipi.v10i2.7811.
M. C. T. Manullang, A. Z. Rakhman, H. Tantriawan, and A. Setiawan, “Comparative Analysis of CNN, Transformers, and Traditional ML for Classifying Online Gambling Spam Comments in Indonesian,” Journal of Applied Informatics and Computing, vol. 9, no. 3, pp. 592–602, 2025, doi: https://doi.org/10.30871/jaic.v9i3.9468.
C. M. Bachri and W. Gunawan, “Deteksi Email Spam menggunakan Algoritma Convolutional Neural Network (CNN),” JEPIN (Jurnal Edukasi dan Penelitian Informatika), vol. 10, no. 1, pp. 88–94, 2024, doi: https://doi.org/10.26418/jp.v10i1.73306.
J. Singer, A. Wöhr, and S. Otterbach, “Gambling operators’ use of advertising strategies on social media and their effects: A systematic review,” Curr Addict Rep, vol. 11, no. 3, pp. 437–446, 2024, doi: https://doi.org/10.1007/s40429-024-00560-4.
Bila bermanfaat silahkan share artikel ini
Berikan Komentar Anda terhadap artikel Deteksi Komentar dan Analisis Sentimen Promosi Judi Online pada Youtube Menggunakan IndoBERT dan XGBoost
Pages: 1592-1600
Copyright (c) 2025 Naila Raihana Putri, Dedy Kurniawan, Ken Ditha Tania

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).





















