Penerapan Natural Language Processing Dalam Klasifikasi Sentimen Komentar Youtube Tentang Judi Online
Abstract
YouTube, as a video-sharing platform, has become a public interaction space rich in opinions regarding online gambling issues in Indonesia. However, large-scale manual sentiment analysis is difficult due to the high data volume and local language nuances. This study aims to develop a sentiment classification model for Indonesian-language YouTube comments using Natural Language Processing (NLP) techniques to understand public perceptions of the online gambling phenomenon. Data of 3,000 comments were collected from YouTube videos related to online gambling through the YouTube Data API in Indonesia. All data were manually annotated by three annotators (kappa 0.85) into three sentiment classes (positive, negative, neutral) along with relevance, then divided into 80% training and 20% testing. Pre-processing included case folding, text cleaning, tokenization, stopword removal, stemming, lemmatization, and slang normalization. Models tested included Naive Bayes, Support Vector Machine (SVM), Long Short-Term Memory (LSTM), Bidirectional LSTM (BiLSTM), and IndoBERT. Evaluation using accuracy, precision, recall, and F1-score metrics showed IndoBERT achieved the best performance with 91.67% accuracy, 90% precision (negative class), 95% recall (negative class), and 91.66% F1-score. This research contributes to understanding public attitudes toward online gambling and the development of an adaptive sentiment classification system for the Indonesian language.
Downloads
References
Alia, I. R., Lumbanraja, F. R., Aristoteles, A., & Andrian, R. (2025). Classification of Public Sentiment towards the Performance of the Ministry of Communication and Digital regarding Online Gambling. Jurnal Pepadun, 6(3), 264–275. https://doi.org/10.23960/pepadun.v6i3.295
Amin, A. D. B. M., Bhuiyan, M. I., Kamarudin, N. S., Toh, Z., & Nafis, N. S. (2025). Sentiment Analysis On YouTube Comments Using Machine Learning Techniques Based On Video Games Content (No. arXiv:2511.06708). arXiv. https://doi.org/10.48550/arXiv.2511.06708
Ammar, M. Z., Putra, R. E., & Yamasari, Y. (2025). Deep Learning-Based Detection of Online Gambling Promotion Spam in Indonesian YouTube Comments. Journal of Applied Informatics and Computing, 9(6), 3632–3641. https://doi.org/10.30871/jaic.v9i6.11240
Apriansyah, F. M., Ramadhan, T. I., Hidayat, C. R., & Wijaya, A. K. (2025). Perbandingan IndoBERT dan IndoRoBERTa Untuk Analisis Sentimen Pada Film Dokumenter Dirty Vote. Jurnal Informatika: Jurnal Pengembangan IT, 10(3), 593–605. https://doi.org/10.30591/jpit.v10i3.8607
El Azzouzy, O., Chanyour, T., & Andaloussi, S. J. (2025). Transformer-based models for sentiment analysis of YouTube video comments. Scientific African, 29, e02836. https://doi.org/10.1016/j.sciaf.2025.e02836
Iansyah, K., Nurlaili, A. L., & Haromainy, M. M. A. (2025). Comparative Analysis of IndoBERT, IndoBERTweet, and XLM-RoBERTa for Detecting Online Gambling Comments on YouTube. Bit-Tech, 8(2), 2379–2390. https://doi.org/10.32877/bt.v8i2.3257
Prastiko, A. D., & Wiranata, A. D. (2025). Analisis Sentimen Publik terhadap Fenomena Judi Online di Media Sosial X dengan SVM. Jurnal Pustaka AI (Pusat Akses Kajian Teknologi Artificial Intelligence), 5(2), 306–315. https://doi.org/10.55382/jurnalpustakaai.v5i2.1180
Rahadian, G., & Feriza, G. (2026). Penerapan Pasal 45 Undang Undang Informasi dan Elektronik Terkait Judi Online dalam Perspektif Upaya Penegakan Hukum Terhadap Kejahatan Siber: Penelitian. Jurnal Pengabdian Masyarakat Dan Riset Pendidikan, 4(3), 15715–15727. https://doi.org/10.31004/jerkin.v4i3.4368
Widiyanto, A., Prameswari, M., & Latief, M. A. (2025). Gambling Comments Detection on Youtube: A Comparative Study of Tree-Based Boosting, LSTM and GRU Models. JUTI: Jurnal Ilmiah Teknologi Informasi, 144–160. https://doi.org/10.12962/j24068535.v23i2.a1305
Atsilah Siregar, J., & Nugroho, C. (2025). Social-mediated communication and network dynamics in online gambling discourse: A social network analysis of YouTube comments on “Indonesia Darurat Judi Online.” Frontiers in Communication, 10. https://doi.org/10.3389/fcomm.2025.1584444
Cahyo, D. D. N., Handayani, R., Lestari, V. B., & Febriani, S. (2025). Sentiment Analysis of Public Opinion on Online Gambling Through Social Media Using Convolutional Neural Network. Journal Of Informatics And Telecommunication Engineering, 9(1), 99–115. https://doi.org/10.31289/jite.v9i1.15024
Hasibuan, I. H., Budianita, E., Agustian, S., & Pizaini, P. (2023). Klasifikasi Sentimen Komentar Youtube Tentang Pembatalan Indonesia Sebagai Tuan Rumah Piala Dunia U-20 Menggunakan Algoritma Naïve Bayes Classifer. Jurnal Sistem Komputer dan Informatika (JSON), 5(2), 249. https://doi.org/10.30865/json.v5i2.7096
Kim, K., Park, K., Yoon, K., & Kim, Y. (2026). Perceptions of Sports Betting and Promotions in the U.S.: Evidence from a Mixed-Methods Sentiment Analysis of YouTube Comments. Journal of Gambling Studies. https://doi.org/10.1007/s10899-025-10467-y
Maulana, A., & Yuliana, A. (2024). Analisis Sentimen Opini Publik Terkait Judi Online Pada Pengguna Aplikasi X Menggunakan Algoritma Naïve Bayes Dan Support Vector Mechine. Jurnal Informatika dan Teknik Elektro Terapan, 12(3S1). https://doi.org/10.23960/jitet.v12i3S1.5187
Prastyo, D., Irawan, D., & Mursyidin, I. H. (2024a). Klasifikasi Sentimen Komentar YouTube dengan NLP pada Debat Pilkada Banten 2024. bit-Tech, 7(2), 413–421. https://doi.org/10.32877/bt.v7i2.1833
Putra, A., & Hendrawan, A. (2025). Analisis Sentimen Terhadap Komentar Youtube Terkait Judi Online Menggunakan Metode Convolutional Neural Network (CNN). Djtechno: Jurnal Teknologi Informasi, 6(3), 1227–1235. https://doi.org/10.46576/djtechno.v6i3.8085
Sumihar, Y. P., Lase, K. J. D., & Lase, J. H. (2025). Pengembangan Model Deep Learning dengan Slang-Aware Embeddings untuk Deteksi Promosi Judi Online. Jutisi : Jurnal Ilmiah Teknik Informatika dan Sistem Informasi, 14(3). https://doi.org/10.35889/jutisi.v14i3.3226
Sumon, R. I., Uddin, S. M. I., Akter, S., Mozumder, M. A. I., Khan, M. O., & Kim, H.-C. (2024). Natural Language Processing Influence on Digital Socialization and Linguistic Interactions in the Integration of the Metaverse in Regular Social Life. Electronics, 13(7), 1331. https://doi.org/10.3390/electronics13071331
Wisnu Mukti Darwansah, Amalia Anjani Arifiyanti, & Rizka Hadiwiyanti. (2025). Classification and Mapping of Online Gambling Based on News Articles Using NER and SVM. Jurnal Teknologi Dan Open Source, 8(2), 788–797. https://doi.org/10.36378/jtos.v8i2.4707
Arpipi, M. Y. R., Handhayani, T., & Hendryli, J. (2025). Climate Change Sentiment Analysis using LSTM. Journal of Dinda : Data Science, Information Technology, and Data Analytics, 5(1), 22–27. https://doi.org/10.20895/dinda.v5i1.1719
Bila bermanfaat silahkan share artikel ini
Berikan Komentar Anda terhadap artikel Penerapan Natural Language Processing Dalam Klasifikasi Sentimen Komentar Youtube Tentang Judi Online
Pages: 1593-1602
Copyright (c) 2026 Fransiskus Oktanesius Lase, Yoel Pieter S, Kristian Juri Damai Lase

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).













