Penerapan Natural Language Processing Dalam Klasifikasi Sentimen Komentar Youtube Tentang Judi Online


  • Fransiskus Oktanesius Lase * Mail Universitas Kristen Immanuel, Yogyakarta, Indonesia
  • Yoel Pieter S Universitas Kristen Immanuel, Yogyakarta, Indonesia
  • Kristian Juri Damai Lase Universitas Kristen Immanuel, Yogyakarta, Indonesia
  • (*) Corresponding Author
Keywords: YouTube; Sentiment Analysis; Natural Language Processing; Online Gambling; IndoBERT; LSTM; SVM; Naïve Bayes

Abstract

YouTube, as a video-sharing platform, has become a public interaction space rich in opinions regarding online gambling issues in Indonesia. However, large-scale manual sentiment analysis is difficult due to the high data volume and local language nuances. This study aims to develop a sentiment classification model for Indonesian-language YouTube comments using Natural Language Processing (NLP) techniques to understand public perceptions of the online gambling phenomenon. Data of 3,000 comments were collected from YouTube videos related to online gambling through the YouTube Data API in Indonesia. All data were manually annotated by three annotators (kappa 0.85) into three sentiment classes (positive, negative, neutral) along with relevance, then divided into 80% training and 20% testing. Pre-processing included case folding, text cleaning, tokenization, stopword removal, stemming, lemmatization, and slang normalization. Models tested included Naive Bayes, Support Vector Machine (SVM), Long Short-Term Memory (LSTM), Bidirectional LSTM (BiLSTM), and IndoBERT. Evaluation using accuracy, precision, recall, and F1-score metrics showed IndoBERT achieved the best performance with 91.67% accuracy, 90% precision (negative class), 95% recall (negative class), and 91.66% F1-score. This research contributes to understanding public attitudes toward online gambling and the development of an adaptive sentiment classification system for the Indonesian language.

Downloads

Download data is not yet available.

References

Alia, I. R., Lumbanraja, F. R., Aristoteles, A., & Andrian, R. (2025). Classification of Public Sentiment towards the Performance of the Ministry of Communication and Digital regarding Online Gambling. Jurnal Pepadun, 6(3), 264–275. https://doi.org/10.23960/pepadun.v6i3.295

Amin, A. D. B. M., Bhuiyan, M. I., Kamarudin, N. S., Toh, Z., & Nafis, N. S. (2025). Sentiment Analysis On YouTube Comments Using Machine Learning Techniques Based On Video Games Content (No. arXiv:2511.06708). arXiv. https://doi.org/10.48550/arXiv.2511.06708

Ammar, M. Z., Putra, R. E., & Yamasari, Y. (2025). Deep Learning-Based Detection of Online Gambling Promotion Spam in Indonesian YouTube Comments. Journal of Applied Informatics and Computing, 9(6), 3632–3641. https://doi.org/10.30871/jaic.v9i6.11240

Apriansyah, F. M., Ramadhan, T. I., Hidayat, C. R., & Wijaya, A. K. (2025). Perbandingan IndoBERT dan IndoRoBERTa Untuk Analisis Sentimen Pada Film Dokumenter Dirty Vote. Jurnal Informatika: Jurnal Pengembangan IT, 10(3), 593–605. https://doi.org/10.30591/jpit.v10i3.8607

El Azzouzy, O., Chanyour, T., & Andaloussi, S. J. (2025). Transformer-based models for sentiment analysis of YouTube video comments. Scientific African, 29, e02836. https://doi.org/10.1016/j.sciaf.2025.e02836

Iansyah, K., Nurlaili, A. L., & Haromainy, M. M. A. (2025). Comparative Analysis of IndoBERT, IndoBERTweet, and XLM-RoBERTa for Detecting Online Gambling Comments on YouTube. Bit-Tech, 8(2), 2379–2390. https://doi.org/10.32877/bt.v8i2.3257

Prastiko, A. D., & Wiranata, A. D. (2025). Analisis Sentimen Publik terhadap Fenomena Judi Online di Media Sosial X dengan SVM. Jurnal Pustaka AI (Pusat Akses Kajian Teknologi Artificial Intelligence), 5(2), 306–315. https://doi.org/10.55382/jurnalpustakaai.v5i2.1180

Rahadian, G., & Feriza, G. (2026). Penerapan Pasal 45 Undang Undang Informasi dan Elektronik Terkait Judi Online dalam Perspektif Upaya Penegakan Hukum Terhadap Kejahatan Siber: Penelitian. Jurnal Pengabdian Masyarakat Dan Riset Pendidikan, 4(3), 15715–15727. https://doi.org/10.31004/jerkin.v4i3.4368

Widiyanto, A., Prameswari, M., & Latief, M. A. (2025). Gambling Comments Detection on Youtube: A Comparative Study of Tree-Based Boosting, LSTM and GRU Models. JUTI: Jurnal Ilmiah Teknologi Informasi, 144–160. https://doi.org/10.12962/j24068535.v23i2.a1305

Atsilah Siregar, J., & Nugroho, C. (2025). Social-mediated communication and network dynamics in online gambling discourse: A social network analysis of YouTube comments on “Indonesia Darurat Judi Online.” Frontiers in Communication, 10. https://doi.org/10.3389/fcomm.2025.1584444

Cahyo, D. D. N., Handayani, R., Lestari, V. B., & Febriani, S. (2025). Sentiment Analysis of Public Opinion on Online Gambling Through Social Media Using Convolutional Neural Network. Journal Of Informatics And Telecommunication Engineering, 9(1), 99–115. https://doi.org/10.31289/jite.v9i1.15024

Hasibuan, I. H., Budianita, E., Agustian, S., & Pizaini, P. (2023). Klasifikasi Sentimen Komentar Youtube Tentang Pembatalan Indonesia Sebagai Tuan Rumah Piala Dunia U-20 Menggunakan Algoritma Naïve Bayes Classifer. Jurnal Sistem Komputer dan Informatika (JSON), 5(2), 249. https://doi.org/10.30865/json.v5i2.7096

Kim, K., Park, K., Yoon, K., & Kim, Y. (2026). Perceptions of Sports Betting and Promotions in the U.S.: Evidence from a Mixed-Methods Sentiment Analysis of YouTube Comments. Journal of Gambling Studies. https://doi.org/10.1007/s10899-025-10467-y

Maulana, A., & Yuliana, A. (2024). Analisis Sentimen Opini Publik Terkait Judi Online Pada Pengguna Aplikasi X Menggunakan Algoritma Naïve Bayes Dan Support Vector Mechine. Jurnal Informatika dan Teknik Elektro Terapan, 12(3S1). https://doi.org/10.23960/jitet.v12i3S1.5187

Prastyo, D., Irawan, D., & Mursyidin, I. H. (2024a). Klasifikasi Sentimen Komentar YouTube dengan NLP pada Debat Pilkada Banten 2024. bit-Tech, 7(2), 413–421. https://doi.org/10.32877/bt.v7i2.1833

Putra, A., & Hendrawan, A. (2025). Analisis Sentimen Terhadap Komentar Youtube Terkait Judi Online Menggunakan Metode Convolutional Neural Network (CNN). Djtechno: Jurnal Teknologi Informasi, 6(3), 1227–1235. https://doi.org/10.46576/djtechno.v6i3.8085

Sumihar, Y. P., Lase, K. J. D., & Lase, J. H. (2025). Pengembangan Model Deep Learning dengan Slang-Aware Embeddings untuk Deteksi Promosi Judi Online. Jutisi : Jurnal Ilmiah Teknik Informatika dan Sistem Informasi, 14(3). https://doi.org/10.35889/jutisi.v14i3.3226

Sumon, R. I., Uddin, S. M. I., Akter, S., Mozumder, M. A. I., Khan, M. O., & Kim, H.-C. (2024). Natural Language Processing Influence on Digital Socialization and Linguistic Interactions in the Integration of the Metaverse in Regular Social Life. Electronics, 13(7), 1331. https://doi.org/10.3390/electronics13071331

Wisnu Mukti Darwansah, Amalia Anjani Arifiyanti, & Rizka Hadiwiyanti. (2025). Classification and Mapping of Online Gambling Based on News Articles Using NER and SVM. Jurnal Teknologi Dan Open Source, 8(2), 788–797. https://doi.org/10.36378/jtos.v8i2.4707

Arpipi, M. Y. R., Handhayani, T., & Hendryli, J. (2025). Climate Change Sentiment Analysis using LSTM. Journal of Dinda : Data Science, Information Technology, and Data Analytics, 5(1), 22–27. https://doi.org/10.20895/dinda.v5i1.1719


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Penerapan Natural Language Processing Dalam Klasifikasi Sentimen Komentar Youtube Tentang Judi Online

Dimensions Badge
Article History
Published: 2026-02-01
Abstract View: 722 times
PDF Download: 640 times
Issue
Section
Articles