Election Hoax Detection on X using CNN with TF-RF and TF-IDF Weighting Features

Dila Adelia; Widi Astuti; Kemas Muslim Lhaksmana

doi:10.47065/josyc.v5i4.5778

Dila Adelia * Telkom University, Bandung, Indonesia
Widi Astuti Telkom University, Bandung, Indonesia
Kemas Muslim Lhaksmana Telkom University, Bandung, Indonesia

(*) Corresponding Author

DOI: https://doi.org/10.47065/josyc.v5i4.5778

Keywords: X Social Media; Hoax; Convolutional Neural Network; TF-RF; TF-IDF

Abstract

X social media is a microblogging platform for sharing brief thoughts and trends. It has become a focal point for expressing political views. The increased political engagement on X social media has facilitated the swift and extensive sharing of ideas. Still, it also brings the risk of spreading false information and hoaxes that can manipulate public opinion. Preventing fake news on social media is crucial because it can influence election outcomes and social stability. For example, X social media has been used during elections to spread hoaxes, such as false claims of vote tampering or misleading information about candidate qualifications. This study implements a Convolutional Neural Network (CNN) due to its advantages in recognizing complex patterns and achieving high performance in tasks like classification. The dataset used in this study consists of 2,670 tweets. The dataset is divided into three subsets: 60% for training, 20% for testing, and 20% for validation. It also uses Term Frequency Relevance Frequency (TF-RF) and Term Frequency Inverse Document Frequency (TF-IDF) weighting features to improve accuracy in detecting fake news. This study compares the TF-RF and TF-IDF weighting features using the CNN classification method on the topic of the 2024 election. The testing results indicate that both TF-RF and TF-IDF achieved similar overall performance, with TF-RF slightly excelling in recall and F1-score. At the same time, TF-IDF showed a marginally higher precision.

Downloads

Download data is not yet available.

References

C. C. Wang, “Fake News and Related Concepts: Definitions and Recent Research Development,” Contemporary Management Research, vol. 16, no. 3, pp. 145–174, Sep. 2020, doi: 10.7903/CMR.20677.

V. Oktaviana Yamin, A. Tenriawaru, L. Ode Saidi, and G. Arviana Rahman, “Penerapan Naïve Bayes Classifier dengan Algoritma Nazief dan Adriani Untuk Deteksi Hoaks,” Prosiding Seminar Nasional Pemanfaatan Sains dan Teknologi Informasi, vol. 1, no. 1, pp. 335–344, 2023.

S. Cinta Insani, N. Alisya Zahwa Khuzaimah, V. Zia Devita Maryadi, and T. Alya Hafizha, “Meninjau Etika Masyarakat Indonesia Dalam Bermedia Sosial Di Masa Pemilu Menggunakan Etika Media Sosial,” Jurnal Pendidikan, Seni, Sains dan Sosial Humaniora, vol. 1, no. 2, pp. 1–25, 2023, doi: 10.11111/nusantara.xxxxxxx.

F. Rakan Tama and Y. Sibaroni, “Fake News (Hoaxes) Detection on Twitter Social Media Content through Convolutional Neural Network (CNN) Method,” JINAV: Journal of Information and Visualization, vol. 1, no. 1, pp. 2746–1440, 2020, doi: 10.35877/454RI.jinav2125.

A. S. Nurhikam, R. Syaputra, S. Rohman, S. R. Priyambodo, and N. Agustina, “Deteksi Berita Palsu Pada Pemilu 2024 Dengan Menggunakan Algoritma Random Forest,” Journal of Computer and Information Technology, vol. 7, no. 1, pp. 41–50, 2023.

A. R. I. Fauzy and B. S. Erwin, “Detecting Fake News on Social Media Combined with the CNN Methods,” JURNAL RESTI (RekayasaSistemdanTeknologiInformasi), vol. 7, no. 2, pp. 271–277, 2023, doi: 10.29207/resti.v7i1.4889.

A. D. Cahyani, A. K. Ramdani, and Y. Sibaroni, “Hoax Detection of Covid-19 News using Convolutional Neural Network and Support Vector Machine,” Intl. Journal on ICT, vol. 9, no. 2, pp. 177–185, 2023, doi: 10.21108/ijoict.v9i2.872.

Z. Tang, W. Li, Y. Li, W. Zhao, and S. Li, “Several alternative term weighting methods for text representation and classification,” Knowl Based Syst, vol. 207, Nov. 2020, doi: 10.1016/j.knosys.2020.106399.

A. Thakkar and K. Chaudhari, “Predicting stock trend using an integrated term frequency–inverse document frequency-based feature weight matrix with neural networks,” Applied Soft Computing Journal, vol. 96, p. 106684, Nov. 2020, doi: 10.1016/j.asoc.2020.106684.

A. N. Assidyk, E. B. Setiawan, and I. Kurniawan, “Analisis Perbandingan Pembobotan TF-IDF dan TF-RF pada Trending Topic di Twitter dengan Menggunakan Klasifikasi K-Nearest Neighbor,” e-Proceeding of Engineering, vol. 7, no. 2, pp. 7773–7781, 2020.

J. A. Nasir, O. S. Khan, and I. Varlamis, “Fake news detection: A hybrid CNN-RNN based deep learning approach,” International Journal of Information Management Data Insights, vol. 1, no. 1, p. 100007, Apr. 2021, doi: 10.1016/j.jjimei.2020.100007.

D. A. N. Taradhita and I. K. G. D. Putra, “Hate Speech Classification in Indonesian Language Tweets by Using Convolutional Neural Network,” Journal of ICT Research and Applications, vol. 14, no. 3, pp. 225–239, 2021, doi: 10.5614/itbj.ict.res.appl.2021.14.3.2.

A. Awalina, F. A. Bachtiar, F. Utaminingrum, and P. Korespondensi, “Perbandingan Pretrained Model Transformer pada Deteksi Ulasan Palsu,” Jurnal Teknologi Informasi dan Ilmu Komputer (JTIIK), vol. 9, no. 3, pp. 597–604, 2022, doi: 10.25126/jtiik.202295696.

P. Song, C. Geng, and Z. Li, “Research on Text Classification Based on Convolutional Neural Network,” International Conference on Computer Network, Electronic and Automation, ICCNEA, pp. 229–232, Sep. 2019, doi: 10.1109/ICCNEA.2019.00052.

G. B. Herwanto, A. M. Ningtyas, I. G. Mujiyatna, K. E. Nugraha, and I. N. Prayana Trisna, “Hate Speech Detection in Indonesian Twitter using Contextual Embedding Approach,” IJCCS (Indonesian Journal of Computing and Cybernetics Systems), vol. 15, no. 2, pp. 177–188, Apr. 2021, doi: 10.22146/ijccs.64916.

M. K. A. Reiki, Y. Sibaroni, and E. B. Setiawan, “Comparison of Term Weighting Methods in Sentiment Analysis of the New State Capital of Indonesia with the SVM Method,” International Journal on Information and Communication Technology (IJoICT), vol. 8, no. 2, pp. 53–65, Jan. 2022, doi: 10.21108/ijoict.v8i2.681.

H. Liu, X. Chen, and X. Liu, “A Study of the Application of Weight Distributing Method Combining Sentiment Dictionary and TF-IDF for Text Sentiment Analysis,” IEEE Access, vol. 10, pp. 32280–32289, 2022, doi: 10.1109/ACCESS.2022.3160172.

R. N. Aufa, S. S. Prasetiyowati, and Y. Sibaroni, “The Effect of Feature Weighting on Sentiment Analysis TikTok Application Using The RNN Classification,” Building of Informatics, Technology and Science (BITS), vol. 5, no. 1, pp. 345–353, Jun. 2023, doi: 10.47065/bits.v5i1.3597.

M. M. Taye, “Understanding of Machine Learning with Deep Learning: Architectures, Workflow, Applications and Future Directions,” May 01, 2023, doi: 10.3390/computers12050091.

Z. Li, F. Liu, W. Yang, S. Peng, and J. Zhou, “A Survey of Convolutional Neural Networks: Analysis, Applications, and Prospects,” IEEE Trans Neural Netw Learn Syst, vol. 33, no. 12, pp. 6999–7019, Dec. 2022, doi: 10.1109/TNNLS.2021.3084827.

Yuliska, H. D. Qudsi, H. L. Juanda, U. K. Syaliman, and F. N. Nina, “Analisis sentimen pada data saran mahasiswa terhadap kinerja departemen di perguruan tinggi menggunakan Convolutional Neural Network,” Jurnal Teknologi Informasi dan Ilmu Komputer, vol. 8, no. 5, pp. 1067–1076, 2021, doi: 10.25126/jtiik.202184842.

Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Election Hoax Detection on X using CNN with TF-RF and TF-IDF Weighting Features