Detecting Hoax Content on Social Media Using Bi-LSTM and RNN
Abstract
Online media, such as websites and applications, have become a communication tool available on the internet. Social media is a part of online media that can be used to spread news, opinions, or even hoaxes, such as through Twitter. Although hoaxes are difficult to eliminate, several systems have been built using deep learning approaches that can process text and images to detect the truthfulness of news. In this study, four systems were built using four deep learning methods, namely Bi-directional Long Short-Term Memory (Bi-LSTM), Recurrent Neural Network (RNN), hybrid RNN-Bi-LSTM, and hybrid Bi-LSTM-RNN. Feature extraction was performed using Term Frequency - Inverse Document Frequency (TF-IDF) and feature expansion was performed using Global Vectors (GloVe). The data used has been adjusted according to the keyword of fake news on mainstream news portals. This study attempted several scenarios to compare the various methods that have been built, with the aim of finding the best method that provides the highest accuracy. The results showed that the Bi-LSTM method had the highest accuracy of 96.48%, while the hybrid Bi-LSTM-RNN method ranked second with an accuracy of 96.36%, followed by the RNN method with an accuracy of 95.49%, and the hybrid RNN-Bi-LSTM method with an accuracy of 95.34%.
Downloads
References
Online media, such as websites and applications, have become a communication tool available on the internet. Social media is a part of online media that can be used to spread news, opinions, or even hoaxes, such as through Twitter. Although hoaxes are difficult to eliminate, several systems have been built using deep learning approaches that can process text and images to detect the truthfulness of news. In this study, four systems were built using four deep learning methods, namely Bi-directional Long Short-Term Memory (Bi-LSTM), Recurrent Neural Network (RNN), hybrid RNN-Bi-LSTM, and hybrid Bi-LSTM-RNN. Feature extraction was performed using Term Frequency - Inverse Document Frequency (TF-IDF) and feature expansion was performed using Global Vectors (GloVe). The data used has been adjusted according to the keyword of fake news on mainstream news portals. This study attempted several scenarios to compare the various methods that have been built, with the aim of finding the best method that provides the highest accuracy. The results showed that the Bi-LSTM method had the highest accuracy of 96.48%, while the hybrid Bi-LSTM-RNN method ranked second with an accuracy of 96.36%, followed by the RNN method with an accuracy of 95.49%, and the hybrid RNN-Bi-LSTM method with an accuracy of 95.34%.
Bila bermanfaat silahkan share artikel ini
Berikan Komentar Anda terhadap artikel Detecting Hoax Content on Social Media Using Bi-LSTM and RNN
Pages: 114−125
Copyright (c) 2023 Hilman Bayu Aji, Erwin Budi Setiawan
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).