Comparison of Ensemble Methods for Detecting Hoax News


  • Delvanita Sri Wahyuni Telkom University, Bandung, Indonesia
  • Yuliant Sibaroni * Mail Telkom University, Bandung, Indonesia
  • (*) Corresponding Author
Keywords: Hoax; Covid-19 Vaccine; Ensemble; Random Forest; Adaboost

Abstract

The spread of hoaxes in Indonesia has become a big concern for the public, especially now that the COVID-19 virus pandemic is hitting the whole world. Due to the large number of people who believe the hoax news regarding the COVID-19 vaccination that has spread on social media, many people refuse to carry out the COVID-19 vaccination as a form of government effort in dealing with this pandemic. Therefore, people need to be wiser when reading news on social networks. To help the public not to read hoaxes, it is necessary to classify the COVID-19 vaccine hoax. This study builds a system to classify hoax news on the COVID-19 vaccine. The model was built using the ensemble method by comparing the Random Forest and AdaBoost algorithms to choose a good classification for detecting hoaxes. In this research, there are use two test scenarios. The first scenario is an experiment using the Random Forest algorithm method and the second scenario is an experiment using the Adoboost algorithm method. The experimental results show that the first scenario produces a good accuracy value with the random forest algorithm method of 93.58%.

Downloads

Download data is not yet available.

References

“WHO Director-General’s opening remarks at the media briefing on COVID-19 - 11 March 2020.” https://www.who.int/director-general/speeches/detail/who-director-general-s-opening-remarks-at-the-media-briefing-on-covid-19---11-march-2020 (accessed Jul. 07, 2022).

D. Cucinotta and M. Vanelli, “WHO Declares COVID-19 a Pandemic,” Acta Biomed, vol. 91, pp. 157–160, 2020, doi: 10.23750/abm.v91i1.9397.

“View of Behavior of Following Health Protocols (Keeping Distance, Washing Hands and Wearing Masks) as a Form of State Defense in the Era of COVID-19.” https://cerdika.publikasiindonesia.id/index.php/cerdika/article/view/9/64 (accessed Jul. 07, 2022).

K. Lutfiyah, “HOAX AND FAKE NEWS DURING COVID-19: IS THE LAW EFFECTIVE IN OVERCOMING IT?,” 2020, doi: 10.15294/ijicle.v2i3.38422.

M. D. Molina, S. S. Sundar, T. Le, and D. Lee, “‘Fake News’ Is Not Simply False Information: A Concept Explication and Taxonomy of Online Content,” American Behavioral Scientist, vol. 65, no. 2, pp. 180–212, Feb. 2021, doi: 10.1177/0002764219878224.

A. Awalina, J. Fawaid, R. Yunus Krisnabayu, and N. Yudistira, “Indonesia’s Fake News Detection using Transformer Network.” [Online]. Available: https://github.com/JibranFawaid/turnbackhoax-dataset.

H. A. Santoso, E. H. Rachmawanto, and U. Hidayati, “Fake Twitter Account Classification of Fake News Spreading Using Naïve Bayes,” Scientific Journal of Informatics, vol. 7, no. 2, pp. 2407–7658, 2020, [Online]. Available: http://journal.unnes.ac.id/nju/index.php/sji

A. T. B. Panjaitan and D. I. Santoso, “Deteksi Hoaks Pada Berita Berbahasa Indonesia Seputar COVID-19,” FORMAT: Jurnal Ilmiah Teknik Informatika, vol. 10, no. 1, pp. 76–85, 2021, [Online]. Available: https://turnbackhoax.id.

S. D. Das, A. Basak, and S. Dutta, “A Heuristic-driven Ensemble Framework for COVID-19 Fake News Detection,” Jan. 2021, [Online]. Available: http://arxiv.org/abs/2101.03545

S. Ramani, M. S. Kumar, and A. Professor, “Identification of Fake News through SVM and Random Forest,” 2020. [Online]. Available: http://ijesc.org/

“Beranda | Covid19.go.id.” https://covid19.go.id/ (accessed Jul. 14, 2022).

N. Mahdi Abdulkareem and A. Mohsin Abdulazeez, “Machine Learning Classification Based on Radom Forest Algorithm: A Review,” International Journal of Science and Business, vol. 5, no. 2, pp. 128–142, 2021, doi: 10.5281/zenodo.4471118.

K. Li, G. Zhou, J. Zhai, F. Li, and M. Shao, “Improved PSO_AdaBoost ensemble algorithm for imbalanced data,” Sensors (Switzerland), vol. 19, no. 6, Mar. 2019, doi: 10.3390/s19061476.

W. Hidayat, E. Utami, and A. Sunyoto, “Selection of the Best K-Gram Value on Modified Rabin-Karp Algorithm,” IJCCS (Indonesian Journal of Computing and Cybernetics Systems), vol. 16, no. 1, p. 11, Jan. 2022, doi: 10.22146/ijccs.63686.

K. Kowsari, K. J. Meimandi, M. Heidarysafa, S. Mendu, L. Barnes, and D. Brown, “Text classification algorithms: A survey,” Information (Switzerland), vol. 10, no. 4. MDPI AG, 2019. doi: 10.3390/info10040150.

M. Rusli et al., “EKSTRAKSI FITUR MENGGUNAKAN MODEL WORD2VEC PADA SENTIMENT ANALYSIS KOLOM KOMENTAR KUISIONER EVALUASI DOSEN OLEH MAHASISWA,” vol. 07, no. 1, 2020.

N. Nanda Widyastuti, A. Bijaksana, and I. Lukmana Sardi, “Analisis Word2vec untuk Perhitungan Kesamaan Semantik antar Kata,” e-Proceeding of Engineering, vol. 5, no. 3, pp. 7603–7612, 2018.

P. Compagnon and K. Ollivier, “Graph Embeddings for Social Network Analysis: State of the Art PhD Thesis-Toward unsupervised activity monitoring with sequence metric learning View project,” 2017. [Online]. Available: https://www.researchgate.net/publication/331714802

E. M. Dharma, F. Lumban Gaol, H. Leslie, H. S. Warnars, and B. Soewito, “THE ACCURACY COMPARISON AMONG WORD2VEC, GLOVE, AND FASTTEXT TOWARDS CONVOLUTION NEURAL NETWORK (CNN) TEXT CLASSIFICATION,” J Theor Appl Inf Technol, vol. 31, no. 2, 2022, [Online]. Available: www.jatit.org

R. Singh, “Machine Learning Algorithms and Ensemble Technique to Improve Prediction of Students Performance,” International Journal of Advanced Trends in Computer Science and Engineering, vol. 9, no. 3, pp. 3970–3976, Jun. 2020, doi: 10.30534/ijatcse/2020/221932020.

B. Bahrawi, “SENTIMENT ANALYSIS USING RANDOM FOREST ALGORITHM ONLINE SOCIAL MEDIA BASED SENTIMENT ANALYSIS USING RANDOM FOREST ALGORITHM-ONLINE SOCIAL MEDIA BASED,” 2019. [Online]. Available: https://www.researchgate.net/publication/338548518

C. Albon, “Machine Learning with Python Cookbook Practical Solutions from Preprocessing to Deep Learning,” 2019.

L. Malem Ginting, M. M. Sigiro, G. Yola Lumbantoruan, and dan Januard Lumbangaol, “Prediksi Indikator Perawatan Menggunakan Random Forest Classification dan Classification and Regression Trees,” 2022. [Online]. Available: http://journal-jati.del.ac.id/

M. Vakili, M. Ghamsari, and M. Rezaei, “Performance Analysis and Comparison of Machine and Deep Learning Algorithms for IoT Data Classification,” 2020.


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Comparison of Ensemble Methods for Detecting Hoax News

Dimensions Badge
Article History
Submitted: 2022-07-25
Published: 2022-09-30
Abstract View: 1284 times
PDF Download: 506 times
How to Cite
Wahyuni, D., & Sibaroni, Y. (2022). Comparison of Ensemble Methods for Detecting Hoax News. Building of Informatics, Technology and Science (BITS), 4(2), 839−846. https://doi.org/10.47065/bits.v4i2.1957
Section
Articles

Most read articles by the same author(s)

1 2 3 > >>