Comparison of TF-IDF and GloVe Word Embedding for Sentiment Analysis of 2024 Presidential Candidates


  • Tiara Firdausa Abdillah * Mail Telkom University, Indonesia
  • Hasmawati Hasmawati Telkom University, Indonesia
  • Bunyamin Bunyamin Telkom University, Indonesia
  • (*) Corresponding Author
Keywords: Presidential Candidates; 2024 Elections; SVM; GloVe; Social media X

Abstract

In the ongoing digital era, social media, particularly the social media X, formerly known as Twitter, has become one of the main platforms for sharing public opinions. On the social media, users have the opportunity to express their sentiments or views, including those regarding the presidential election in Indonesia. The main problem in this study is the extent to which public opinion on presidential candidates is reflected in conversations on the social media X. This study involves the combination of Support Vector Machine (SVM) and GloVe Word Embedding algorithms to improve the accuracy of sentiment analysis towards presidential candidates. The performance of the method will be evaluated using a confusion matrix. The results of the study show that while GloVe has the ability to capture global semantic relationships, TF-IDF is more effective in identifying variations and nuances in diverse sentiment data. Therefore, TF-IDF can be a more effective choice for political sentiment analysis in Indonesia, providing more consistent and accurate results. It is seen on the Anies dataset, TF-IDF achieved an accuracy of 0.84 compared to GloVe's 0.82. For the Ganjar dataset, TF-IDF performed better in terms of F1-Score and precision. For the Prabowo dataset, TF-IDF slightly outperformed GloVe in recall, although both techniques had nearly equal high accuracy around 0.93.

Keywords: Presidential Candidates; 2024 Elections; SVM; GloVe; Social media X

Downloads

Download data is not yet available.

References

S. S. H. T. A. Putri Nardilasari A. Lia Hananto and B. Priyatna, “Analisis Sentimen Calon Presiden 2024 Menggunakan Algoritma SVM Pada Media Sosial Twitter,” J. Inf. Technol. Comput. Sci., vol. 8, no. 1, pp. 11–18, 2023, doi: 10.31328/jointecs.v8i1.4265.

E. Cano-Marin, S. Sánchez-Alonso, and M. Mora-Cantallops, “Unleashing Competitive Intelligence: News Mining Analysis on Technology Trends and Digital Health Driving Healthcare Innovation,” IEEE Trans. Eng. Manag., pp. 1–15, Nov. 2023, doi: 10.1109/TEM.2023.3326233.

A. S. Y. I. Salim Puad Garno, “Analisis Sentimen Masyarakat Pada Twitter Terhadap Pemilihan Umum 2024 Menggunakan Algoritma Naïve Bayes,” JATI (Jurnal Mhs. Tek. Inform., vol. 7, no. 3, Jun. 2023, [Online]. Available: https://student.unsika.ac.id

W. Kurnia, “Sentimen Analisis Aplikasi E-Commerce Berdasarkan Ulasan Pengguna Menggunakan Algoritma Stochastic Gradient Descent,” vol. 4, no. 1, pp. 138–143, 2023, doi: 10.33365/jtsi.v4i2.2561.

C. F. Hasri and D. Alita, “Penerapan Metode Naïve Bayes Classifier dan Support Vector Machine Pada Analisis Sentimen Terhadap Dampak Virus Corona di Twitter,” J. Inform. dan Rekayasa Perangkat Lunak, vol. 3, no. 2, pp. 145–160, 2022, doi: 10.33365/jatika.v3i2.2026.

Herwinsyah and A. Witanti, “Analisis Sentimen Masyarakat Terhadap Vaksinasi COVID-19 Pada Media Sosial Twitter Menggunakan Algoritma Support Vector Machine (SVM),” J. Sist. Inf. dan Inform., vol. 5, pp. 2622–6901, 2022, doi: 10.47080/simika.v5i1.1411.

S. Khairunnisa A. Adiwijaya and S. Al Faraby, “Pengaruh Text Preprocessing Terhadap Analisis Sentimen Komentar Masyarakat Pada Media Sosial Twitter (Studi Kasus Pandemi COVID-19),” J. Media Inform. Budidarma, vol. 5, no. 2, p. 406, Apr. 2021, doi: 10.30865/mib.v5i2.2835.

P. Shah S. Shah and S. Joshi, “A Study of Various Word Embeddings in Deep Learning,” 2022. doi: 10.1109/INCET54531.2022.9824963.

L. Xiaoyan R. C. Raga and S. Xuemei, “GloVe-CNN-BiLSTM Model for Sentiment Analysis on Text Reviews,” J. Sensors, 2022, doi: 10.1155/2022/7212366.

J. Ravi and S. Kulkarni, “Text Embedding Techniques for Efficient Clustering of Twitter Data,” Evol. Intell., vol. 16, no. 5, pp. 1667–1677, Oct. 2023, doi: 10.1007/s12065-023-00825-3.

M. Sahbuddin and S. Agustian, “Support Vector Machine Method with Word2vec for Covid-19 Vaccine Sentiment Classification on Twitter,” J. Informatics Telecommun. Eng., vol. 6, no. 1, pp. 288–297, Jul. 2022, doi: 10.31289/jite.v6i1.7534.

A. Azzawagama Firdaus A. Yudhana and I. Riadi, “Analisis Sentimen Pada Proyeksi Pemilihan Presiden 2024 Menggunakan Metode Support Vector Machine,” Decod. J. Pendidik. Teknol. Inf., vol. 3, no. 2, pp. 236–245, Jun. 2023, doi: 10.51454/decode.v3i2.172.

B. Liu, Sentiment Analysis: Mining Opinions, Sentiments, and Emotions, 2nd ed. Cambridge University Press, 2016.

M.T. Thai W. Wu and H. Xiong, Big Data in Complex and Social Networks. CRC Press, Taylor & Francis Group, 2016.

A. M. M. P. M. S. E. Baccarelli N. Cordeschi and J. Stefa, “Energy-efficient dynamic traffic offloading and reconfiguration of networked data centers for big data stream mobile computing: review, challenges, and a case study,” IEEE Netw., vol. 30, no. 2, pp. 54–61, 2016, doi: 10.1109/MNET.2016.7437025.

W. Fan and M. D. Gordon, “The Power of Social Media Analytics,” Commun. ACM, vol. 57, no. 6, pp. 74–81, 2014.

M. Tsikerdekis and S. Zeadally, “Online deception in social media,” Commun. ACM, vol. 57, no. 9, pp. 72–80, 2014.

E. M. F. A. Pozzi E. Fersini and B. Liu, Sentiment Analysis in Social Networks. Morgan Kaufmann, 2016.

F. N. I. Alfina D. Sigmawaty and A. N. Hidayanto, “Utilizing Hashtags for Sentiment Analysis of Tweets in The Political Domain,” in Proceedings of the 9th International Conference on Machine Learning and Computing (ICMLC ’17), Feb. 2017, pp. 43–47. doi: 10.1145/3055635.3056631.

Y. Didi A. Walha and A. Wali, “COVID-19 Tweets Classification Based on a Hybrid Word Embedding Method,” Big Data Cogn. Comput., vol. 6, no. 2, p. 58, 2022, doi: 10.3390/bdcc6020058.


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Comparison of TF-IDF and GloVe Word Embedding for Sentiment Analysis of 2024 Presidential Candidates

Dimensions Badge
Article History
Submitted: 2024-07-31
Published: 2024-09-11
Abstract View: 47 times
PDF Download: 35 times
How to Cite
Abdillah, T., Hasmawati, H., & Bunyamin, B. (2024). Comparison of TF-IDF and GloVe Word Embedding for Sentiment Analysis of 2024 Presidential Candidates. Building of Informatics, Technology and Science (BITS), 6(2), 961-969. https://doi.org/10.47065/bits.v6i2.5668
Section
Articles