Comparison of TF-IDF and GloVe Word Embedding for Sentiment Analysis of 2024 Presidential Candidates
Abstract
In the current digital era, social media, and in particular X (formerly known as Twitter), has become one of the main platforms for sharing public opinion. On X, users can express their sentiments and views, including those regarding the presidential election in Indonesia. The main question in this study is the extent to which public opinion on the presidential candidates is reflected in conversations on X. The study combines the Support Vector Machine (SVM) algorithm with GloVe word embeddings to improve the accuracy of sentiment analysis of the presidential candidates, and compares GloVe against TF-IDF as the text representation. Performance is evaluated using a confusion matrix. The results show that, while GloVe can capture global semantic relationships, TF-IDF is more effective at identifying variations and nuances in diverse sentiment data. On the Anies dataset, TF-IDF achieved an accuracy of 0.84 compared to GloVe's 0.82. On the Ganjar dataset, TF-IDF performed better in terms of F1-score and precision. On the Prabowo dataset, TF-IDF slightly outperformed GloVe in recall, although both techniques reached nearly equal accuracy of around 0.93. TF-IDF can therefore be the more effective choice for political sentiment analysis in Indonesia, providing more consistent and accurate results.
Keywords: Presidential Candidates; 2024 Elections; SVM; GloVe; Social media X
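To make the comparison in the abstract concrete, the following is a minimal sketch, not the authors' implementation, of the two kinds of text representation and of the metrics read off a confusion matrix. It assumes a plain tf × log(N/df) weighting for TF-IDF and a simple average of word vectors for the GloVe-style representation. The function names, the toy documents, the two-dimensional embedding table, and the confusion-matrix counts are all hypothetical and chosen only for illustration; they are not taken from the study.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one {term: weight} dict per tokenized document.

    Assumed weighting: (term count / document length) * log(N / df).
    """
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        vectors.append({
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return vectors


def mean_embedding(tokens, embeddings, dim):
    """Average the vectors of known tokens; all zeros if none are known."""
    known = [embeddings[t] for t in tokens if t in embeddings]
    if not known:
        return [0.0] * dim
    return [sum(v[i] for v in known) / len(known) for i in range(dim)]


def binary_metrics(tp, fp, fn, tn):
    """Accuracy, precision, recall and F1 from a 2x2 confusion matrix."""
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Hypothetical toy inputs, for illustration only.
toy_docs = [["good", "leader"], ["bad", "leader"], ["good", "good", "plan"]]
toy_embeddings = {"good": [1.0, 0.0], "bad": [-1.0, 0.0]}

tfidf = tfidf_vectors(toy_docs)
dense = mean_embedding(["good", "bad"], toy_embeddings, dim=2)
metrics = binary_metrics(tp=42, fp=8, fn=8, tn=42)
```

In this sketch, TF-IDF gives each document a sparse vector in which rarer terms weigh more, whereas the averaged embedding collapses a document into one dense vector, so opposing words such as "good" and "bad" can cancel out. Either representation could then be passed to an SVM classifier, and the classifier's predictions summarized with `binary_metrics`.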
Pages: 961-969
Copyright (c) 2024 Tiara Firdausa Abdillah
This work is licensed under a Creative Commons Attribution 4.0 International License.