Klasifikasi Sentimen Untuk Mengetahui Kecenderungan Politik Pengguna X Pada Calon Presiden Indonesia 2024 Menggunakan Metode IndoBert


  • Indro Abri Oktariansyah * Mail Universitas Jenderal Achmad Yani, Indonesia
  • Fajri Rakhmat Umbara Universitas Jenderal Achmad Yani, Indonesia
  • Fatan Kasyidi Universitas Jenderal Achmad Yani, Indonesia
  • (*) Corresponding Author
Keywords: Election; Sentiment Classification; Augmentation; IndoBert

Abstract

X has evolved into one of the most popular social media platforms in the world. In Indonesia, the use of X is quite widespread, especially in discussions about the presidential election, which is currently a hot topic. Everyone has different views on the candidates, both positive and negative. With a large amount of tweet data from users, this information can serve as a data source for processing and analysis. Various methods can be used to analyze and classify sentiment from this data, one of which is using BERT. This research conducts sentiment classification using BERT with the IndoBert model. The research aims to classify sentiments towards tweets related to the 2024 Indonesian presidential election to understand the political inclinations of X users, evaluate the performance of the IndoBert model in sentiment classification, and assess the extent to which back translation augmentation and synonym augmentation techniques can enhance the model's performance. Data was collected using crawling techniques for seven days leading up to the election and manually labeled by annotators. Synonym augmentation and back translation techniques were used to balance data in minority classes. The data was divided into 80% training data, 10% test data, and 10% validation data. The classification process was conducted using the IndoBert model that had been fine-tuned. The research results show that IndoBert with synonym augmentation achieved the highest accuracy, which was 82% in the first experiment and 81% in the second experiment. On the other hand, back translation only reached an accuracy of 78% in the first experiment and 74% in the second experiment. This indicates that synonym augmentation proved to be more effective in increasing data variation and model performance on the dataset used in this research.

Downloads

Download data is not yet available.

References

M. Mujahid et al., “Sentiment analysis and topic modeling on tweets about online education during covid-19,” Applied Sciences (Switzerland), vol. 11, no. 18, Sep. 2021, doi: 10.3390/app11188438.

A. F. Hidayatullah, S. Cahyaningtyas, and A. M. Hakim, “Sentiment Analysis on Twitter using Neural Network: Indonesian Presidential Election 2019 Dataset,” IOP Conf Ser Mater Sci Eng, vol. 1077, no. 1, p. 012001, Feb. 2021, doi: 10.1088/1757-899x/1077/1/012001.

G. A. BUNTORO, R. ARIFIN, G. N. SYAIFUDDIIN, A. SELAMAT, O. KREJCAR, and H. FUJITA, “Implementation of a Machine Learning Algorithm for Sentiment Analysis of Indonesia‘s 2019 Presidential Election,” IIUM Engineering Journal, vol. 22, no. 1, pp. 78–92, 2021, doi: 10.31436/IIUMEJ.V22I1.1532.

S. M. Isa, G. Nico, and M. Permana, “INDOBERT FOR INDONESIAN FAKE NEWS DETECTION,” ICIC Express Letters, vol. 16, no. 3, pp. 289–297, Mar. 2022, doi: 10.24507/icicel.16.03.289.

A. Holzinger, P. Kieseberg, A. M. Tjoa, and E. Weippl, Eds., Machine Learning and Knowledge Extraction, vol. 12279. in Lecture Notes in Computer Science, vol. 12279. Cham: Springer International Publishing, 2020. doi: 10.1007/978-3-030-57321-8.

A. N. Azizah, M. Falach Asy’ari, I. Wisma, D. Prastya, and D. Purwitasari, “EASY DATA AUGMENTATION UNTUK DATA YANG IMBALANCE PADA KONSULTASI KESEHATAN DARING,” Jurnal Teknologi Informasi dan Ilmu Komputer (JTIIK), vol. 10, no. 5, pp. 1095–1104, 2023, doi: 10.25126/jtiik.2023107082.

I. Athiyyah Rahma and L. Hulliyyatus Suadaa, “PENERAPAN TEXT AUGMENTATION UNTUK MENGATASI DATA YANG TIDAK SEIMBANG PADA KLASIFIKASI TEKS BERBAHASA INDONESIA STUDI KASUS: DETEKSI JUDUL CLICKBAIT DAN KOMENTAR HATE SPEECH PADA BERITA ONLINE,” Jurnal Teknologi Informasi dan Ilmu Komputer (JTIIK), 2023, doi: 10.25126/jtiik.2023107325.

A. Bello, S. C. Ng, and M. F. Leung, “A BERT Framework to Sentiment Analysis of Tweets,” Sensors, vol. 23, no. 1, Jan. 2023, doi: 10.3390/s23010506.

A. Roy and M. Ojha, “Twitter sentiment analysis using deep learning models,” in 2020 IEEE 17th India Council International Conference, INDICON 2020, Institute of Electrical and Electronics Engineers Inc., Dec. 2020. doi: 10.1109/INDICON49873.2020.9342279.

A. Singh, A. Kumar, N. Dua, V. K. Mishra, D. Singh, and A. Agrawal, “Predicting Elections Results using Social Media Activity A Case Study: USA Presidential Election 2020,” in 2021 7th International Conference on Advanced Computing and Communication Systems, ICACCS 2021, Institute of Electrical and Electronics Engineers Inc., Mar. 2021, pp. 314–319. doi: 10.1109/ICACCS51430.2021.9441835.

A. J. Nair, G. Veena, and A. Vinayak, “Comparative study of Twitter Sentiment on COVID - 19 Tweets,” in Proceedings - 5th International Conference on Computing Methodologies and Communication, ICCMC 2021, Institute of Electrical and Electronics Engineers Inc., Apr. 2021, pp. 1773–1778. doi: 10.1109/ICCMC51019.2021.9418320.

A. B. Y. A. Putra, Y. Sibaroni, and A. F. Ihsan, “Disinformation Detection on 2024 Indonesia Presidential Election using IndoBERT,” in 2023 International Conference on Data Science and Its Applications, ICoDSA 2023, Institute of Electrical and Electronics Engineers Inc., 2023, pp. 350–355. doi: 10.1109/ICoDSA58501.2023.10277572.

L. Geni, E. Yulianti, and D. I. Sensuse, “Sentiment Analysis of Tweets Before the 2024 Elections in Indonesia Using IndoBERT Language Models,” Jurnal Ilmiah Teknik Elektro Komputer dan Informatika (JITEKI), vol. 9, no. 3, pp. 746–757, 2023, doi: 10.26555/jiteki.v9i3.26490.

T. M. Fagbola and S. C. Thakur, “Lexicon-based Bot-aware Public Emotion Mining and Sentiment Analysis of the Nigerian 2019 Presidential Election on Twitter,” 2019. [Online]. Available: www.ijacsa.thesai.org

D. Fimoza, A. Amalia, and T. Henny Febriana Harumy, “Sentiment Analysis for Movie Review in Bahasa Indonesia Using BERT,” in 2021 International Conference on Data Science, Artificial Intelligence, and Business Analytics, DATABIA 2021 - Proceedings, Institute of Electrical and Electronics Engineers Inc., 2021, pp. 27–34. doi: 10.1109/DATABIA53375.2021.9650096.

M. Bucos and B. Drăgulescu, “Enhancing Fake News Detection in Romanian Using Transformer-Based Back Translation Augmentation,” Applied Sciences (Switzerland), vol. 13, no. 24, Dec. 2023, doi: 10.3390/app132413207.

F. Koto, A. Rahimi, J. H. Lau, and T. Baldwin, “IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP,” Nov. 2020, [Online]. Available: http://arxiv.org/abs/2011.00677

J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, “BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding,” May 2019, [Online]. Available: http://arxiv.org/abs/1810.04805

I. R. Hidayat and W. Maharani, “General Depression Detection Analysis Using IndoBERT Method,” International Journal on Information and Communication Technology (IJoICT), vol. 8, no. 1, pp. 41–51, Aug. 2022, doi: 10.21108/ijoict.v8i1.634.

G. A. Pradnyana, W. Anggraeni, E. M. Yuniarno, and M. H. Purnomo, “Fine-Tuning IndoBERT Model for Big Five Personality Prediction from Indonesian Social Media,” in 2023 International Seminar on Intelligent Technology and Its Applications: Leveraging Intelligent Systems to Achieve Sustainable Development Goals, ISITIA 2023 - Proceeding, Institute of Electrical and Electronics Engineers Inc., 2023, pp. 93–98. doi: 10.1109/ISITIA59021.2023.10221074.


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Klasifikasi Sentimen Untuk Mengetahui Kecenderungan Politik Pengguna X Pada Calon Presiden Indonesia 2024 Menggunakan Metode IndoBert

Dimensions Badge
Article History
Submitted: 2024-06-27
Published: 2024-09-07
Abstract View: 61 times
PDF Download: 48 times
How to Cite
Oktariansyah, I., Umbara, F., & Kasyidi, F. (2024). Klasifikasi Sentimen Untuk Mengetahui Kecenderungan Politik Pengguna X Pada Calon Presiden Indonesia 2024 Menggunakan Metode IndoBert. Building of Informatics, Technology and Science (BITS), 6(2), 636-648. https://doi.org/10.47065/bits.v6i2.5435
Section
Articles