Twitter Sentiment Analysis of Kanjuruhan Disaster using Word2Vec and Support Vector Machine


  • Fariz Muhammad Rizky * Mail Telkom University, Bandung, Indonesia
  • Jondri Jondri Telkom University, Bandung, Indonesia
  • Kemas Muslim Lhaksmana Telkom University, Bandung, Indonesia
  • (*) Corresponding Author
Keywords: Football; Kanjuruhan Disaster; Machine Learning; Twitter; Sentiment Analysis

Abstract

The Kanjuruhan disaster on 1 October 2022, gained the peoples attention. People share their thoughts on social media. Their posts contain a variety of perspectives. Sentiment analysis is possible to use on a dataset of people's posts. This final project applies the supervised learning Support Vector Machine (SVM) method with feature expansion using Word2Vec and TF-IDF as weighting. Three SVM kernels—rbf, linear, and polynomial—are applied. Three split data techniques and two different types of training data are used to train each kernel. Training data with oversampling and training data without oversampling are the two types of training data. The best result gained from using rbf kernel, split ratio 70:30, and oversampling. From it, oversampling trained model have relatively stable in every split rasio and kernel without having significant difference.

Downloads

Download data is not yet available.

References

V. Febrianto, “Death count in Kanjuruhan tragedy climbs to 135,” ANTARA News, Oct. 24, 2022. Accessed: Jan. 20, 2023. [Online]. Available: https://en.antaranews.com/news/256465/death-count-in-kanjuruhan-tragedy-climbs-to-135

Hermanto, A. Y. Kuntoro, T. Asra, E. B. Pratama, L. Effendi, and R. Ocanitra, “Gojek and Grab User Sentiment Analysis on Google Play Using Naive Bayes Algorithm And Support Vector Machine Based Smote Technique,” J Phys Conf Ser, vol. 1641, p. 012102, 2020, doi: 10.1088/1742-6596/1641/1/012102.

M. Guia, R. Silva, and J. Bernardino, “Comparison of Naïve Bayes, Support Vector Machine, Decision Trees and Random Forest on Sentiment Analysis,” Proceedings of the 11th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, 2019, doi: 10.5220/0008364105250531.

S. Al-Saqqa and A. Awajan, “The Use of Word2vec Model in Sentiment Analysis,” Proceedings of the 2019 International Conference on Artificial Intelligence, Robotics and Control, pp. 39–43, 2019, doi: 10.1145/3388218.3388229.

M. Matuq Ashi, M. Ahmed Siddiqui, and F. Nadeem, “Pre-trained Word Embeddings for Arabic Aspect-Based Sentiment Analysis of Airline Tweets,” in The International Conference on Advanced Intelligent Systems and Informatics, A. E. Hassanien, M. F. Tolba, K. Shaalan, and A. T. Azar, Eds., Cairo: Springer International Publishing, 2018. doi: 10.1007/978-3-319-99010-1.

M. Sahbuddin and S. Agustian, “View of Support Vector Machine Method with Word2vec for Covid-19 Vaccine Sentiment Classification on Twitter,” Journal of Informatics and Telecommunication Engineering, vol. 6, no. 1, 2022, doi: 10.31289/jite.v6i1.7534.

J. Xue, X. Ban, H. Guo, and X. Zhu, “Sentiment Analysis Based on Weibo Comments,” in 2018 13th World Congress on Intelligent Control and Automation (WCICA), Changsa, China: IEEE, Jul. 2018, pp. 1166–1171. doi: 10.1109/WCICA.2018.8630471.

Naufal Adi Nugroho and Erwin Budi Setiawan, “Implementation Word2Vec for Feature Expansion in Twitter Sentiment Analysis,” Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi), vol. 5, no. 5, pp. 837–842, Oct. 2021, doi: 10.29207/resti.v5i5.3325.

M. A. Fauzi, “Word2Vec model for sentiment analysis of product reviews in Indonesian language,” International Journal of Electrical and Computer Engineering (IJECE), vol. 9, no. 1, p. 525, Feb. 2019, doi: 10.11591/ijece.v9i1.pp525-530.

M. Rizki, “Analisis Sentimen Masyarakat Terhadap Vaksin COVID-19 Menggunakan Metode Support Vector Machine Pada Media Sosial Twitter,” Uin-suska.ac.id, 2022, [Online]. Available: http://repository.uin-suska.ac.id/58497/

R. Nooraeni, H. D. Sariyanti, A. F. F. Iskandar, S. F. Munawwaroh, S. Pertiwi, and Y. Ronaldias, “Analisis Sentimen Data Twitter Mengenai Isu RUU KPK Dengan Metode Support Vector Machine (SVM),” Paradigma - Jurnal Komputer dan Informatika, vol. 22, no. 1, pp. 55–60, Mar. 2020, doi: 10.31294/p.v22i1.6869.

“Rekor Kematian Kedua di Dunia, Tragedi Kanjuruhan Lampaui Hillsborough,” CNN Indonesia, Oct. 02, 2022. https://www.cnnindonesia.com/olahraga/20221002070354-142-855202/rekor-kematian-kedua-di-dunia-tragedi-kanjuruhan-lampaui-hillsborough (accessed Jan. 20, 2023).

L. Bing, Sentiment Analysis Mining Opinions, Sentiments, and Emotions, 2nd ed. Cambridge University Press, 2020. doi: 10.1017/9781108639286.002.

D. Sharma, M. Sabharwal, V. Goyal, and M. Vij, “Sentiment Analysis Techniques for Social Media Data: A Review,” 2020, pp. 75–90. doi: 10.1007/978-981-15-0029-9_7.

Suyanto, Data Mining untuk Klasifikasi dan Klasterisasi Data. Bandung: Informatika Bandung, 2017. Accessed: Jan. 25, 2023. [Online]. Available: https://opac.perpusnas.go.id/DetailOpac.aspx?id=1069411#

B. Santosa, “Tutorial Support Vector Machines,” Surabaya, 2015.

S.-W. Kim and J.-M. Gil, “Research paper classification systems based on TF-IDF and LDA schemes,” Human-centric Computing and Information Sciences, vol. 9, no. 1, p. 30, Dec. 2019, doi: 10.1186/s13673-019-0192-7.

X. Huang, C.-Z. Zhang, and J. Yuan, “Predicting Extreme Financial Risks on Imbalanced Dataset: A Combined Kernel FCM and Kernel SMOTE Based SVM Classifier,” Comput Econ, vol. 56, no. 1, pp. 187–216, Jun. 2020, doi: 10.1007/s10614-020-09975-3.

D. Chicco and G. Jurman, “The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation,” BMC Genomics, vol. 21, no. 1, p. 6, Dec. 2020, doi: 10.1186/s12864-019-6413-7.

S. Pradha, M. N. Halgamuge, and N. Tran Quoc Vinh, “Effective Text Data Preprocessing Technique for Sentiment Analysis in Social Media Data,” in 2019 11th International Conference on Knowledge and Systems Engineering (KSE), IEEE, Oct. 2019, pp. 1–8. doi: 10.1109/KSE.2019.8919368.


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Twitter Sentiment Analysis of Kanjuruhan Disaster using Word2Vec and Support Vector Machine

Dimensions Badge
Article History
Submitted: 2023-06-11
Published: 2023-06-29
Abstract View: 1294 times
PDF Download: 569 times
How to Cite
Rizky, F., Jondri, J., & Lhaksmana, K. (2023). Twitter Sentiment Analysis of Kanjuruhan Disaster using Word2Vec and Support Vector Machine. Building of Informatics, Technology and Science (BITS), 5(1), 219−227. https://doi.org/10.47065/bits.v5i1.3612
Issue
Section
Articles

Most read articles by the same author(s)

1 2 > >>