Twitter Sentiment Analysis of Kanjuruhan Disaster using Word2Vec and Support Vector Machine
Abstract
The Kanjuruhan disaster on 1 October 2022 drew wide public attention. People shared their thoughts on social media, and their posts reflect a variety of perspectives, which makes them a suitable dataset for sentiment analysis. This final project applies the supervised learning method Support Vector Machine (SVM), with feature expansion using Word2Vec and TF-IDF as the weighting scheme. Three SVM kernels (RBF, linear, and polynomial) are applied. Each kernel is trained with three data split ratios and two types of training data: with oversampling and without oversampling. The best result was obtained with the RBF kernel, a 70:30 split ratio, and oversampling. Models trained with oversampling were also relatively stable across every split ratio and kernel, with no significant differences between them.
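The experimental design described in the abstract (three kernels, three split ratios, training with and without oversampling) can be sketched as a small grid. This is a minimal illustration under stated assumptions, not the authors' implementation: only the 70:30 ratio is named in the abstract, so the other two ratios are placeholders; the oversampling method is not specified, so plain random oversampling is used as a stand-in; the kernel identifiers and all function names are hypothetical; and the toy data is invented. The classifier itself is omitted, since in practice it would come from an SVM library.

```python
import random
from collections import Counter
from itertools import product

# Kernel labels are illustrative identifiers for RBF, linear, and polynomial.
KERNELS = ["rbf", "linear", "poly"]
# Only 70:30 appears in the abstract; 80:20 and 90:10 are assumed placeholders.
TEST_FRACTIONS = [0.3, 0.2, 0.1]
OVERSAMPLE = [False, True]


def split(samples, test_fraction, seed=0):
    """Shuffle a copy of the samples and return (train, test)."""
    rng = random.Random(seed)
    shuffled = samples[:]
    rng.shuffle(shuffled)
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


def random_oversample(train, seed=0):
    """Duplicate random samples of smaller classes until all classes
    match the size of the largest one (a stand-in technique)."""
    rng = random.Random(seed)
    by_label = {}
    for text, label in train:
        by_label.setdefault(label, []).append((text, label))
    target = max(len(group) for group in by_label.values())
    balanced = []
    for group in by_label.values():
        balanced.extend(group)
        balanced.extend(rng.choices(group, k=target - len(group)))
    return balanced


def experiment_grid():
    """Every (kernel, test fraction, oversampling flag) combination."""
    return list(product(KERNELS, TEST_FRACTIONS, OVERSAMPLE))


# Invented, imbalanced toy data: 80 negative and 20 positive posts.
data = [(f"post {i}", "negative") for i in range(80)]
data += [(f"post {i}", "positive") for i in range(80, 100)]

train, test = split(data, test_fraction=0.3)
balanced = random_oversample(train)  # applied to the training portion only

print(len(experiment_grid()), "configurations")
print("train label counts:", Counter(label for _, label in train))
print("balanced label counts:", Counter(label for _, label in balanced))
```

In this sketch, oversampling is applied after the split and only to the training portion, so duplicated samples never leak into the held-out test set.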
Pages: 219−227
Copyright (c) 2023 Fariz Muhammad Rizky, Jondri, Kemas Muslim Lhaksmana

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).





















