Cyberbullying Detection on Twitter using Support Vector Machine Classification Method


  • Ni Luh Putu Mawar Silveria Putri Waisnawa * Mail Universitas Telkom, Bandung, Indonesia
  • Dade Nurjanah Universitas Telkom, Bandung, Indonesia
  • Hani Nurrahmi Universitas Telkom, Bandung, Indonesia
  • (*) Corresponding Author
Keywords: Cyberbullying; Twitter; Support Vector Machine; N-gram; TF-IDF

Abstract

Bullying is when someone or a group of individuals is continuously attacked. Because of the advancement of the internet, it has become very easy for society to engage in harmful acts of bullying by attacking a person or group of people who can hurt the victim, this is known as cyberbullying. Twitter is a social media platform that may be used by the society to share information and can also be used to perpetrate cyberbullying actions by sending messages (tweets) that addressed to the victims. This final project was developing a system to detect cyberbullying on Twitter. The system uses the Support Vector Machine method to classify whether the tweets that are shared include cyberbullying or not. In addition, this research also uses Term Frequency-Inverse Document Frequency (TF-IDF) and N-gram feature extraction for data that has gone through the pre-processing stage. In collecting data, the author crawled tweets based on the keywords 'jelek', 'bodoh', 'goblok', 'brengsek', 'bangsat', 'memalukan', 'laknat', 'bacot' and 'pelacur'. The best performance results of the research is 76.2% accuracy, 73.2% precision, 78.2% recall and 75.6% F1-Score generated by the RBF kernel with a total of n=1

Downloads

Download data is not yet available.

References

M. Rifauddin, “Fenomena Cyberbullying pada Remaja,” Khizanah Al-Hikmah J. Ilmu Perpust. Inf. Dan Kearsipan, vol. 4, no. 1, pp. 35–44, Jun. 2016, doi: 10.24252/kah.v4i1a3.

R. M. Huda, “Indonesia Pengguna Twitter Terbesar Ketiga Dunia,” Setara.net, Jun. 11, 2017. https://setara.net/indonesia-pengguna-twitter-terbesar-ketiga-dunia/

R. R. Dalvi, S. Baliram Chavan, and A. Halbe, “Detecting A Twitter Cyberbullying Using Machine Learning,” in 2020 4th International Conference on Intelligent Computing and Control Systems (ICICCS), Madurai, India, May 2020, pp. 297–301. doi: 10.1109/ICICCS48265.2020.9120893.

V. S. Chavan and Shylaja S S, “Machine learning approach for detection of cyber-aggressive comments by peers on social media network,” in 2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI), Kochi, India, Aug. 2015, pp. 2354–2358. doi: 10.1109/ICACCI.2015.7275970.

R. M. Kamal, “Analisis Sentimen Cyberbullying Pada Komentar Facebook Dengan Metode Klasifikasi Support Vector Machine,” Universitas Komputer Indonesia, 2019.

M. Sintaha and M. Mostakim, “An Empirical Study and Analysis of the Machine Learning Algorithms Used in Detecting Cyberbullying in Social Media,” in 2018 21st International Conference of Computer and Information Technology (ICCIT), Dhaka, Bangladesh, Dec. 2018, pp. 1–6. doi: 10.1109/ICCITECHN.2018.8631958.

Noviantho, S. M. Isa and L. Ashianti, "Cyberbullying classification using text mining," 2017 1st International Conference on Informatics and Computational Sciences (ICICoS), 2017, pp. 241-246, doi: 10.1109/ICICOS.2017.8276369.

A. Rachmat and Y. Lukito, “Sentipol: Dataset Sentimen Komentar Pada Kampanye Pemilu Presiden Indonesia 2014 Dari Facebook Page,” Konf. Nas. Teknol. Inf. Dan Komun. 2017, pp. 218–228, 2016.

M. Bramer, Clustering. Springer, 2007.

R. S. Putra, “Klasifikasi dokumen menurut bahasa berbasis N-Gram,” 2018.

C. P. Medina and M. R. R. Ramon, “Using TF-IDF to Determine Word Relevance in Document Queries Juan,” New Educ. Rev., vol. 42, no. 4, pp. 40– 51, 2015.

R. Melita, “Penerapan Metode Term Frequency Inverse Document Frequency (Tf-Idf) Dan Cosine Similarity Pada Sistem Temu Kembali Informasi Untuk Mengetahui Syarah Hadits Berbasis Web (Studi Kasus: Hadits Shahih Bukhari-Muslim),” Fakultas Sains dan Teknologi UIN Syarif Hidayatullah Jakarta, 2018.

Paramita, “Penerapan Support Vector Machine untuk Ekstraksi Informasi dari Dokumen Teks,” Jun. 2008.

A. Kesumawati, “Perbandingan Metode Support Vector Machine (SVM) Linear, Radial Basis Function (RBF), dan Polinomial Kernel dalam Klasifikasi Bidang Studi Lanjut Pilihan Alumni UII,” 2018.

S. Naz, A. Sharan, and N. Malik, “Sentiment classification on twitter data using support vector machine,” in 2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI), 2018, pp. 676–679.


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Cyberbullying Detection on Twitter using Support Vector Machine Classification Method

Dimensions Badge
Article History
Submitted: 2022-03-21
Published: 2022-03-31
Abstract View: 925 times
PDF Download: 738 times
How to Cite
Putri Waisnawa, N. L. P., Nurjanah, D., & Nurrahmi, H. (2022). Cyberbullying Detection on Twitter using Support Vector Machine Classification Method. Building of Informatics, Technology and Science (BITS), 3(4), 661−666. https://doi.org/10.47065/bits.v3i4.1435
Issue
Section
Articles