Performance Analysis of the NBC, DT, and SVM Algorithms in Classifying Borobudur Temple Visitor Review Data Based on CRISP-DM


Keywords: Naive Bayes Classifier; Decision Tree; Support Vector Machine; Sentiment Analysis

Abstract

Visitor sentiment toward the Borobudur Temple tourist destination in Indonesia can be classified with various algorithms, and comparing them helps identify the one that performs best. Algorithm performance can be assessed from the confusion-matrix metrics (accuracy, precision, recall), the Receiver Operating Characteristic (ROC) curve, and the Area Under the Curve (AUC). This study applied the Naïve Bayes Classifier (NBC), Decision Tree (DT), and Support Vector Machine (SVM) algorithms to 3850 text records of Borobudur Temple visitor reviews obtained from the Tripadvisor website. The method follows the Cross-Industry Standard Process for Data Mining (CRISP-DM), applied to optimizing tourist destination products and services, and its six stages: business understanding, data understanding, data preparation, modeling, evaluation, and deployment. For NBC, accuracy changed from 98.73% to 95.6%, precision from 98.72% to 98.97%, and recall from 100% to 96.54%, while the AUC changed from 0.500 (50%) to 0.693 (69.35%). For DT, accuracy changed from 97.55% to 94.40%, precision from 97.63% to 91.86%, and recall from 99.90% to 99.47%, while the AUC changed from 0.591 (59.1%) to 0.932 (93.2%). For SVM, accuracy changed from 98.73% to 99.41%, precision from 98.72% to 100%, and recall from 100% to 99.01%, while the AUC changed from 0.961 (96.1%) to 1.00 (100%).
In addition, the t-test results favor SVM over the other algorithms: the t-test value is 0.994 for SVM, compared with 0.944 for DT and 0.98 for NBC. The ROC results show that DT also performs well alongside SVM. These findings indicate that the Support Vector Machine is the recommended algorithm for analyzing the sentiment of visitors to Borobudur Temple.
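
The evaluation metrics named in the abstract can be illustrated with a minimal Python sketch. It is not the study's implementation: the function names, the toy labels, and the toy scores below are assumptions made for illustration, and the abstract does not state which tooling was used. The sketch derives accuracy, precision, and recall from confusion-matrix counts and computes AUC as the probability that a randomly chosen positive review receives a higher score than a randomly chosen negative one.

```python
from typing import Sequence, Tuple


def confusion_counts(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[int, int, int, int]:
    """Return (tp, fp, fn, tn) for binary labels, where 1 = positive sentiment."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    return tp, fp, fn, tn


def accuracy(tp: int, fp: int, fn: int, tn: int) -> float:
    """Share of all reviews that were classified correctly."""
    return (tp + tn) / (tp + fp + fn + tn)


def precision(tp: int, fp: int) -> float:
    """Share of predicted positives that are truly positive."""
    return tp / (tp + fp) if (tp + fp) else 0.0


def recall(tp: int, fn: int) -> float:
    """Share of true positives that the classifier found."""
    return tp / (tp + fn) if (tp + fn) else 0.0


def auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Tied scores count as half a correct pair.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative example")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    # Hypothetical toy data, not taken from the study.
    y_true = [1, 1, 1, 1, 0, 0]
    y_pred = [1, 1, 1, 0, 1, 0]
    scores = [0.9, 0.8, 0.7, 0.4, 0.6, 0.2]

    tp, fp, fn, tn = confusion_counts(y_true, y_pred)
    print("accuracy :", accuracy(tp, fp, fn, tn))
    print("precision:", precision(tp, fp))
    print("recall   :", recall(tp, fn))
    print("AUC      :", auc(y_true, scores))
```

One property worth noting in this sketch: a classifier that assigns the same score to every review gets an AUC of exactly 0.5 regardless of its accuracy, which is why AUC is commonly reported alongside the confusion-matrix metrics on data where one class dominates.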

Article History
Submitted: 2022-12-26
Published: 2022-12-30
How to Cite
Singgalen, Y. (2022). Analisis Performa Algoritma NBC, DT, SVM dalam Klasifikasi Data Ulasan Pengunjung Candi Borobudur Berbasis CRISP-DM. Building of Informatics, Technology and Science (BITS), 4(3), 1634−1646. https://doi.org/10.47065/bits.v4i3.2766