Applying the Naïve Bayes Algorithm to the Sentiment of Skincare Product Reviews on the Shopee E-Commerce Platform


  • Divana Wahyu Putri (*), Universitas Dian Nuswantoro, Semarang, Indonesia
  • Moch Arief Soeleman, Universitas Dian Nuswantoro, Semarang, Indonesia
  • (*) Corresponding Author
Keywords: Sentiment Analysis; Skincare; Naïve Bayes; TF-IDF; N-Gram; Confusion Matrix

Abstract

The rapid growth of the beauty industry has generated a large volume of consumer reviews, creating a need for automated processing to understand public sentiment. This study applies sentiment analysis to skincare product reviews using the Multinomial Naïve Bayes algorithm. Labels were derived from star ratings: ratings of 4 and 5 were labeled positive, ratings of 1 and 2 were labeled negative, and rating 3 was excluded to avoid ambiguity. Features were represented with TF-IDF over N-grams (unigrams and bigrams), yielding 10,000 features from a dataset of 8,646 reviews. On a test set of 1,730 reviews, the model achieved an accuracy of 70%. The confusion matrix showed that the model performed very well on the positive class, reaching a recall of 1.00, but struggled to classify the negative and neutral classes, where recall approached 0.00. This was attributed to an imbalanced class distribution in which positive reviews heavily dominated the dataset. Nevertheless, Multinomial Naïve Bayes proved efficient at handling large-scale, frequency-based text features. The weighted-average F1-score of 0.58 suggests that dataset optimization is required before the model can accurately recognize minority sentiments.
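The labeling rule and the gap between accuracy and weighted F1 described in the abstract can be illustrated with a minimal sketch in plain Python. This is not the authors' code: the function names (`rating_to_label`, `per_class_scores`, `weighted_f1`) and the eleven toy ratings are hypothetical, chosen only to show how a classifier that always predicts the majority class gets a recall of 1.00 on the positive class, 0.00 on the minority class, and a weighted F1 below its accuracy.

```python
def rating_to_label(rating):
    """Map a 1-5 star rating to a sentiment label; rating 3 is excluded (None)."""
    if rating in (4, 5):
        return "positive"
    if rating in (1, 2):
        return "negative"
    if rating == 3:
        return None
    raise ValueError(f"rating must be an integer from 1 to 5, got {rating!r}")


def per_class_scores(y_true, y_pred):
    """Return {label: (precision, recall, f1, support)} for each true label."""
    scores = {}
    for label in sorted(set(y_true)):
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        f1 = (2 * precision * recall / (precision + recall)
              if precision + recall else 0.0)
        scores[label] = (precision, recall, f1, tp + fn)
    return scores


def weighted_f1(scores):
    """Average the per-class F1 values, weighted by class support."""
    total = sum(support for _, _, _, support in scores.values())
    return sum(f1 * support for _, _, f1, support in scores.values()) / total


# Hypothetical toy ratings (not the study's data): 7 positive, 3 negative, one 3-star.
ratings = [5, 4, 5, 5, 4, 5, 4, 1, 2, 1, 3]
y_true = [lab for lab in map(rating_to_label, ratings) if lab is not None]

# A degenerate classifier that always predicts the majority class.
y_pred = ["positive"] * len(y_true)

accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
scores = per_class_scores(y_true, y_pred)

print(f"accuracy      = {accuracy:.2f}")
print(f"recall (pos)  = {scores['positive'][1]:.2f}")
print(f"recall (neg)  = {scores['negative'][1]:.2f}")
print(f"weighted F1   = {weighted_f1(scores):.2f}")
```

With a 70/30 split, always predicting "positive" gives an accuracy of 0.70 and a weighted F1 of about 0.58 on this toy data. The sketch shows only that such figures can arise from majority-class prediction under imbalance; it does not reproduce the study's experiment.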


Article History
Submitted: 2026-01-15
Published: 2026-03-06
How to Cite
Putri, D., & Soeleman, M. (2026). Penerapan Algoritma Naïve Bayes Terhadap Sentimen Ulasan Produk Skincare Pada E-Commerce Shopee. Building of Informatics, Technology and Science (BITS), 7(4), 2218-2228. https://doi.org/10.47065/bits.v7i4.9209