Klasifikasi Opini Pengguna TikTok terhadap Keamanan dan Efektivitas Produk Skincare Lokal menggunakan Metode Naïve Bayes, Decision Tree, dan Random Fores

Sintia Ariyani; Styawati Styawati

doi:10.47065/bits.v7i4.9537

Sintia Ariyani Universitas Teknokrat Indonesia, Bandar Lampung, Indonesia
Styawati Styawati * Universitas Teknokrat Indonesia, Bandar Lampung, Indonesia

(*) Corresponding Author

DOI: https://doi.org/10.47065/bits.v7i4.9537

Keywords: TikTok; Local Skincare; Sentiment Analysis; Machine Learning; Random Forest; Naïve Bayes; Decision Tree; SMOTE

Abstract

This study aims to analyze and compare the performance of Naïve Bayes, Decision Tree, and Random Forest algorithms in classifying TikTok users’ opinions regarding the safety and effectiveness of local skincare products. The results show that these algorithms exhibit significant differences in performance for sentiment classification tasks. Before applying SMOTE, Random Forest achieved the highest accuracy of 87%, followed by Decision Tree at 79% and Naïve Bayes at 65%. The main weakness was observed in minority classes such as Safe and Unsafe, which had low recall values. After applying SMOTE, all models showed improved performance, particularly in recognizing minority classes, resulting in more balanced accuracy, precision, recall, and F1-score across all sentiment categories. The TF-IDF analysis revealed that the extracted features were still dominated by common words and brand names, indicating that they did not fully represent the specific aspects of safety and effectiveness. This suggests that the preprocessing and feature selection stages can be further improved to generate more relevant feature representations. The classification visualization showed that most comments were categorized as Effective and Ineffective, while the Neutral category contained fewer instances. The implementation of SMOTE improved model performance in handling imbalanced data; however, it must be applied carefully only to the training data to avoid evaluation bias. Overall, Random Forest demonstrated the best performance among the evaluated algorithms. This study contributes to the development of a multi-class sentiment analysis model capable of distinguishing between safety and effectiveness aspects of skincare products, and demonstrates that the application of SMOTE effectively improves classification performance on imbalanced datasets. Future research is recommended to enhance sentiment labeling methods, improve feature quality, and explore more advanced approaches such as deep learning to achieve more accurate and robust classification results.

Downloads

Download data is not yet available.

References

Friska Aditia Indriyani, Ahmad Fauzi, and Sutan Faisal, “Analisis sentimen aplikasi tiktok menggunakan algoritma naïve bayes dan support vector machine,” TEKNOSAINS J. Sains, Teknol. dan Inform., vol. 10, no. 2, pp. 176–184, 2023, doi: 10.37373/tekno.v10i2.419.

A. N. Latif, M. Alianita, W. Musaadah, and R. Hidayat, “Mengatasi Tantangan dan Memanfaatkan Peluang Industri Halal Kosmetik dan Obat-Obatan Menuju Wajib Halal 2026: Analisis SWOT,” Indones. J. Halal, vol. 7, no. 2, pp. 146–155, 2024, [Online]. Available: https://doi.org/10.14710/halal.v7i2.23506

Syifa Nurjanah and Yordan Hermawan Apidana, “Analisis Sentimen TikTok untuk Mengevaluasi Reputasi Merek Pasca Kasus Overclaim: Studi pada Daviena Skincare,” Technol. Informatics Insight J., vol. 4, no. 2, pp. 74–91, 2025, doi: 10.32639/h5x8te73.

A. Romadhony, S. Al Faraby, R. Rismala, U. N. Wisesti, and A. Arifianto, “Sentiment Analysis on a Large Indonesian Product Review Dataset,” J. Inf. Syst. Eng. Bus. Intell., vol. 10, no. 1, pp. 167–178, 2024, doi: 10.20473/jisebi.10.1.167-178.

G. Khoerunnisa, S. S. Maesaroh, and M. R. Nugraha, “Analisis Sentimen Publik Terhadap Produk Rekomendasi Influencer Menggunakan Naive Bayes: Studi Kasus Tasya Farasya,” Data Sci. Indones., vol. 5, no. 1, pp. 84–95, 2025, doi: 10.47709/dsi.v5i1.6218.

S. Liem, T. Setiawan, and M. R. Pribadi, “Perbandingan Algoritma SVM dan Naïve Bayes Berbasis SMOTE dalam Analisis Sentimen Komentar Tiktok pada Produk Skincare,” Appl. Inf. Technol. Comput. Sci., vol. 3, no. 1, pp. 28–32, 2024, doi: 10.58466/aicoms.v3i1.1523.

J. Rama Dani, “Analisis Sentimen Komentar YouTube terhadap Kenaikan Tunjangan DPR RI menggunakan Naïve Bayes, SVM, dan Random Forest,” Technol. Sci., vol. 7, no. 3, pp. 1512–1524, 2025, doi: 10.47065/bits.v7i3.8513.

E. Permana, R. S. Eka Putri, P. D. Alfinda, and M. Mardhiyah, “Strategi Pemasaran Produk Skincare Somethinc Di Kalangan Generasi Z,” J. Pemasar. Kompetitif, vol. 7, no. 2, pp. 119–135, 2024, doi: 10.32493/jpkpk.v7i2.29289.

D. Ruswanti, D. Susilo, and R. Riani, “Implementasi CRISP-DM pada Data Mining untuk Melakukan Prediksi Pendapatan dengan Algoritma C.45,” Go Infotech J. Ilm. STMIK AUB, vol. 30, no. 1, pp. 111–121, 2024, doi: 10.36309/goi.v30i1.266.

Ridwan, E. H. Hermaliani, and M. Ernawati, “Penerapan Metode SMOTE Untuk Mengatasi Imbalanced Data Pada,” Comput. Sci., vol. 4, no. 1, pp. 80–88, 2024, [Online]. Available: http://jurnal.bsi.ac.id/index.php/co-science

C. Montag, H. Yang, and J. D. Elhai, “On the Psychology of TikTok Use: A First Glimpse From Empirical Findings,” Front. Public Heal., vol. 9, no. March, pp. 1–6, 2021, doi: 10.3389/fpubh.2021.641673.

D. Rifaldi and A. Fadlil, “DECODE: Jurnal Pendidikan Teknologi Informasi TEKNIK PREPROCESSING PADA TEXT MINING MENGGUNAKAN DATA TWEET ‘MENTAL HEALTH,’” Decod. J. Pendidik. Teknol. Inf., vol. 3, no. 2, pp. 161–171, 2023.

S. J. Angelina, A. Bijaksana, P. Negara, and H. Muhardi, “Analisis Pengaruh Penerapan Stopword Removal Pada Performa Klasifikasi Sentimen Tweet Bahasa Indonesia,” JUARA (Jurnal Apl. dan Ris. Inform., vol. 02, no. 1, pp. 165–173, 2023, doi: 10.26418/juara.v2i1.69680.

P. Keterampilan and M. Berbasis, “RISET : Jurnal Ilmiah Multidisiplin Ilmu,” vol. 1, no. 1, 2025.

A. Hermawan, I. Jowensen, J. Junaedi, and Edy, “Implementasi Text-Mining untuk Analisis Sentimen pada Twitter dengan Algoritma Support Vector Machine,” JST (Jurnal Sains dan Teknol., vol. 12, no. 1, pp. 129–137, 2023, doi: 10.23887/jstundiksha.v12i1.52358.

A. T. J. H, “Preprocessing Text untuk Meminimalisir Kata yang Tidak Berarti dalam Proses Text Mining,” pp. 1–9.

L. Mu et al., “DenseLoRA: Dense Low-Rank Adaptation of Large Language Models,” Proc. Annu. Meet. Assoc. Comput. Linguist., vol. 1, pp. 10198–10211, 2025, doi: 10.18653/v1/2025.acl-long.503.

M. U. Albab, Y. K. P., and M. N. Fawaiq, “Optimization of the Stemming Technique on Text Preprocessing President 3 Periods Topic,” J. Transform., vol. 20, no. 2, pp. 1–12, 2023, doi: 10.26623/transformatika.v20i2.5374.

S. Ghan, E. Diesen, C. Kunkel, K. Reuter, and H. Oberhofer, “Interpreting ultrafast electron transfer on surfaces with a converged first-principles Newns-Anderson chemisorption function,” J. Chem. Phys., vol. 158, no. 23, pp. 1–13, 2023, doi: 10.1063/5.0151009.

Y. Elor and H. Averbuch-Elor, To SMOTE, or not to SMOTE?, vol. 1, no. 1. Association for Computing Machinery. [Online]. Available: http://arxiv.org/abs/2201.08528

Y. Lin and Y. Xie, “Semilinear heat equations and parabolic variational inequalities on graphs,” pp. 1–25, 2021, [Online]. Available: http://arxiv.org/abs/2108.13007

H. Bichri, A. Chergui, and M. Hain, “Investigating the Impact of Train / Test Split Ratio on the Performance of Pre-Trained Models with Custom Datasets,” Int. J. Adv. Comput. Sci. Appl., vol. 15, no. 2, pp. 331–339, 2024, doi: 10.14569/IJACSA.2024.0150235.

R. Oktafiani, A. Hermawan, and D. Avianto, “Pengaruh Komposisi Split data Terhadap Performa Klasifikasi Penyakit Kanker Payudara Menggunakan Algoritma Machine Learning,” J. Sains dan Inform., vol. 9, no. April, pp. 19–28, 2023, doi: 10.34128/jsi.v9i1.622.

H. Oktavianto, H. W. Sulistyo, G. Wijaya, and D. Irawan, “Analisis Komparasi Kinerja Metode Decision Tree dan Random Forest dalam Klasifikasi Teks Data Kesehatan,” vol. 11, no. 1, pp. 56–65, 2024.

A. Muzaki, V. Febriana, and W. N. Cholifah, “Analisis Sentimen Pada Ulasan Produk di E-Commerce dengan Metode Naive Bayes,” J. Ris. dan Apl. Mhs. Inform., vol. 5, no. 4, pp. 758–765, 2024, doi: 10.30998/jrami.v5i4.9647.

S. A. Arnomo, A. A. Fajrin, Y. Siyamto, and S. F. N. Sadikin, “Evaluasi Model Decision Tree Pada Keputusan Kelayakan Kredit,” J. Desain Dan Anal. Teknol., vol. 2, no. 2, pp. 200–206, 2023, doi: 10.58520/jddat.v2i2.39.

N. Fathirachman Mahing, A. Lazuardi Gunawan, A. Foresta Azhar Zen, F. Abdurrachman Bachtiar, and S. Agung Wicaksono, “Klasifikasi Tingkat Stress dari Data Berbentuk Teks dengan Menggunakan Algoritma Support Vector Machine (SVM) dan Random Forest,” J. Teknol. Inf. dan Ilmu Komput., vol. 11, no. 5, pp. 1067–1076, 2024, doi: 10.25126/jtiik.2024118010.

Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Klasifikasi Opini Pengguna TikTok terhadap Keamanan dan Efektivitas Produk Skincare Lokal menggunakan Metode Naïve Bayes, Decision Tree, dan Random Fores