Klasifikasi Opini Pengguna TikTok terhadap Keamanan dan Efektivitas Produk Skincare Lokal menggunakan Metode Naïve Bayes, Decision Tree, dan Random Fores
Abstract
This study aims to analyze and compare the performance of Naïve Bayes, Decision Tree, and Random Forest algorithms in classifying TikTok users’ opinions regarding the safety and effectiveness of local skincare products. The results show that these algorithms exhibit significant differences in performance for sentiment classification tasks. Before applying SMOTE, Random Forest achieved the highest accuracy of 87%, followed by Decision Tree at 79% and Naïve Bayes at 65%. The main weakness was observed in minority classes such as Safe and Unsafe, which had low recall values. After applying SMOTE, all models showed improved performance, particularly in recognizing minority classes, resulting in more balanced accuracy, precision, recall, and F1-score across all sentiment categories. The TF-IDF analysis revealed that the extracted features were still dominated by common words and brand names, indicating that they did not fully represent the specific aspects of safety and effectiveness. This suggests that the preprocessing and feature selection stages can be further improved to generate more relevant feature representations. The classification visualization showed that most comments were categorized as Effective and Ineffective, while the Neutral category contained fewer instances. The implementation of SMOTE improved model performance in handling imbalanced data; however, it must be applied carefully only to the training data to avoid evaluation bias. Overall, Random Forest demonstrated the best performance among the evaluated algorithms. This study contributes to the development of a multi-class sentiment analysis model capable of distinguishing between safety and effectiveness aspects of skincare products, and demonstrates that the application of SMOTE effectively improves classification performance on imbalanced datasets. Future research is recommended to enhance sentiment labeling methods, improve feature quality, and explore more advanced approaches such as deep learning to achieve more accurate and robust classification results.
Downloads
References
Friska Aditia Indriyani, Ahmad Fauzi, and Sutan Faisal, “Analisis sentimen aplikasi tiktok menggunakan algoritma naïve bayes dan support vector machine,” TEKNOSAINS J. Sains, Teknol. dan Inform., vol. 10, no. 2, pp. 176–184, 2023, doi: 10.37373/tekno.v10i2.419.
A. N. Latif, M. Alianita, W. Musaadah, and R. Hidayat, “Mengatasi Tantangan dan Memanfaatkan Peluang Industri Halal Kosmetik dan Obat-Obatan Menuju Wajib Halal 2026: Analisis SWOT,” Indones. J. Halal, vol. 7, no. 2, pp. 146–155, 2024, [Online]. Available: https://doi.org/10.14710/halal.v7i2.23506
Syifa Nurjanah and Yordan Hermawan Apidana, “Analisis Sentimen TikTok untuk Mengevaluasi Reputasi Merek Pasca Kasus Overclaim: Studi pada Daviena Skincare,” Technol. Informatics Insight J., vol. 4, no. 2, pp. 74–91, 2025, doi: 10.32639/h5x8te73.
A. Romadhony, S. Al Faraby, R. Rismala, U. N. Wisesti, and A. Arifianto, “Sentiment Analysis on a Large Indonesian Product Review Dataset,” J. Inf. Syst. Eng. Bus. Intell., vol. 10, no. 1, pp. 167–178, 2024, doi: 10.20473/jisebi.10.1.167-178.
G. Khoerunnisa, S. S. Maesaroh, and M. R. Nugraha, “Analisis Sentimen Publik Terhadap Produk Rekomendasi Influencer Menggunakan Naive Bayes: Studi Kasus Tasya Farasya,” Data Sci. Indones., vol. 5, no. 1, pp. 84–95, 2025, doi: 10.47709/dsi.v5i1.6218.
S. Liem, T. Setiawan, and M. R. Pribadi, “Perbandingan Algoritma SVM dan Naïve Bayes Berbasis SMOTE dalam Analisis Sentimen Komentar Tiktok pada Produk Skincare,” Appl. Inf. Technol. Comput. Sci., vol. 3, no. 1, pp. 28–32, 2024, doi: 10.58466/aicoms.v3i1.1523.
J. Rama Dani, “Analisis Sentimen Komentar YouTube terhadap Kenaikan Tunjangan DPR RI menggunakan Naïve Bayes, SVM, dan Random Forest,” Technol. Sci., vol. 7, no. 3, pp. 1512–1524, 2025, doi: 10.47065/bits.v7i3.8513.
E. Permana, R. S. Eka Putri, P. D. Alfinda, and M. Mardhiyah, “Strategi Pemasaran Produk Skincare Somethinc Di Kalangan Generasi Z,” J. Pemasar. Kompetitif, vol. 7, no. 2, pp. 119–135, 2024, doi: 10.32493/jpkpk.v7i2.29289.
D. Ruswanti, D. Susilo, and R. Riani, “Implementasi CRISP-DM pada Data Mining untuk Melakukan Prediksi Pendapatan dengan Algoritma C.45,” Go Infotech J. Ilm. STMIK AUB, vol. 30, no. 1, pp. 111–121, 2024, doi: 10.36309/goi.v30i1.266.
Ridwan, E. H. Hermaliani, and M. Ernawati, “Penerapan Metode SMOTE Untuk Mengatasi Imbalanced Data Pada,” Comput. Sci., vol. 4, no. 1, pp. 80–88, 2024, [Online]. Available: http://jurnal.bsi.ac.id/index.php/co-science
C. Montag, H. Yang, and J. D. Elhai, “On the Psychology of TikTok Use: A First Glimpse From Empirical Findings,” Front. Public Heal., vol. 9, no. March, pp. 1–6, 2021, doi: 10.3389/fpubh.2021.641673.
D. Rifaldi and A. Fadlil, “DECODE: Jurnal Pendidikan Teknologi Informasi TEKNIK PREPROCESSING PADA TEXT MINING MENGGUNAKAN DATA TWEET ‘MENTAL HEALTH,’” Decod. J. Pendidik. Teknol. Inf., vol. 3, no. 2, pp. 161–171, 2023.
S. J. Angelina, A. Bijaksana, P. Negara, and H. Muhardi, “Analisis Pengaruh Penerapan Stopword Removal Pada Performa Klasifikasi Sentimen Tweet Bahasa Indonesia,” JUARA (Jurnal Apl. dan Ris. Inform., vol. 02, no. 1, pp. 165–173, 2023, doi: 10.26418/juara.v2i1.69680.
P. Keterampilan and M. Berbasis, “RISET : Jurnal Ilmiah Multidisiplin Ilmu,” vol. 1, no. 1, 2025.
A. Hermawan, I. Jowensen, J. Junaedi, and Edy, “Implementasi Text-Mining untuk Analisis Sentimen pada Twitter dengan Algoritma Support Vector Machine,” JST (Jurnal Sains dan Teknol., vol. 12, no. 1, pp. 129–137, 2023, doi: 10.23887/jstundiksha.v12i1.52358.
A. T. J. H, “Preprocessing Text untuk Meminimalisir Kata yang Tidak Berarti dalam Proses Text Mining,” pp. 1–9.
L. Mu et al., “DenseLoRA: Dense Low-Rank Adaptation of Large Language Models,” Proc. Annu. Meet. Assoc. Comput. Linguist., vol. 1, pp. 10198–10211, 2025, doi: 10.18653/v1/2025.acl-long.503.
M. U. Albab, Y. K. P., and M. N. Fawaiq, “Optimization of the Stemming Technique on Text Preprocessing President 3 Periods Topic,” J. Transform., vol. 20, no. 2, pp. 1–12, 2023, doi: 10.26623/transformatika.v20i2.5374.
S. Ghan, E. Diesen, C. Kunkel, K. Reuter, and H. Oberhofer, “Interpreting ultrafast electron transfer on surfaces with a converged first-principles Newns-Anderson chemisorption function,” J. Chem. Phys., vol. 158, no. 23, pp. 1–13, 2023, doi: 10.1063/5.0151009.
Y. Elor and H. Averbuch-Elor, To SMOTE, or not to SMOTE?, vol. 1, no. 1. Association for Computing Machinery. [Online]. Available: http://arxiv.org/abs/2201.08528
Y. Lin and Y. Xie, “Semilinear heat equations and parabolic variational inequalities on graphs,” pp. 1–25, 2021, [Online]. Available: http://arxiv.org/abs/2108.13007
H. Bichri, A. Chergui, and M. Hain, “Investigating the Impact of Train / Test Split Ratio on the Performance of Pre-Trained Models with Custom Datasets,” Int. J. Adv. Comput. Sci. Appl., vol. 15, no. 2, pp. 331–339, 2024, doi: 10.14569/IJACSA.2024.0150235.
R. Oktafiani, A. Hermawan, and D. Avianto, “Pengaruh Komposisi Split data Terhadap Performa Klasifikasi Penyakit Kanker Payudara Menggunakan Algoritma Machine Learning,” J. Sains dan Inform., vol. 9, no. April, pp. 19–28, 2023, doi: 10.34128/jsi.v9i1.622.
H. Oktavianto, H. W. Sulistyo, G. Wijaya, and D. Irawan, “Analisis Komparasi Kinerja Metode Decision Tree dan Random Forest dalam Klasifikasi Teks Data Kesehatan,” vol. 11, no. 1, pp. 56–65, 2024.
A. Muzaki, V. Febriana, and W. N. Cholifah, “Analisis Sentimen Pada Ulasan Produk di E-Commerce dengan Metode Naive Bayes,” J. Ris. dan Apl. Mhs. Inform., vol. 5, no. 4, pp. 758–765, 2024, doi: 10.30998/jrami.v5i4.9647.
S. A. Arnomo, A. A. Fajrin, Y. Siyamto, and S. F. N. Sadikin, “Evaluasi Model Decision Tree Pada Keputusan Kelayakan Kredit,” J. Desain Dan Anal. Teknol., vol. 2, no. 2, pp. 200–206, 2023, doi: 10.58520/jddat.v2i2.39.
N. Fathirachman Mahing, A. Lazuardi Gunawan, A. Foresta Azhar Zen, F. Abdurrachman Bachtiar, and S. Agung Wicaksono, “Klasifikasi Tingkat Stress dari Data Berbentuk Teks dengan Menggunakan Algoritma Support Vector Machine (SVM) dan Random Forest,” J. Teknol. Inf. dan Ilmu Komput., vol. 11, no. 5, pp. 1067–1076, 2024, doi: 10.25126/jtiik.2024118010.
Bila bermanfaat silahkan share artikel ini
Berikan Komentar Anda terhadap artikel Klasifikasi Opini Pengguna TikTok terhadap Keamanan dan Efektivitas Produk Skincare Lokal menggunakan Metode Naïve Bayes, Decision Tree, dan Random Fores
Pages: 2736-2751
Copyright (c) 2026 Sintia Ariyani, Styawati Styawati

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).





















