Sentiment Classification and Interpretation of Tokopedia Reviews: A Machine Learning, IndoBERT, and LIME Approach
Abstract
Sentiment classification of user reviews plays a vital role in business decision-making, especially on e-commerce platforms like Tokopedia. This study evaluates the performance of various sentiment classification models such as Logistic Regression LinearSVC, and BERT models, both baseline and fine-tuned. Evaluation metrics used include accuracy, precision, recall, and F1-score, applied to Tokopedia review data labelled based on user ratings. The result is fine-tuned BERT model has the best and consistent result, with 92% accuracy and 0.92 f1-score for each class. This shows that fine-tuned BERT can effectively capture the semantic context of user reviews. Its consistent performance across classes makes it suitable for reliable sentiment classification in real-world applications. Furthermore, fine-tune BERT model is visualized by Local Interpretable Model-agnostic Explanation to identify features – in this case is word – that indicates sentiment as positive or negative. It will show as color, orange for positive and blue as negative. This method will make the model more transparent and more reliable.
Downloads
References
M. Zainottah, R. Rengga, Y. Yustian, and I. Isa, “Critical Sentiment Analysis of Tokopedia Electronic Products Using SVM-Logistic & TF-IDF Ensemble Methods,” Journal of Artificial Intelligence and Engineering Applications (JAIEA), vol. 4, pp. 2476–2482, Jul. 2025, doi: 10.59934/jaiea.v4i3.1194.
BPS-Statistics Indonesia, Statistik E-Commerce 2023 / E-Commerce Statistics 2023, Jakarta, Indonesia, Publikasi No. 06300.25001, Jan. 30, 2025. [Online]. Available: https://www.bps.go.id/id/publication/2025/01/30/d52af11843aee401403ecfa6/statistik-e-commerce-2023.html
M. Birjali, M. Kasri, and A. Beni-Hssane, “A comprehensive survey on sentiment analysis: Approaches, challenges and trends,” Knowl Based Syst, vol. 226, Aug. 2021, doi: 10.1016/j.knosys.2021.107134.
H. Huang, A. Asemi, and M. Mustafa, “Sentiment Analysis in E-Commerce Platforms: A Review of Current Techniques and Future Directions,” IEEE Access, vol. 11, p. 1, Jul. 2023, doi: 10.1109/ACCESS.2023.3307308.
Y. Pratama, D. Murdiansyah, and K. Lhaksmana, “Analisis Sentimen Kendaraan Listrik Pada Media Sosial Twitter Menggunakan Algoritma Logistic Regression dan Principal Component Analysis,” JURNAL MEDIA INFORMATIKA BUDIDARMA, vol. 7, no. 1, p. 529, Jul. 2023, doi: 10.30865/mib.v7i1.5575.
M. Qorib, T. Oladunni, M. Denis, E. Ososanya, and P. Cotae, “Covid-19 Vaccine Hesitancy: Text Mining, Sentiment Analysis and Machine Learning on COVID-19 Vaccination Twitter Dataset,” Expert Syst Appl, vol. 212, p. 118715, Jul. 2022, doi: 10.1016/j.eswa.2022.118715.
N. Smairi, H. Abadlia, H. Brahim, and W. L. Chaari, “Fine-tune BERT based on Machine Learning Models For Sentiment Analysis,” Procedia Comput Sci, vol. 246, no. C, pp. 2390–2399, Jan. 2024, doi: 10.1016/J.PROCS.2024.09.531.
N. Nurdin and A. Dimas, “Explainable Artificial Intelligence (XAI) towards Model Personality in NLP task,” 2021. doi: 10.12962%2Fj23378557.v7i1.a8989.
M. A. Ibrahim et al., “An Explainable AI Model for Hate Speech Detection on Indonesian Twitter,” CommIT (Communication and Information Technology) Journal, vol. 16, no. 2, Jul. 2022, doi: 10.21512/commit.v16i2.8343.
T. Thogesan, A. Nugaliyadde, and K. W. Wong, “Integration of Explainable AI Techniques with Large Language Models for Enhanced Interpretability for Sentiment Analysis,” 2025. [Online]. Available: https://arxiv.org/abs/2503.11948
D. S. Parmar and H. K. Saran, “Empirical Study on The Role of Explainable AI (XAI) in Improving Customer Trust in AI-Powered Products,” International Journal of Computer Trends and Technology, vol. 73, no. 2, pp. 48–57, Feb. 2025, doi: 10.14445/22312803/ijctt-v73i2p106.
Rezky Yayang Yakhamid, “Reviews of Indonesian Startup Apps on Playstore.” Accessed: Jul. 07, 2025. [Online]. Available: https://www.kaggle.com/datasets/rezkyyayang/reviews-of-indonesian-app-startups-on-playstore/data?select=tokopedia.csv
L. Qadrini, “Undersampling dan K-Fold Random Forest Untuk Klasifikasi Kelas Tidak Seimbang,” Building of Informatics, Technology and Science (BITS), vol. 4, no. 4, Jul. 2023, doi: 10.47065/bits.v4i4.3141.
G. Popoola, K.-K. Abdullah, G. Shu Fuhnwi, and J. Agbaje, “Sentiment Analysis of Financial News Data using TF-IDF and Machine Learning Algorithms,” Jul. 2024, pp. 1–6. doi: 10.1109/ICAIC60265.2024.10433843.
M. Nasir and S. Hidayat, “Analisis Sentimen Ulasan Film Menggunakan Metode BiLSTM,” Jurnal Informatika dan Teknologi Komputer (J-ICOM), vol. 5, no. 2, pp. 126–132, Jul. 2024, doi: 10.55377/j-icom.v5i2.8871.
P. Schober and T. Vetter, “Logistic Regression in Medical Research,” Anesth Analg, vol. 132, pp. 365–366, Jul. 2021, doi: 10.1213/ANE.0000000000005247.
N. Ashraf, R. Iqbal, S. Bano, H. M. Azeem, and S. Naz, “Enhancing MBTI Personality Prediction from Text Data with Advance Word Embedding Technique.,” VFAST Transactions on Software Engineering, vol. 12, p. 35, Jul. 2024, doi: 10.21015/vtse.v12i3.1864.
V. Chakkarwar, S. Tamane, and A. Thombre, “A Review on BERT and Its Implementation in Various NLP Tasks,” pp. 112–121, 2023, doi: 10.2991/978-94-6463-136-4_12.
Y. Wu, Z. Jin, C. Shi, P. Liang, and T. Zhan, “Research on the application of deep learning-based BERT model in sentiment analysis,” Applied and Computational Engineering, vol. 71, pp. 14–20, Jul. 2024, doi: 10.54254/2755-2721/71/2024MA.
M. Tripathi, “Sentiment Analysis of Nepali COVID19 Tweets Using NB, SVM AND LSTM,” Journal of Artificial Intelligence and Capsule Networks, vol. 3, no. 3, pp. 151–168, Jul. 2021, doi: 10.36548/jaicn.2021.3.001.
S. Rao, S. Mehta, S. Kulkarni, H. Dalvi, N. Katre, and M. Narvekar, "A Study of LIME and SHAP Model Explainers for Autonomous Disease Predictions," in 2022 IEEE Bombay Section Signature Conference (IBSSC), Mumbai, India, Dec. 2022, pp. 1-6, doi: 10.1109/IBSSC56953.2022.10037324.
C. V Roberts, E. Elahi, and A. Chandrashekar, “On the Bias-Variance Characteristics of LIME and SHAP in High Sparsity Movie Recommendation Explanation Tasks,” 2022. [Online]. Available: https://arxiv.org/abs/2206.04784
A. Salih et al., “A Perspective on Explainable Artificial Intelligence Methods: SHAP and LIME,” Advanced Intelligent Systems, vol. 7, Jul. 2024, doi: 10.1002/aisy.202400304.
Bila bermanfaat silahkan share artikel ini
Berikan Komentar Anda terhadap artikel Sentiment Classification and Interpretation of Tokopedia Reviews: A Machine Learning, IndoBERT, and LIME Approach
Pages: 1164-1173
Copyright (c) 2025 Adrian Yoris Mbake Woka, Mahendra Dwifebri Purbolaksono, Dody Qori Utama

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).





















