Sentiment Classification and Interpretation of Tokopedia Reviews: A Machine Learning, IndoBERT, and LIME Approach


  • Adrian Yoris Mbake Woka Telkom University, Bandung, Indonesia
  • Mahendra Dwifebri Purbolaksono Telkom University, Bandung, Indonesia
  • Dody Qori Utama * Mail Telkom University, Bandung, Indonesia
  • (*) Corresponding Author
Keywords: Sentiment Classification; Tokopedia; Machine Learning; BERT; XAI; LIME

Abstract

Sentiment classification of user reviews plays a vital role in business decision-making, especially on e-commerce platforms like Tokopedia. This study evaluates the performance of various sentiment classification models such as Logistic Regression LinearSVC, and BERT models, both baseline and fine-tuned. Evaluation metrics used include accuracy, precision, recall, and F1-score, applied to Tokopedia review data labelled based on user ratings. The result is fine-tuned BERT model has the best and consistent result, with 92% accuracy and 0.92 f1-score for each class. This shows that fine-tuned BERT can effectively capture the semantic context of user reviews. Its consistent performance across classes makes it suitable for reliable sentiment classification in real-world applications. Furthermore, fine-tune BERT model is visualized by Local Interpretable Model-agnostic Explanation to identify features – in this case is word – that indicates sentiment as positive or negative. It will show as color, orange for positive and blue as negative. This method will make the model more transparent and more reliable.

Downloads

Download data is not yet available.

References

M. Zainottah, R. Rengga, Y. Yustian, and I. Isa, “Critical Sentiment Analysis of Tokopedia Electronic Products Using SVM-Logistic & TF-IDF Ensemble Methods,” Journal of Artificial Intelligence and Engineering Applications (JAIEA), vol. 4, pp. 2476–2482, Jul. 2025, doi: 10.59934/jaiea.v4i3.1194.

BPS-Statistics Indonesia, Statistik E-Commerce 2023 / E-Commerce Statistics 2023, Jakarta, Indonesia, Publikasi No. 06300.25001, Jan. 30, 2025. [Online]. Available: https://www.bps.go.id/id/publication/2025/01/30/d52af11843aee401403ecfa6/statistik-e-commerce-2023.html

M. Birjali, M. Kasri, and A. Beni-Hssane, “A comprehensive survey on sentiment analysis: Approaches, challenges and trends,” Knowl Based Syst, vol. 226, Aug. 2021, doi: 10.1016/j.knosys.2021.107134.

H. Huang, A. Asemi, and M. Mustafa, “Sentiment Analysis in E-Commerce Platforms: A Review of Current Techniques and Future Directions,” IEEE Access, vol. 11, p. 1, Jul. 2023, doi: 10.1109/ACCESS.2023.3307308.

Y. Pratama, D. Murdiansyah, and K. Lhaksmana, “Analisis Sentimen Kendaraan Listrik Pada Media Sosial Twitter Menggunakan Algoritma Logistic Regression dan Principal Component Analysis,” JURNAL MEDIA INFORMATIKA BUDIDARMA, vol. 7, no. 1, p. 529, Jul. 2023, doi: 10.30865/mib.v7i1.5575.

M. Qorib, T. Oladunni, M. Denis, E. Ososanya, and P. Cotae, “Covid-19 Vaccine Hesitancy: Text Mining, Sentiment Analysis and Machine Learning on COVID-19 Vaccination Twitter Dataset,” Expert Syst Appl, vol. 212, p. 118715, Jul. 2022, doi: 10.1016/j.eswa.2022.118715.

N. Smairi, H. Abadlia, H. Brahim, and W. L. Chaari, “Fine-tune BERT based on Machine Learning Models For Sentiment Analysis,” Procedia Comput Sci, vol. 246, no. C, pp. 2390–2399, Jan. 2024, doi: 10.1016/J.PROCS.2024.09.531.

N. Nurdin and A. Dimas, “Explainable Artificial Intelligence (XAI) towards Model Personality in NLP task,” 2021. doi: 10.12962%2Fj23378557.v7i1.a8989.

M. A. Ibrahim et al., “An Explainable AI Model for Hate Speech Detection on Indonesian Twitter,” CommIT (Communication and Information Technology) Journal, vol. 16, no. 2, Jul. 2022, doi: 10.21512/commit.v16i2.8343.

T. Thogesan, A. Nugaliyadde, and K. W. Wong, “Integration of Explainable AI Techniques with Large Language Models for Enhanced Interpretability for Sentiment Analysis,” 2025. [Online]. Available: https://arxiv.org/abs/2503.11948

D. S. Parmar and H. K. Saran, “Empirical Study on The Role of Explainable AI (XAI) in Improving Customer Trust in AI-Powered Products,” International Journal of Computer Trends and Technology, vol. 73, no. 2, pp. 48–57, Feb. 2025, doi: 10.14445/22312803/ijctt-v73i2p106.

Rezky Yayang Yakhamid, “Reviews of Indonesian Startup Apps on Playstore.” Accessed: Jul. 07, 2025. [Online]. Available: https://www.kaggle.com/datasets/rezkyyayang/reviews-of-indonesian-app-startups-on-playstore/data?select=tokopedia.csv

L. Qadrini, “Undersampling dan K-Fold Random Forest Untuk Klasifikasi Kelas Tidak Seimbang,” Building of Informatics, Technology and Science (BITS), vol. 4, no. 4, Jul. 2023, doi: 10.47065/bits.v4i4.3141.

G. Popoola, K.-K. Abdullah, G. Shu Fuhnwi, and J. Agbaje, “Sentiment Analysis of Financial News Data using TF-IDF and Machine Learning Algorithms,” Jul. 2024, pp. 1–6. doi: 10.1109/ICAIC60265.2024.10433843.

M. Nasir and S. Hidayat, “Analisis Sentimen Ulasan Film Menggunakan Metode BiLSTM,” Jurnal Informatika dan Teknologi Komputer (J-ICOM), vol. 5, no. 2, pp. 126–132, Jul. 2024, doi: 10.55377/j-icom.v5i2.8871.

P. Schober and T. Vetter, “Logistic Regression in Medical Research,” Anesth Analg, vol. 132, pp. 365–366, Jul. 2021, doi: 10.1213/ANE.0000000000005247.

N. Ashraf, R. Iqbal, S. Bano, H. M. Azeem, and S. Naz, “Enhancing MBTI Personality Prediction from Text Data with Advance Word Embedding Technique.,” VFAST Transactions on Software Engineering, vol. 12, p. 35, Jul. 2024, doi: 10.21015/vtse.v12i3.1864.

V. Chakkarwar, S. Tamane, and A. Thombre, “A Review on BERT and Its Implementation in Various NLP Tasks,” pp. 112–121, 2023, doi: 10.2991/978-94-6463-136-4_12.

Y. Wu, Z. Jin, C. Shi, P. Liang, and T. Zhan, “Research on the application of deep learning-based BERT model in sentiment analysis,” Applied and Computational Engineering, vol. 71, pp. 14–20, Jul. 2024, doi: 10.54254/2755-2721/71/2024MA.

M. Tripathi, “Sentiment Analysis of Nepali COVID19 Tweets Using NB, SVM AND LSTM,” Journal of Artificial Intelligence and Capsule Networks, vol. 3, no. 3, pp. 151–168, Jul. 2021, doi: 10.36548/jaicn.2021.3.001.

S. Rao, S. Mehta, S. Kulkarni, H. Dalvi, N. Katre, and M. Narvekar, "A Study of LIME and SHAP Model Explainers for Autonomous Disease Predictions," in 2022 IEEE Bombay Section Signature Conference (IBSSC), Mumbai, India, Dec. 2022, pp. 1-6, doi: 10.1109/IBSSC56953.2022.10037324.

C. V Roberts, E. Elahi, and A. Chandrashekar, “On the Bias-Variance Characteristics of LIME and SHAP in High Sparsity Movie Recommendation Explanation Tasks,” 2022. [Online]. Available: https://arxiv.org/abs/2206.04784

A. Salih et al., “A Perspective on Explainable Artificial Intelligence Methods: SHAP and LIME,” Advanced Intelligent Systems, vol. 7, Jul. 2024, doi: 10.1002/aisy.202400304.


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Sentiment Classification and Interpretation of Tokopedia Reviews: A Machine Learning, IndoBERT, and LIME Approach

Dimensions Badge
Article History
Submitted: 2025-07-21
Published: 2025-09-02
Abstract View: 516 times
PDF Download: 408 times
How to Cite
Mbake Woka, A. Y., Purbolaksono, M., & Utama, D. (2025). Sentiment Classification and Interpretation of Tokopedia Reviews: A Machine Learning, IndoBERT, and LIME Approach. Building of Informatics, Technology and Science (BITS), 7(2), 1164-1173. https://doi.org/10.47065/bits.v7i2.8072
Section
Articles