Comparison of RoBERTa and IndoBERT on Multi-Aspect Sentiment Analysis of Indonesian Hotel Reviews with Tuning Optimization


  • Rizky Ahsan Syarif * Mail Telkom University, Bandung, Indonesia
  • Yuliant Sibaroni Telkom University, Bandung, Indonesia
  • (*) Corresponding Author
Keywords: Aspect-Based; Sentiment Analysis; RoBERTa; IndoBERT; Fine-Tuning

Abstract

The hospitality industry heavily relies on online reviews as a crucial source of information that influences potential guests' decisions. However, conducting sentiment analysis on hotel reviews can be challenging due to the complexity of language and contextual diversity, especially in Indonesian. This study aims to develop and optimize a RoBERTa-based sentiment analysis model to improve the accuracy of sentiment classification in Indonesian hotel reviews, focusing on the aspects of facilities, cleanliness, location, price, and service. The methodology includes data collection through web scraping from the Traveloka platform, manual labeling, and text pre-processing. The RoBERTa model was trained and optimized using fine-tuning techniques and evaluated using metrics such as accuracy, precision, recall, F1-score, and AUC. The results show that the optimized RoBERTa model achieves competitive performance, although the IndoBERT model with Bayesian Optimization demonstrates superior performance, particularly in terms of accuracy and efficiency in identifying positive and negative sentiments. This study is expected to contribute to the development of more effective and accurate aspect-based sentiment analysis (ABSA) for Indonesian-language hotel reviews. It also opens opportunities for applying NLP technology in the hospitality industry and across other review platforms, thereby improving sentiment analysis quality and assisting hotel managers in enhancing service and customer experience.

Downloads

Download data is not yet available.

References

N. K. Nissa and E. Yulianti, “Multi-label text classification of Indonesian customer reviews using bidirectional encoder representations from transformers language model,” International Journal of Electrical and Computer Engineering, vol. 13, no. 5, pp. 5641–5652, Oct. 2023, doi: 10.11591/ijece.v13i5.pp5641-5652.

E. M. Pusung and I. N. Dewi, “Optimasi RoBERTa dengan Hyperparameter Tuning untuk Deteksi Emosi berbasis Teks,” Jurnal Nasional Teknologi dan Sistem Informasi, vol. 10, no. 3, pp. 240–248, Feb. 2025, doi: 10.25077/TEKNOSI.v10i3.2024.240-248.

Y. A. Singgalen, “IndoBERT-Based Sentiment Analysis for Understanding Hotel Guests’ Preferences,” Article in Journal of Computer System and Informatics, vol. 6, no. 2, pp. 508–520, 2025, doi: 10.47065/josyc.v6i2.6864.

I. Nawawi, K. F. Ilmawan, M. R. Maarif, and M. Syafrudin, “Exploring Tourist Experience through Online Reviews Using Aspect-Based Sentiment Analysis with Zero-Shot Learning for Hospitality Service Enhancement,” Information (Switzerland), vol. 15, no. 8, Aug. 2024, doi: 10.3390/info15080499.

F. Baharuddin and M. F. Naufal, “Fine-Tuning IndoBERT for Indonesian Exam Question Classification Based on Bloom’s Taxonomy,” Journal of Information Systems Engineering and Business Intelligence, vol. 9, no. 2, pp. 253–263, Oct. 2023, doi: 10.20473/jisebi.9.2.253-263.

Z. A. Diekson, M. R. B. Prakoso, M. S. Q. Putra, M. S. A. F. Syaputra, S. Achmad, and R. Sutoyo, “Sentiment analysis for customer review: Case study of Traveloka,” in Procedia Computer Science, Elsevier B.V., 2022, pp. 682–690. doi: 10.1016/j.procs.2022.12.184.

R. I. Perwira, V. A. Permadi, D. I. Purnamasari, and R. P. Agusdin, “Domain-Specific Fine-Tuning of IndoBERT for Aspect-Based Sentiment Analysis in Indonesian Travel User-Generated Content,” Journal of Information Systems Engineering and Business Intelligence, vol. 11, no. 1, pp. 30–40, Feb. 2025, doi: 10.20473/jisebi.11.1.30-40.

A. Lin, N. Livando, W. Chandra, G. Phan, and A. M. Husein, “Sentiment Analysis Of Hotel Reviews On Tripadvisor With LSTM And ELECTRA,” SinkrOn, vol. 8, no. 2, pp. 733–740, Apr. 2023, doi: 10.33395/sinkron.v8i2.12234.

N. A. Semary, W. Ahmed, K. Amin, P. Pławiak, and M. Hammad, “Enhancing machine learning-based sentiment analysis through feature extraction techniques,” PLoS One, vol. 19, no. 2, Feb. 2024, doi: 10.1371/journal.pone.0294968.

T. Gori, A. Sunyoto, and H. Al Fatta, “Preprocessing Data dan Klasifikasi untuk Prediksi Kinerja Akademik Siswa,” Jurnal Teknologi Informasi dan Ilmu Komputer, vol. 11, no. 1, pp. 215–224, Feb. 2024, doi: 10.25126/jtiik.20241118074.

R. Merdiansah and A. Ali Ridha, “Analisis Sentimen Pengguna X Indonesia Terkait Kendaraan Listrik Menggunakan IndoBERT,” Jurnal Ilmu Komputer dan Sistem Informasi (JIKOMSI, vol. 7, no. 1, pp. 221–228, Mar. 2024, doi: https://doi.org/10.55338/jikomsi.v7i1.2895.

Hizbul Izzi, Arief Setyanto, and Anggit Dwi Hartanto, “Optimalisasi Akurasi Algoritma Naïve Bayes Dengan Metode Syntetic Minority Oversampling Technique (Smote) Pada Data Numerik,” Infotek: Jurnal Informatika dan Teknologi, vol. 8, no. 1, pp. 217–227, Jan. 2025, doi: 10.29408/jit.v8i1.28340.

F. A. Hizham and C. K. Murni, “Sentiment Analysis Based on Review of Puncak B29 Lumajang using Backpropagation Neural Network,” Atlantis Press, 2024, pp. 210–214. doi: 10.2991/978-94-6463-346-7_39.

Y. O. Sihombing, N. V. Situmorang, B. K. Negara, and J. M. Sutoyo, “Prediksi Sentimen Pada Teks Media Sosial Corporate University Menggunakan RoBERTa,” Prosiding PITNAS Widyaiswara, vol. 1, no. 1, pp. 302–316, Sep. 2024, [Online]. Available: https://ejournal.iwi.or.id/ojs/index.php/pitnas2024/article/view/316/187

Dian Agus Prawinata, Ani Dijah Rahajoe, and I Gede Susrama Mas Diyasa, “Analisis Sentimen Kendaraan Listrik Pada Twitter Menggunakan Metode Long Short Term Memory,” SABER : Jurnal Teknik Informatika, Sains dan Ilmu Komunikasi, vol. 2, no. 1, pp. 300–313, Jan. 2024, doi: 10.59841/saber.v2i1.857.

S. Chibuzor, “Smart City PM2.5 Air Pollution Modeling Techniques: Train-Test Data Split versus K-Fold Cross Validation Techniques,” Journal of Inventive Engineering and Technology (JIET, vol. 2, no. 3, pp. 30–44, Mar. 2023, [Online]. Available: https://www.jiengtech.com/index.php/INDEX/article/viewFile/50/49

J. A Ilemobayo et al., “Hyperparameter Tuning in Machine Learning: A Comprehensive Review,” Journal of Engineering Research and Reports, vol. 26, no. 6, pp. 388–395, Jun. 2024, doi: 10.9734/jerr/2024/v26i61188.

A. Alamsyah and N. D. Girawan, “Improving Clothing Product Quality and Reducing Waste Based on Consumer Review Using RoBERTa and BERTopic Language Model,” Big Data and Cognitive Computing, vol. 7, no. 4, Dec. 2023, doi: 10.3390/bdcc7040168.

A. Farhan AlShammari, “Implementation of Model Evaluation using Confusion Matrix in Python,” Int J Comput Appl, vol. 186, no. 50, pp. 0975–8887, Nov. 2024, doi: 10.5120/ijca2024924236.

A. R. Adinata, T. Rohana, K. A. Baihaqi, and S. Faisal, “Implementasi Algoritma Convolutional Neural Network dan YOLOV8 Untuk Klasifikasi Ras Kucing,” Building of Informatics, Technology and Science (BITS), vol. 6, no. 3, pp. 1658–1667, Dec. 2024, doi: 10.47065/bits.v6i3.5913.

H. Kaur and D. Kaur Sandhu, “Evaluating the Effectiveness of the Proposed System Using F1 Score, Recall, Accuracy, Precision and Loss Metrics Compared to Prior Techniques,” International Journal of Communication Networks and Information Security, vol. 15, pp. 368–383, Sep. 2024, [Online]. Available: https://ijcnis.org/index.php/ijcnis/article/view/7168


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Comparison of RoBERTa and IndoBERT on Multi-Aspect Sentiment Analysis of Indonesian Hotel Reviews with Tuning Optimization

Dimensions Badge
Article History
Submitted: 2025-06-05
Published: 2025-09-02
Abstract View: 1058 times
PDF Download: 268 times
How to Cite
Syarif, R., & Sibaroni, Y. (2025). Comparison of RoBERTa and IndoBERT on Multi-Aspect Sentiment Analysis of Indonesian Hotel Reviews with Tuning Optimization. Building of Informatics, Technology and Science (BITS), 7(2), 949-959. https://doi.org/10.47065/bits.v7i2.7523
Section
Articles

Most read articles by the same author(s)

1 2 3 > >>