A Hybrid CNN-LSTM Model with SMOTE for Enhanced Sentiment Analysis of Hotel Reviews
Abstract
The growing reliance on online reviews as a critical decision-making tool in the hospitality industry underscores the need for robust sentiment analysis methodologies. Understanding customer feedback is essential for hotels to enhance service quality and maintain a competitive edge in an increasingly digital marketplace. However, traditional sentiment analysis models often encounter difficulties processing unstructured textual data, particularly when faced with class imbalances where positive reviews dominate, overshadowing critical negative feedback. To address these challenges, this study investigates integrating a hybrid Convolutional Neural Network and Long Short-Term Memory (CNN-LSTM) model with the Synthetic Minority Over-sampling Technique (SMOTE) to improve sentiment classification accuracy. Utilizing a dataset of 665 reviews from THE 1O1 Bandung Dago Hotel, the model leverages CNN’s capability to capture local features and LSTM’s strength in handling sequential dependencies, resulting in a more nuanced analysis of customer sentiments. The application of SMOTE effectively balances the dataset, addressing the class imbalance issue, which often skews sentiment classification. This approach improves predictive accuracy and provides actionable insights to enhance customer satisfaction strategies. The study achieved an overall classification accuracy of 77%, with precision at 78%, recall at 77%, an F1 score of 77.5%, and an AUC score of 0.81, reflecting discriminatory solid capability. Future research could focus on model optimization, multilingual sentiment analysis, aspect-based sentiment insights, and real-time sentiment monitoring to further refine customer feedback analysis and support strategic decision-making in the hospitality sector.
Downloads
References
F. Khanam, A. Chakraborty, M. A. Habib, and M. S. Iqbal, “Bangla Sentiment Analysis On Highly Imbalanced Data Using Hybrid CNN-LSTM & Bangla BERT,” 2024 3rd International Conference on Advancement in Electrical and Electronic Engineering, ICAEEE 2024. 2024. doi: 10.1109/ICAEEE62219.2024.10561678.
K. Singh, A. Mahajan, and V. Mansotra, “Hybrid CNN-LSTM model combined with feature selection and SMOTE for detection of network attacks,” Int. J. Sens. Networks, vol. 43, no. 4, pp. 208–222, 2023, doi: 10.1504/IJSNET.2023.135851.
H. M. Rai and K. Chatterjee, “Hybrid CNN-LSTM deep learning model and ensemble technique for automatic detection of myocardial infarction using big ECG data,” Appl. Intell., vol. 52, no. 5, pp. 5366–5384, 2022, doi: 10.1007/s10489-021-02696-6.
S. A. Alex, N. Z. Jhanjhi, M. Humayun, A. O. Ibrahim, and A. W. Abulfaraj, “Deep LSTM Model for Diabetes Prediction with Class Balancing by SMOTE,” Electron., vol. 11, no. 17, 2022, doi: 10.3390/electronics11172737.
X. Wu, H. Xiang, Y. Wang, and Y. Huo, “How does customer satisfaction change after hotels start using self-service kiosks?,” Int. J. Hosp. Manag., vol. 122, 2024, doi: 10.1016/j.ijhm.2024.103872.
H. T. T. Nguyen, T. P. Huong, A. L. T. Tram, and T. V. Tran, “Exploring Customer Feedback on Their Hotel Experiences in Vietnam,” Int. J. E-entrepreneursh. Innov., vol. 13, no. 1, 2023, doi: 10.4018/IJEEI.330023.
C. YU, L. J. LIANG, and H. C. CHOI, “Examining Customer Value Cocreation Behavior in Boutique Hotels: Hospitableness, Perceived Value, Satisfaction, and Citizenship Behavior,” Tour. Anal., vol. 29, no. 2, pp. 221–237, 2024, doi: 10.3727/108354224X17091476372167.
Z. Shu, M. H. Torralba, R. A. Carrasco, and M. F. B. López, “Assessing customer satisfaction of London luxury hotels with the AHP method and the SERVPERF scale: a case study of customer reviews on TripAdvisor,” Procedia Computer Science, vol. 221. pp. 73–80, 2023. doi: 10.1016/j.procs.2023.07.011.
E. Park, J. Kang, D. Choi, and J. Han, “Understanding customers’ hotel revisiting behaviour: a sentiment analysis of online feedback reviews,” Curr. Issues Tour., vol. 23, no. 5, pp. 605–611, 2020, doi: 10.1080/13683500.2018.1549025.
M. A. Elberri, Ü. Tokeşer, J. Rahebi, and J. M. Lopez-Guede, “A cyber defense system against phishing attacks with deep learning game theory and LSTM-CNN with African vulture optimization algorithm (AVOA),” Int. J. Inf. Secur., vol. 23, no. 4, pp. 2583–2606, 2024, doi: 10.1007/s10207-024-00851-x.
Y. Han et al., “Production prediction modeling of food waste anaerobic digestion for resources saving based on SMOTE-LSTM,” Appl. Energy, vol. 352, 2023, doi: 10.1016/j.apenergy.2023.122024.
R. Zaimi, M. Hafidi, and M. Lamia, “A deep learning mechanism to detect phishing URLs using the permutation importance method and SMOTE-Tomek link,” J. Supercomput., vol. 80, no. 12, pp. 17159–17191, 2024, doi: 10.1007/s11227-024-06124-7.
A. A. S. Shaikh, M. S. Bhargavi, and C. P. Kumar, “An optimised Darknet traffic detection system using modified locally connected CNN - BiLSTM network,” Int. J. Ad Hoc Ubiquitous Comput., vol. 43, no. 2, pp. 87–96, 2023, doi: 10.1504/ijahuc.2023.131361.
A. A. Alani, G. Cosma, and A. Taherkhani, “Classifying Imbalanced Multi-modal Sensor Data for Human Activity Recognition in a Smart Home using Deep Learning,” Proceedings of the International Joint Conference on Neural Networks. 2020. doi: 10.1109/IJCNN48605.2020.9207697.
M. Mujahid et al., “Sentiment analysis and topic modeling on tweets about online education during covid-19,” Appl. Sci., vol. 11, no. 18, 2021, doi: 10.3390/app11188438.
S. Solayman, S. A. Aumi, C. S. Mery, M. Mubassir, and R. Khan, “Automatic COVID-19 prediction using explainable machine learning techniques,” Int. J. Cogn. Comput. Eng., vol. 4, pp. 36–46, 2023, doi: 10.1016/j.ijcce.2023.01.003.
R. Ahmad, L. A. Maghrabi, I. A. Khaja, L. A. Maghrabi, and M. Ahmad, “SMOTE-Based Automated PCOS Prediction Using Lightweight Deep Learning Models,” Diagnostics, vol. 14, no. 19, 2024, doi: 10.3390/diagnostics14192225.
M. N. Yousuf Ali, T. Kabir, N. L. Raka, S. Siddikha Toma, M. L. Rahman, and J. Ferdaus, “SMOTE Based Credit Card Fraud Detection Using Convolutional Neural Network,” Proceedings of 2022 25th International Conference on Computer and Information Technology, ICCIT 2022. pp. 55–60, 2022. doi: 10.1109/ICCIT57492.2022.10054727.
E. R. Subhiyakto et al., “Evaluation of Resampling Techniques in CNN-Based Heartbeat Classification,” Ing. des Syst. d’Information, vol. 29, no. 4, pp. 1323–1332, 2024, doi: 10.18280/isi.290408.
S. Mirlekar, K. P. Kanojia, and B. Chourasia, “A Stacked CNN-BiLSTM Model with Majority Technique for Detecting the Intrusions in Network,” Int. J. Intell. Syst. Appl. Eng., vol. 12, no. 5s, pp. 152–162, 2024, [Online]. Available: https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85179998659&origin=inward
H. M. Rai, K. Chatterjee, and S. Dashkevych, “The prediction of cardiac abnormality and enhancement in minority class accuracy from imbalanced ECG signals using modified deep neural network models,” Comput. Biol. Med., vol. 150, 2022, doi: 10.1016/j.compbiomed.2022.106142.
V. Choudhary, S. Tanwar, and T. Choudhury, “A Hybrid Deep Learning Model for Intrusion Detection System in the Internet of Things Environment,” 2023 4th International Conference on Data Analytics for Business and Industry, ICDABI 2023. pp. 682–689, 2023. doi: 10.1109/ICDABI60145.2023.10629562.
L. Popp et al., “Evaluation of customer experience and satisfaction in luxury resort hotels of the Maldives,” J. Environ. Manag. Tour., vol. 12, no. 8, pp. 2099–2108, 2021, doi: 10.14505/jemt.v12.8(56).09.
V. H. Nguyen and T. Ho, “Analyzing Customer Experience in Hotel Services Using Topic Modeling,” J. Inf. Process. Syst., vol. 17, no. 3, pp. 586–598, 2021, doi: 10.3745/JIPS.04.0217.
C. Ding, Q. Guo, A. Rehman, and M. Zeeshan, “Impact of environment on hotel customer satisfaction in Southeast Asia: A study of online booking site reviews,” Front. Environ. Sci., vol. 10, 2022, doi: 10.3389/fenvs.2022.978070.
S. Han and C. K. Anderson, “The Effect of Private Customer-Manager Social Engagement Upon Online Booking Behavior,” Cornell Hosp. Q., vol. 63, no. 2, pp. 141–151, 2022, doi: 10.1177/1938965520975330.
R. Narayan, A. Gehlot, R. Singh, S. V. Akram, N. Priyadarshi, and B. Twala, “Hospitality Feedback System 4.0: Digitalization of Feedback System with Integration of Industry 4.0 Enabling Technologies,” Sustain., vol. 14, no. 19, 2022, doi: 10.3390/su141912158.
F. Amali, H. Yigit, and Z. H. Kilimci, “Sentiment Analysis of Hotel Reviews using Deep Learning Approaches,” 2024 IEEE Open Conference of Electrical, Electronic and Information Sciences, eStream 2024 - Proceedings. 2024. doi: 10.1109/eStream61684.2024.10542593.
A. Liyih, S. Anagaw, M. Yibeyin, and Y. Tehone, “Sentiment analysis of the Hamas-Israel war on YouTube comments using deep learning,” Sci. Rep., vol. 14, no. 1, 2024, doi: 10.1038/s41598-024-63367-3.
W. Jiang, K. Zhou, C. Xiong, G. Du, C. Ou, and J. Zhang, “KSCB: a novel unsupervised method for text sentiment analysis,” Appl. Intell., vol. 53, no. 1, pp. 301–311, 2023, doi: 10.1007/s10489-022-03389-4.
N. A. Semary, W. Ahmed, K. Amin, P. Pławiak, and M. Hammad, “Improving sentiment classification using a RoBERTa-based hybrid model,” Front. Hum. Neurosci., vol. 17, 2023, doi: 10.3389/fnhum.2023.1292010.
R. Olusegun, T. Oladunni, H. Audu, Y. A. O. Houkpati, and S. Bengesi, “Text Mining and Emotion Classification on Monkeypox Twitter Dataset: A Deep Learning-Natural Language Processing (NLP) Approach,” IEEE Access, vol. 11, pp. 49882–49894, 2023, doi: 10.1109/ACCESS.2023.3277868.
R. Ramadhan, P. H. Gunawan, and N. Aquarini, “Web-Based Sentiment Analysis Application of Hotel Reviews in Indonesia,” 2022 2nd International Conference on Intelligent Cybernetics Technology and Applications, ICICyTA 2022. pp. 239–244, 2022. doi: 10.1109/ICICyTA57421.2022.10037946.
S. Mehta and R. Kumar, “Application of a Hybrid CNN-Random Forest Model in the Early Detection and Classification of Parkinson’s Disease,” 2nd International Conference on Sustainable Computing and Smart Systems, ICSCSS 2024 - Proceedings. pp. 1203–1208, 2024. doi: 10.1109/ICSCSS60660.2024.10625419.
A. A. Pambudi and K. M. Lhaksmana, “Identifying Difficult Quran Verses Using SVM, LSTM, and CNN,” 2024 International Conference on Artificial Intelligence, Blockchain, Cloud Computing, and Data Analytics, ICoABCD 2024. pp. 43–48, 2024. doi: 10.1109/ICoABCD63526.2024.10704332.
D. K. Kim and Y. K. Chung, “Addressing Class Imbalances in Software Defect Detection,” J. Comput. Inf. Syst., vol. 64, no. 2, pp. 219–231, 2024, doi: 10.1080/08874417.2023.2187483.
X. Chen, L. Gupta, and S. Tragoudas, “Improving the Forecasting and Classification of Extreme Events in Imbalanced Time Series Through Block Resampling in the Joint Predictor-Forecast Space,” IEEE Access, vol. 10, pp. 121048–121079, 2022, doi: 10.1109/ACCESS.2022.3219832.
A. Ajeesh and T. Mathew, “Enhancing Network Security: A Comparative Analysis of Deep Learning and Machine Learning Models for Intrusion Detection,” International Conference on E-Mobility, Power Control and Smart Systems: Futuristic Technologies for Sustainable Solutions, ICEMPS 2024. 2024. doi: 10.1109/ICEMPS60684.2024.10559350.
Bila bermanfaat silahkan share artikel ini
Berikan Komentar Anda terhadap artikel A Hybrid CNN-LSTM Model with SMOTE for Enhanced Sentiment Analysis of Hotel Reviews
Pages: 1363-1373
Copyright (c) 2024 Yerik Afrianto Singgalen

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).