Sentiment Classification of S.E.A Aquarium Singapore Reviews through CRISP-DM using DT and SVM with SMOTE
Abstract
In recent years, sentiment analysis has emerged as a critical area of research due to its wide-ranging applications in understanding public opinion, customer feedback, and social media sentiment. However, one of the significant challenges faced in sentiment analysis is the handling of imbalanced datasets, where the distribution of sentiment classes is uneven, leading to biased model performance. This study employs the Cross-Industry Standard Process for Data Mining (CRISP-DM) methodology to investigate sentiment analysis algorithms, mainly focusing on the Support Vector Machine (SVM) algorithm and the integration of the Synthetic Minority Over-sampling Technique (SMOTE). Through systematic experimentation and evaluation, the research demonstrates the superior performance of the SVM-SMOTE model in handling imbalanced datasets, achieving an accuracy of 98.46%, an AUC of 1.000, precision of 100.00%, recall of 96.91%, and an impressive F-measure of 98.42%. Additionally, the evaluation unveils specific toxicity scores across various categories, with Toxicity scoring at 0.11036 and 0.93915, Severe Toxicity at 0.00905 and 0.45895, Identity Attack at 0.02415 and 0.66373, Insult at 0.05149 and 0.85793, Profanity at 0.06392 and 0.93426, and Threat at 0.01562 and 0.51957. These numerical indicators provide quantitative insights into potential harm within analyzed content, emphasizing the efficacy of the SVM-SMOTE model in real-world applications and contributing to the advancement of sentiment analysis within the CRISP-DM framework.
Downloads
References
L. Chang, X. Huang, and M. Meng, “Study on tourist’s loyalty of Zhinan Village in the view of tourism landscape,” Cogent Soc Sci, vol. 7, no. 1, 2021, doi: 10.1080/23311886.2021.1997403.
B. Hatipoglu, B. Ertuna, and D. Salman, “Small-sized tourism projects in rural areas: the compounding effects on societal wellbeing,” Journal of Sustainable Tourism, vol. 30, no. 9, pp. 2121–2143, 2022, doi: 10.1080/09669582.2020.1784909.
J. Frost and W. Frost, “Exploring prosocial and environmental motivations of frontier tourists: implications for sustainable space tourism,” Journal of Sustainable Tourism, vol. 30, no. 9, pp. 2254–2270, 2022, doi: 10.1080/09669582.2021.1897131.
J. H. Wang, H. Feng, and Y. Wu, “Exploring key factors of medical tourism and its relation with tourism attraction and re-visit intention,” Cogent Soc Sci, vol. 6, no. 1, 2020, doi: 10.1080/23311886.2020.1746108.
R. Zengeya, P. W. Mamimine, and M. C. Mwando, “Diaspora based tourism marketing conceptual paper: A conceptual analysis of the potential of harnessing the diaspora to improve tourism traffic in Zimbabwe,” Cogent Soc Sci, vol. 9, no. 1, 2023, doi: 10.1080/23311886.2023.2164994.
M. H. Dewantara, S. Gardiner, and X. Jin, “Travel vlog ecosystem in tourism digital marketing evolution: a narrative literature review,” Current Issues in Tourism, vol. 26, no. 19, pp. 3125–3139, 2023, doi: 10.1080/13683500.2022.2136568.
X. Chi and H. Han, “Emerging rural tourism in China’s current tourism industry and tourist behaviors: the case of Anji County,” Journal of Travel and Tourism Marketing, vol. 38, no. 1, pp. 58–74, 2021, doi: 10.1080/10548408.2020.1862026.
H. Liu, X. Chen, and X. Liu, “A Study of the Application of Weight Distributing Method Combining Sentiment Dictionary and TF-IDF for Text Sentiment Analysis,” IEEE Access, vol. 10, pp. 32280–32289, 2022, doi: 10.1109/ACCESS.2022.3160172.
S. Gulati, “Tapping public sentiments on Twitter for tourism insights: a study of famous Indian heritage sites,” International Hospitality Review, vol. 36, no. 2, pp. 244–257, Jan. 2022, doi: 10.1108/ihr-03-2021-0021.
U. Kattiyapornpong, M. Ditta-Apichai, and C. Chuntamara, “Exploring gastronomic tourism experiences through online platforms: evidence from Thai local communities,” Tourism Recreation Research, vol. 47, no. 3, pp. 241–257, 2022, doi: 10.1080/02508281.2021.1963920.
T. B. Walker, T. J. Lee, and X. Li, “Sustainable development for small island tourism: developing slow tourism in the Caribbean,” Journal of Travel and Tourism Marketing, vol. 38, no. 1, pp. 1–15, 2021, doi: 10.1080/10548408.2020.1842289.
A. Boumhidi, A. Benlahbib, and E. H. Nfaoui, “Cross-Platform Reputation Generation System Based on Aspect-Based Sentiment Analysis,” IEEE Access, vol. 10, pp. 2515–2531, 2022, doi: 10.1109/ACCESS.2021.3139956.
R. Obiedat, D. Al-Darras, E. Alzaghoul, and O. Harfoushi, “Arabic Aspect-Based Sentiment Analysis: A Systematic Literature Review,” IEEE Access, vol. 9, pp. 152628–152645, 2021, doi: 10.1109/ACCESS.2021.3127140.
Y. Yu, D. T. Dinh, B. H. Nguyen, F. Yu, and V. N. Huynh, “Mining Insights From Esports Game Reviews With an Aspect-Based Sentiment Analysis Framework,” IEEE Access, vol. 11, no. June, pp. 61161–61172, 2023, doi: 10.1109/ACCESS.2023.3285864.
C. Hehir, C. Scarles, K. J. Wyles, and J. Kantenbacher, “Last chance for wildlife: making tourism count for conservation,” Journal of Sustainable Tourism, vol. 31, no. 5, pp. 1271–1291, 2023, doi: 10.1080/09669582.2022.2049804.
K. Çakar and F. Seyitoğlu, “Motivations and experiences of tourists visiting Hasankeyf as a last chance tourism destination,” Journal of Ecotourism, vol. 22, no. 2, pp. 237–259, 2023, doi: 10.1080/14724049.2021.1965151.
J. P. Valencia, C. T. Cerio, and R. R. Biares, “Tourists’ motives and activity preferences to farm tourism sites in the Philippines: application of push and pull theory,” Cogent Soc Sci, vol. 8, no. 1, 2022, doi: 10.1080/23311886.2022.2104706.
C. Wang, J. Liu, L. Wei, and T. Zhang, “Impact of tourist experience on memorability and authenticity: a study of creative tourism,” Journal of Travel and Tourism Marketing, vol. 37, no. 1, pp. 48–63, 2020, doi: 10.1080/10548408.2020.1711846.
N. S. Subawa, E. A. Mimaki, C. A. Mimaki, E. Baykal, and M. S. M. Utami, “Exploring the hidden potential of Bali’s wellness tourism: Which factors encourage tourists to visit?,” Cogent Soc Sci, vol. 9, no. 2, 2023, doi: 10.1080/23311886.2023.2269722.
Y. Gao, W. Su, and L. Zang, “Does Regional Tourism Benefit from the Official Quality Rating of Tourist Attractions? Evidence from China’s Top-grade Tourist Attraction Accreditations,” Journal of China Tourism Research, vol. 18, no. 2, pp. 268–293, 2022, doi: 10.1080/19388160.2020.1822975.
M. Xiaolong, Z. Litian, Y. Lu, and W. Rong, “Tourist ethnocentrism and tourism intentions during a political crisis,” Journal of Tourism and Cultural Change, vol. 21, no. 1, pp. 71–93, 2023, doi: 10.1080/14766825.2022.2064224.
J. Park, “Framework for sentiment-driven evaluation of customer satisfaction with cosmetics brands,” IEEE Access, vol. 8, pp. 98526–98538, 2020, doi: 10.1109/ACCESS.2020.2997522.
R. Perangin-Angin, R. Tavakoli, and C. Kusumo, “Inclusive tourism: the experiences and expectations of Indonesian wheelchair tourists in nature tourism,” Tourism Recreation Research, vol. 48, no. 6, pp. 955–968, 2023, doi: 10.1080/02508281.2023.2221092.
A. Toivonen, “Sustainability dimensions in space tourism: the case of Finland,” Journal of Sustainable Tourism, vol. 30, no. 9, pp. 2223–2239, 2022, doi: 10.1080/09669582.2020.1783276.
J. Kennell and R. Powell, “Dark tourism and World Heritage Sites: a Delphi study of stakeholder perceptions of the development of dark tourism products,” Journal of Heritage Tourism, vol. 16, no. 4, pp. 1–15, 2020, doi: 10.1080/1743873X.2020.1782924.
T. Ngo and T. Pham, “Indigenous residents, tourism knowledge exchange and situated perceptions of tourism,” Journal of Sustainable Tourism, vol. 31, no. 2, pp. 597–614, 2023, doi: 10.1080/09669582.2021.1920967.
K. Koens et al., “Serious gaming to stimulate participatory urban tourism planning,” Journal of Sustainable Tourism, vol. 30, no. 9, pp. 2167–2186, 2022, doi: 10.1080/09669582.2020.1819301.
M. R. A. Mollah, G. Cuskelly, and B. Hill, “Sport tourism collaboration: a systematic quantitative literature review,” Journal of Sport and Tourism, vol. 25, no. 1, pp. 3–25, 2021, doi: 10.1080/14775085.2021.1877563.
I. Volić, “Tourism Policy Values in Serbia—From Equity to Competition,” Tourism Planning and Development, vol. 20, no. 5, pp. 901–918, 2023, doi: 10.1080/21568316.2022.2045346.
B. Koerner, W. Sushartami, and D. M. Spencer, “An assessment of tourism policies and planning in Indonesia,” Tourism Recreation Research, vol. 0, no. 0, pp. 1–12, 2023, doi: 10.1080/02508281.2023.2214030.
T. Lin and I. Joe, “An Adaptive Masked Attention Mechanism to Act on the Local Text in a Global Context for Aspect-Based Sentiment Analysis,” IEEE Access, vol. 11, no. May, pp. 43055–43066, 2023, doi: 10.1109/ACCESS.2023.3270927.
E. Sthapit, P. Björk, and D. N. Coudounaris, “Memorable nature-based tourism experience, place attachment and tourists’ environmentally responsible behaviour,” Journal of Ecotourism, vol. 22, no. 4, pp. 542–565, 2023, doi: 10.1080/14724049.2022.2091581.
X. Font, A. Torres-Delgado, G. Crabolu, J. Palomo Martinez, J. Kantenbacher, and G. Miller, “The impact of sustainable tourism indicators on destination competitiveness: the European Tourism Indicator System,” Journal of Sustainable Tourism, vol. 31, no. 7, pp. 1608–1630, 2023, doi: 10.1080/09669582.2021.1910281.
S. Chen and J. M. Luo, “Assessing barriers to the development of convention tourism in Macau,” Cogent Soc Sci, vol. 7, no. 1, 2021, doi: 10.1080/23311886.2021.1928978.
G. Qiao, J. Xu, L. Ding, and Q. Chen, “The impact of volunteer interaction on the tourism experience of people with visual impairment based on a mixed approach,” Current Issues in Tourism, vol. 26, no. 17, pp. 2794–2811, 2023, doi: 10.1080/13683500.2022.2098093.
Bila bermanfaat silahkan share artikel ini
Berikan Komentar Anda terhadap artikel Sentiment Classification of S.E.A Aquarium Singapore Reviews through CRISP-DM using DT and SVM with SMOTE
Pages: 595−606
Copyright (c) 2023 Yerik Afrianto Singgalen

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).