Comprehensive Analysis of Sentiment and Toxicity Dynamics in Tourist Vlog Reviews: A CRISP-DM Approach
Abstract
This research employs the CRISP-DM framework to analyze sentiment and toxicity dynamics in tourist vlog reviews thoroughly. The study delves into sentiment classification and toxicity identification nuances by leveraging machine learning algorithms such as k-NN, SVM, NBC, and DT with SMOTE. Utilizing a dataset comprising a substantial number of posts, the analysis reveals varying levels of accuracy across different algorithms. For instance, k-NN and SVM showcase promising accuracy rates of 85.90% and 86.27% in sentiment classification, while NBC and DT with SMOTE yield 72.52% and 71.14%, respectively. Furthermore, the research elucidates the limitations of toxicity analysis, with NBC demonstrating a precision of 64.96% and DT exhibiting lower recall rates. These findings highlight the importance of robust methodologies for understanding sentiment and toxicity dynamics in online content, particularly in tourist vlog reviews.
Downloads
References
N. S. Borchers, “Between Skepticism and Identification: A Systematic Mapping of Adolescents’ Persuasion Knowledge of Influencer Marketing,” J. Curr. Issues Res. Advert., vol. 43, no. 3, pp. 274–300, 2022, doi: 10.1080/10641734.2022.2066230.
S. Park, J. Y. Lee, T. Notley, and M. Dezuanni, “Exploring the relationship between media literacy, online interaction, and civic engagement,” Inf. Soc., vol. 39, no. 4, pp. 250–261, 2023, doi: 10.1080/01972243.2023.2211055.
C. S. Ritter, “Gazing from the air: tourist encounters in the age of travel drones,” Tour. Geogr., vol. 0, no. 0, pp. 1–17, 2023, doi: 10.1080/14616688.2023.2264823.
S. Dvir-Gvirsman, “The Meaning of Like: How Social-Media Editors and Users Make Sense of Social Media Engagement,” Journal. Pract., pp. 1–18, 2023, doi: 10.1080/17512786.2023.2228782.
S. Or, N. Meir, D. Ron, O. Livio, Y. Tsfati, and N. Tal-Or, “The Impact of Testimony Journalism on Audience Engagement: An Experimental Investigation of the Effects of Point of View,” Journal. Stud., vol. 24, no. 5, pp. 573–593, 2023, doi: 10.1080/1461670X.2023.2173957.
L. Campbell, “Technoparticipation,” Perform. Res., vol. 26, no. 5, pp. 134–136, 2021, doi: 10.1080/13528165.2021.2029142.
A. Errmann, “Mindful Immersion: Curating Awe-Inducing Experiences to Increase Brand Salience,” J. Advert., vol. 0, no. 0, pp. 1–15, 2024, doi: 10.1080/00913367.2024.2325144.
N. Kyegombe et al., “A qualitative exploration of the salience of MTV-Shuga, an edutainment programme, and adolescents’ engagement with sexual and reproductive health information in rural KwaZulu-Natal, South Africa,” Sex. Reprod. Heal. Matters, vol. 30, no. 1, 2022, doi: 10.1080/26410397.2022.2083809.
S. Dickinson, “Watching the disaster unfold: geographies of engagement with live-streamed extreme weather,” Environ. Hazards, pp. 1–19, 2024, doi: 10.1080/17477891.2024.2324058.
F. Freitas, K. E. Leedham-Green, S. F. Smith, and M. J. Costa, “Partners in academic endeavour: Characterising student engagement across internationally excellent medical schools,” Med. Teach., vol. 45, no. 8, pp. 830–837, 2023, doi: 10.1080/0142159X.2023.2174418.
H. Jodén and J. Strandell, “Building viewer engagement through interaction rituals on Twitch.tv,” Inf. Commun. Soc., vol. 25, no. 13, pp. 1969–1986, 2022, doi: 10.1080/1369118X.2021.1913211.
M. Grizzard and A. Eden, “The Character Engagement and Moral Adjustment Model (CEMAM): A Synthesis of More than Six Decades of Research,” J. Broadcast. Electron. Media, vol. 66, no. 4, pp. 698–722, 2022, doi: 10.1080/08838151.2022.2146116.
L. A. Salah, S. Altalhab, A. Omair, and M. Aljasser, “Accuracy and Quality of YouTube Videos as a Source of Information on Vitiligo,” Clin. Cosmet. Investig. Dermatol., vol. 15, pp. 21–25, 2022, doi: 10.2147/CCID.S330015.
R. Iloranta and R. Komppula, “Service providers’ perspective on the luxury tourist experience as a product,” Scand. J. Hosp. Tour., vol. 22, no. 1, pp. 39–57, 2022, doi: 10.1080/15022250.2021.1946845.
J. Li, X. Peng, X. Liu, H. Tang, and W. Li, “A study on shaping tourists’ conservational intentions towards cultural heritage in the digital era: exploring the effects of authenticity, cultural experience, and place attachment,” J. Asian Archit. Build. Eng., vol. 00, no. 00, pp. 1–20, 2024, doi: 10.1080/13467581.2024.2321999.
I. Sörensen, D. Vogler, S. Fürst, and M. S. Schäfer, “Platforms matter: analyzing user engagement with social media content of Swiss higher education institutions,” J. Mark. High. Educ., pp. 1–20, 2023, doi: 10.1080/08841241.2023.2289009.
S. Ding, R. Zhang, Y. Liu, P. Lu, and M. Liu, “Visitor crowding at World Heritage Sites based on tourist spatial-temporal distribution: a case study of the Master-of-Nets Garden, China,” J. Herit. Tour., vol. 18, no. 5, pp. 632–657, 2023, doi: 10.1080/1743873X.2023.2214680.
S. M. Braimah, E. N. A. Solomon, and R. E. Hinson, “Tourists satisfaction in destination selection determinants and revisit intentions; perspectives from Ghana,” Cogent Soc. Sci., vol. 10, no. 1, p., 2024, doi: 10.1080/23311886.2024.2318864.
B. Pikkemaat, B. F. Bichler, and M. Peters, “Exploring the crowding-satisfaction relationship of skiers: the role of social behavior and experiences,” J. Travel Tour. Mark., vol. 37, no. 8–9, pp. 902–916, 2020, doi: 10.1080/10548408.2020.1763229.
S. Seyfi, S. Kuhzady, R. Rastegar, T. Vo-Thanh, and M. Zaman, “Exploring the dynamics of tourist travel intention before and during the COVID-19 pandemic: a scoping review,” Tour. Recreat. Res., vol. 0, no. 0, pp. 1–14, 2024, doi: 10.1080/02508281.2024.2330236.
S. Steensen, R. Ferrer-Conill, and C. Peters, “(Against a) Theory of Audience Engagement with News,” Journal. Stud., vol. 20, pp. 1662–1680, 2020, doi: 10.1080/1461670X.2020.1788414.
C. Leston-Bandeira and S. T. Siefken, “The development of public engagement as a core institutional role for parliaments,” J. Legis. Stud., vol. 29, no. 3, pp. 361–379, 2023, doi: 10.1080/13572334.2023.2214390.
K. K. Aldous, J. An, and B. J. Jansen, “What really matters?: characterising and predicting user engagement of news postings using multiple platforms, sentiments and topics,” Behav. Inf. Technol., vol. 42, no. 5, pp. 545–568, 2023, doi: 10.1080/0144929X.2022.2030798.
S. Zakiah, A. Winarno, and D. Hermana, “Examination of consumer engagement for loyalty in sustainable destination image,” Cogent Soc. Sci., vol. 9, no. 2, 2023, doi: 10.1080/23311886.2023.2269680.
J. Lehtinen and K. Aaltonen, “Community stakeholders’ online engagement in infrastructure projects: a theory-testing single-case study,” Constr. Manag. Econ., vol. 0, no. 0, pp. 1–29, 2024, doi: 10.1080/01446193.2024.2326558.
S. Zhang, M. D. T. de Jong, and J. F. Gosselt, “Microblogging for Engagement: Effects of Prior Company Involvement, Communication Strategy, and Emojis on Western and Chinese Users,” J. Int. Consum. Mark., vol. 34, no. 5, pp. 616–630, 2022, doi: 10.1080/08961530.2022.2040073.
S. H. Laaksonen and P. Varga, “Assessing the impact of selfie-taking tourists on local tour guides in the Chernobyl Exclusion Zone: a netnographic analysis of a dark tourism location,” J. Herit. Tour., pp. 1–16, 2023, doi: 10.1080/1743873X.2023.2292147.
W. B. Kelly and L. M. Given, “The community engagement for impact (CEFI) framework: an evidence-based strategy to facilitate social change,” Stud. High. Educ., vol. 49, no. 3, pp. 441–459, 2024, doi: 10.1080/03075079.2023.2238762.
I. Z. P. Hamdan and M. Othman, “Predicting Customer Loyalty Using Machine Learning for Hotel Industry,” J. Soft Comput. Data Min., vol. 3, no. 2, pp. 31–42, 2022.
J. A. Syahid and D. Mahdiana, “Perbandingan algoritma untuk klasifikasi analisis sentimen terhadap Genose pada media sosial Twitter,” semanTIK, vol. 7, no. 1, pp. 9–16, 2021, doi: 10.5281/zenodo.5034916.
H. Xu, L. T. O. Cheung, J. Lovett, X. Duan, Q. Pei, and D. Liang, “Understanding the influence of user-generated content on tourist loyalty behavior in a cultural World Heritage Site,” Tour. Recreat. Res., vol. 48, no. 2, pp. 173–187, 2023, doi: 10.1080/02508281.2021.1913022.
H. N. Io and C. B. Lee, “Social Media Comments about Hotel Robots,” J. China Tour. Res., vol. 16, no. 4, pp. 606–625, 2020, doi: 10.1080/19388160.2020.1769785.
S. Lin and W. Wei, “Social annotations and second language viewers’ engagement with multimedia learning resources in LMOOCs: a self-determination theory perspective,” Cogent Educ., vol. 11, no. 1, p., 2024, doi: 10.1080/2331186X.2024.2335715.
J. Lind and J. Lindström, “Towards a framework for exploring indirect value of tourist attractions in place branding: the case of Tom Tits Experiment Science Center,” Scand. J. Hosp. Tour., pp. 1–23, 2023, doi: 10.1080/15022250.2023.2233486.
J. Räikkönen, M. Grénman, H. Rouhiainen, A. Honkanen, and I. E. Sääksjärvi, “Conceptualizing nature-based science tourism: a case study of Seili Island, Finland,” J. Sustain. Tour., vol. 31, no. 5, pp. 1214–1232, 2023, doi: 10.1080/09669582.2021.1948553.
A. H. Fansurya et al., “Tourist’s Length of stay: the perspective of flow experience theory,” Cogent Bus. Manag., vol. 11, no. 1, p., 2024, doi: 10.1080/23311975.2024.2310258.
P. Gaładyk and K. Podhorodecka, “Tourist attractions and the location of campsites in Western Australia,” Curr. Issues Tour., vol. 24, no. 15, pp. 2144–2166, 2021, doi: 10.1080/13683500.2020.1828308.
M. Nowak, O. Alnyme, and T. Heldt, “Testing the effectiveness of increased frequency of norm-nudges in encouraging sustainable tourist behaviour: a field experiment using actual and self-reported behavioural data,” J. Sustain. Tour., vol. 0, no. 0, pp. 1–25, 2023, doi: 10.1080/09669582.2023.2220979.
X. Xu, Z. Shi, N. A. Bos, and H. Wu, “Student engagement and learning outcomes: an empirical study applying a four-dimensional framework,” Med. Educ. Online, vol. 28, no. 1, 2023, doi: 10.1080/10872981.2023.2268347.
K. Davis and K. Southey, “Employee engagement in shared services in a regional university context,” J. High. Educ. Policy Manag., vol. 00, no. 00, pp. 1–16, 2024, doi: 10.1080/1360080X.2024.2344238.
C. Wilson, S. Sims, J. Dyer, and F. Handley, “Identifying opportunities and gaps in current evaluation frameworks–the knowns and unknowns in determining effective student engagement activity,” Assess. Eval. High. Educ., vol. 47, no. 6, pp. 843–856, 2022, doi: 10.1080/02602938.2021.1969536.
Bila bermanfaat silahkan share artikel ini
Berikan Komentar Anda terhadap artikel Comprehensive Analysis of Sentiment and Toxicity Dynamics in Tourist Vlog Reviews: A CRISP-DM Approach
Pages: 648-659
Copyright (c) 2024 Yerik Afrianto Singgalen

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).






















