Travel Content Evaluation through Sentiment and Toxicity Analysis using CRISP-DM


  • Yerik Afrianto Singgalen (*), Atma Jaya Catholic University of Indonesia, Jakarta, Indonesia
  • (*) Corresponding Author
Keywords: CRISP-DM; Digital Content; Sentiment; Travel Content; Toxicity

Abstract

This study applies the CRISP-DM methodology to analyze sentiment and toxicity in digital content, focusing on tourism-related videos. Sentiment was scored with the VADER and TextBlob tools, and toxicity was assessed with Detoxify and the Perspective API. Of the 25,361 posts collected, 23,292 were processed for sentiment and 24,171 for toxicity. Four classification algorithms, k-nearest neighbors (k-NN), decision tree (DT), naive Bayes classifier (NBC), and support vector machine (SVM), were applied with SMOTE to address class imbalance. SVM performed best, with an accuracy of 54.80% and an F-measure of 66.01%; the other algorithms scored lower. In the deployment phase, the models were integrated for real-time analysis to provide actionable insight into user engagement. The findings highlight the influence of sentiment on brand perception and the need to manage toxic behavior for a healthier online environment. The study is limited by dataset imbalance and by its dependence on the chosen models. It recommends that content creators adopt robust moderation and sentiment-based strategies to improve user interaction. Future research should use more diverse datasets and more advanced tools to improve the robustness and applicability of the findings. Overall, the research contributes to the understanding of digital content dynamics and offers strategic guidance for content creation and user engagement.
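
The accuracy and F-measure reported above follow the standard confusion-matrix definitions. The Python sketch below is only an illustration: it assumes a binary positive/negative labeling (the abstract does not state the class setup), and the counts are hypothetical, not taken from the study. It shows how an F-measure can be higher than accuracy when the positive class dominates.

```python
def binary_metrics(tp, fp, fn, tn):
    """Compute accuracy, precision, recall and F-measure (F1) from confusion counts."""
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F-measure is the harmonic mean of precision and recall.
    denom = precision + recall
    f_measure = 2 * precision * recall / denom if denom else 0.0
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f_measure": f_measure,
    }


# Hypothetical counts for illustration only (NOT the study's results).
m = binary_metrics(tp=60, fp=40, fn=20, tn=30)
print(f"accuracy={m['accuracy']:.2%} f_measure={m['f_measure']:.2%}")
# prints: accuracy=60.00% f_measure=66.67%
```

Because the F-measure ignores true negatives, a classifier that recovers most positive posts can score a higher F-measure than accuracy, which is consistent with the pattern in the figures reported above.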



Article History
Submitted: 2024-06-23
Published: 2024-06-30
How to Cite
Singgalen, Y. (2024). Travel Content Evaluation through Sentiment and Toxicity Analysis using CRISP-DM. Building of Informatics, Technology and Science (BITS), 6(1), 365-377. https://doi.org/10.47065/bits.v6i1.5397