Comprehensive Analysis of Sentiment Classification and Toxicity Assessment in Cultural Documentary Videos
Abstract
This research explores sentiment classification and toxicity assessment in cultural documentary videos through a systematic analysis framework based on the Cross-Industry Standard Process for Data Mining (CRISP-DM). The study evaluates the sentiment polarity of viewer comments by utilizing a diverse array of machine-learning algorithms, including k-NN, DT, NBC, and SVM. It identifies toxic language patterns across multiple videos. Additionally, the research employs SMOTE to address class imbalance issues and enhance model performance. The results reveal high accuracy rates ranging from 72.24% to 96.79% in sentiment classification, indicating the effectiveness of the proposed methodology. Moreover, toxicity analysis unveils varying degrees of toxic language prevalence, with toxicity scores ranging from 0.01270 to 0.09334 across different videos. Despite these achievements, the study acknowledges the inherent limitations of toxicity scoring algorithms in capturing contextual nuances. Overall, this research contributes to understanding sentiment dynamics and toxicity trends in cultural documentary content and underscores the importance of employing advanced machine learning techniques within a structured analytical framework for insightful data interpretation and decision-making.
Downloads
References
M. Hawkins and M. Hawkins, “Antigone in the London office: documentary film, creativity and female agency,” Cult. Stud., vol. 36, no. 5, pp. 856–873, 2022, doi: 10.1080/09502386.2021.2011930.
J. Chambers, “Mysterious objects: exploring imaginary community, community imagination and cinematic translations of Scottish oral traditions within documentary film production and post-production,” Media Pract. Educ., vol. 23, no. 4, pp. 315–328, 2022, doi: 10.1080/25741136.2022.2111627.
R. J. Bramley, “‘A Community Legacy on Film’: using collaborative documentary filmmaking to go beyond representations of the Windrush Generation as ‘victims,’” Stud. Doc. Film, vol. 17, no. 2, pp. 115–132, 2023, doi: 10.1080/17503280.2022.2090701.
M. Pramaggiore and P. Kerrigan, “Streaming bloody murder: documentary celebrity and Sophie Toscan Du Plantier anniversary media (SAM),” Celebr. Stud., vol. 00, no. 00, pp. 1–21, 2023, doi: 10.1080/19392397.2023.2207743.
J. Fenwick, “Urban regeneration and stakeholder dynamics in the formation, growth and maintenance of the Sheffield International Documentary Festival in the 1990s,” Hist. J. Film. Radio Telev., vol. 41, no. 4, pp. 838–863, 2021, doi: 10.1080/01439685.2021.1922035.
Y. Zhu, “China’s ‘new cultural diplomacy’ in international broadcasting: branding the nation through CGTN Documentary,” Int. J. Cult. Policy, vol. 28, no. 6, pp. 671–683, 2022, doi: 10.1080/10286632.2021.2022651.
E. Colucci, “‘Breaking the chains’: reflections on the making of an ethnographic documentary on human rights violations against people with mental illness in Indonesia,” Vis. Stud., 2023, doi: 10.1080/1472586X.2023.2274892.
A. Zemanek and L. Momesso, “Multiculturalism through a lens: migrants’ voice in Taiwanese documentaries,” Inter-Asia Cult. Stud., vol. 24, no. 3, pp. 413–430, 2023, doi: 10.1080/14649373.2023.2209426.
M. Gandy, “Film as Method in the Geohumanities,” Geohumanities, vol. 7, no. 2, pp. 605–624, 2021, doi: 10.1080/2373566X.2021.1898287.
N. Sakr and J. Steemers, “Children’s documentaries: distance and ethics in European storytelling about the wider world,” J. Child. Media, vol. 16, no. 2, pp. 288–302, 2022, doi: 10.1080/17482798.2021.1974502.
R. W. Hefner, “Islam and Covenantal Pluralism in Indonesia: A Critical Juncture Analysis,” Rev. Faith Int. Aff., vol. 18, no. 2, pp. 1–17, 2020, doi: 10.1080/15570274.2020.1753946.
J. H. Hanson, M. Schutgens, N. Baral, and N. Leader-Williams, “Assessing the potential of snow leopard tourism-related products and services in the Annapurna Conservation Area, Nepal,” Tour. Plan. Dev., vol. 20, no. 6, pp. 1182–1202, 2023, doi: 10.1080/21568316.2022.2122073.
L. Palmer, S. Barnes, T. Wagner, and A. Hanley, “Holding Tightly: Co-Mingling, Life-Flourishing and Filmic Ecologies,” J. Intercult. Stud., vol. 44, no. 5, pp. 697–715, 2023, doi: 10.1080/07256868.2023.2192910.
O. J. Hakola, “Ethical reflections on filming death in end-of-life documentaries,” Mortality, vol. 28, no. 3, pp. 395–410, 2023, doi: 10.1080/13576275.2021.1946025.
A. L. G. Waworuntu, Z. Alkatiri, and R. de Archellie, “Challenging the promise of decentralization: The case of marginalization of Mosalaki role in Nggela Vilage in Ende Lio, Flores,” Cogent Arts Humanit., vol. 10, no. 1, 2023, doi: 10.1080/23311983.2023.2168835.
A. Schapper, “Beyond ‘Macassans’: Speculations on layers of Austronesian contact in northern Australia,” Aust. J. Linguist., vol. 41, no. 4, pp. 434–452, 2021, doi: 10.1080/07268602.2021.2000365.
N. Sumba Nacipucha, A. Sánchez-Bayón, J. Cueva Estrada, and A. Valencia-Arias, “Social networks as a strategy to improve the visibility of scientific journals,” Cogent Soc. Sci., vol. 10, no. 1, p., 2024, doi: 10.1080/23311886.2024.2306715.
S. Rahmadani, I. Meilano, S. Susilo, D. A. Sarsito, H. Z. Abidin, and P. Supendi, “Geodetic observation of strain accumulation in the Banda Arc region,” Geomatics, Nat. Hazards Risk, vol. 13, no. 1, pp. 2579–2596, 2022, doi: 10.1080/19475705.2022.2126799.
A. Dorkas Rambu Atahau, I. Madea Sakti, A. Namilana Rambu Hutar, A. Dolfriandra Huruta, and M. S. Kim, “Financial literacy and sustainability of rural microfinance: The mediating effect of governance,” Cogent Econ. Financ., vol. 11, no. 2, 2023, doi: 10.1080/23322039.2023.2230725.
Y. Ghanggo Ate and C. El-Khaissi, “Apologizing in Kodhi,” Aust. J. Linguist., vol. 43, no. 3, pp. 258–282, 2023, doi: 10.1080/07268602.2023.2290685.
U. Supraptiningsih, H. Jubba, E. Hariyanto, and T. Rahmawati, “Inequality as a cultural construction: Women’s access to land rights in Madurese society,” Cogent Soc. Sci., vol. 9, no. 1, 2023, doi: 10.1080/23311886.2023.2194733.
S. Baleghizadeh and L. Amiri Shayesteh, “A content analysis of the cultural representations of three ESL grammar textbooks,” Cogent Educ., vol. 7, no. 1, 2020, doi: 10.1080/2331186X.2020.1844849.
E. Fino, B. Hanna-Khalil, and M. D. Griffiths, “Exploring the public’s perception of gambling addiction on Twitter during the COVID-19 pandemic: Topic modelling and sentiment analysis,” J. Addict. Dis., vol. 39, no. 4, pp. 489–503, 2021, doi: 10.1080/10550887.2021.1897064.
L. Nemes and A. Kiss, “Prediction of stock values changes using sentiment analysis of stock news headlines,” J. Inf. Telecommun., vol. 5, no. 3, pp. 375–394, 2021, doi: 10.1080/24751839.2021.1874252.
R. K. Botchway, A. B. Jibril, Z. K. Oplatková, and M. Chovancová, “Deductions from a Sub-Saharan African Bank’s Tweets: A sentiment analysis approach,” Cogent Econ. Financ., vol. 8, no. 1, 2020, doi: 10.1080/23322039.2020.1776006.
T. Da Nguyen, “An approach to improve the accuracy of rating prediction for recommender systems,” Automatika, vol. 65, no. 1, pp. 58–72, 2024, doi: 10.1080/00051144.2023.2284026.
A. John and T. Latha, “Stock market prediction based on deep hybrid RNN model and sentiment analysis,” Automatika, vol. 64, no. 4, pp. 981–995, 2023, doi: 10.1080/00051144.2023.2217602.
E. R. Kovacs, L. A. Cotfas, and C. Delcea, “January 6th on Twitter: measuring social media attitudes towards the Capitol riot through unhealthy online conversation and sentiment analysis,” J. Inf. Telecommun., vol. 8, no. 1, pp. 108–129, 2024, doi: 10.1080/24751839.2023.2262067.
S. W. Ke, C. F. Tsai, and Y. J. Chen, “Managing Emotion In The Workplace: An Empirical Study With Enterprise Instant Messaging,” Appl. Artif. Intell., vol. 38, no. 1, 2024, doi: 10.1080/08839514.2023.2297518.
O. Lock and C. Pettit, “Social media as passive geo-participation in transportation planning–how effective are topic modeling & sentiment analysis in comparison with citizen surveys?,” Geo-Spatial Inf. Sci., vol. 23, no. 4, pp. 275–292, 2020, doi: 10.1080/10095020.2020.1815596.
C. J. R. Walker, M. B. Doucette, S. Rotz, D. Lewis, H. T. Neufeld, and H. Castleden, “Non-Indigenous partner perspectives on Indigenous peoples’ involvement in renewable energy: exploring reconciliation as relationships of accountability or status quo innocence?,” Qual. Res. Organ. Manag. An Int. J., vol. 16, no. 3–4, pp. 636–657, Jan. 2021, doi: 10.1108/QROM-04-2020-1916.
N. Chanza and W. Musakwa, “Revitalizing indigenous ways of maintaining food security in a changing climate: review of the evidence base from Africa,” Int. J. Clim. Chang. Strateg. Manag., vol. 14, no. 3, pp. 252–271, Jan. 2022, doi: 10.1108/IJCCSM-06-2021-0065.
L. Pham Hong, H. T. Ngo, and L. T. Pham, “Community-based tourism: Opportunities and challenges a case study in Thanh Ha pottery village, Hoi An city, Vietnam,” Cogent Soc. Sci., vol. 7, no. 1, 2021, doi: 10.1080/23311886.2021.1926100.
V. Sen and P. Walter, “Community-based ecotourism and the transformative learning of homestay hosts in Cambodia,” Tour. Recreat. Res., vol. 45, no. 3, pp. 323–336, 2020, doi: 10.1080/02508281.2019.1692171.
J. Sixto-García, A. I. Rodríguez-Vázquez, and X. López-García, “News Sharing Using Self-destructive Content in Digital Native Media from an International Perspective,” Journal. Pract., vol. 17, no. 7, pp. 1341–1356, 2023, doi: 10.1080/17512786.2021.2000883.
N. Gryllakis and M. Matsiola, “Digital audiovisual content in marketing and distributing cultural products during the COVID-19 pandemic in Greece,” Arts Mark., vol. 13, no. 1, pp. 4–19, Jan. 2023, doi: 10.1108/AAM-09-2021-0053.
A. L. Haw, “What drives political news engagement in digital spaces? Reimagining ‘echo chambers’ in a polarised and hybridised media ecology,” Commun. Res. Pract., vol. 6, no. 1, pp. 38–54, 2020, doi: 10.1080/22041451.2020.1732002.
K. Toffoletti, R. Olive, H. Thorpe, and A. Pavlidis, “Doing feminist physical cultural research in digital spaces: reflections, learnings and ways forward,” Qual. Res. Sport. Exerc. Heal., vol. 13, no. 1, pp. 11–25, 2021, doi: 10.1080/2159676X.2020.1836513.
N. A. K. Zamri, N. N. A. Mohamad Nasir, M. N. Hassim, and S. M. Ramli, “Digital hate speech and othering: The construction of hate speech from Malaysian perspectives,” Cogent Arts Humanit., vol. 10, no. 1, 2023, doi: 10.1080/23311983.2023.2229089.
J. hyun Im, “The discursive construction of East Asian identities in an era of globalization and internationalization: the linguistic landscape of East Asian departments at a U.S. university,” J. Multicult. Discourses, vol. 15, no. 1, pp. 80–103, 2020, doi: 10.1080/17447143.2020.1738441.
Bila bermanfaat silahkan share artikel ini
Berikan Komentar Anda terhadap artikel Comprehensive Analysis of Sentiment Classification and Toxicity Assessment in Cultural Documentary Videos
Pages: 535-547
Copyright (c) 2024 Yerik Afrianto Singgalen

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).






















