Opinion Mining on TikTok Using Bidirectional Long Short-Term Memory for Enhanced Sentiment Analysis and Trend Prediction
Abstract
The widespread use of TikTok has generated a large volume of user reviews, offering a rich dataset for sentiment analysis. This study classifies TikTok reviews from the Google Play Store into positive, negative, and neutral categories, a difficult task because the text is informal and unstructured. The goal is a reliable deep learning sentiment analysis model that captures user perceptions and can inform platform improvements and marketing strategies. We collected 10,000 reviews via web scraping and preprocessed them through text cleaning, normalization, tokenization, filtering, and stemming. Sentiment labels were assigned automatically using a lexicon-based approach, which showed that the reviews were predominantly positive. Word2Vec transformed the text into numerical vectors for feature extraction. The Bidirectional Long Short-Term Memory (Bi-LSTM) model, composed of Embedding, Bidirectional LSTM, Dropout, and Dense layers, achieved 80% accuracy and an F1-score of 0.78 with a 90:10 train-test split. The model was effective for positive and negative sentiments but detected neutral expressions less reliably, with lower recall for that class. Compared with traditional methods such as Naive Bayes, Support Vector Machine, and K-Nearest Neighbors, Bi-LSTM offered higher accuracy and better handling of linguistic variability, making it valuable for analyzing social media feedback.
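The lexicon-based labeling and the 90:10 split mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the authors' code: the toy `LEXICON`, the sample reviews, and the helper names `label_review` and `train_test_split` are hypothetical, and the abstract does not say which lexicon or scoring thresholds the study used.

```python
import random

# Hypothetical toy lexicon; the study's actual lexicon is not given in the abstract.
LEXICON = {"love": 1, "great": 1, "fun": 1, "bad": -1, "crash": -1, "hate": -1}


def label_review(tokens):
    """Sum the polarity of known tokens and map the total to a class label."""
    score = sum(LEXICON.get(tok, 0) for tok in tokens)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


def train_test_split(items, test_ratio=0.1, seed=42):
    """Shuffle a copy of the items and hold out test_ratio of them for testing."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    n_test = max(1, round(len(shuffled) * test_ratio))
    return shuffled[n_test:], shuffled[:n_test]


# Already tokenized and cleaned sample reviews (illustrative only).
reviews = [
    ["love", "this", "app"],
    ["app", "crash", "every", "time"],
    ["video", "app"],
    ["great", "fun", "but", "bad", "ads"],
]
labels = [label_review(r) for r in reviews]

# A 90:10 split over 10,000 review indices, matching the dataset size in the abstract.
train_ids, test_ids = train_test_split(range(10000))
```

In this sketch a review with no lexicon hits, or with balanced positive and negative hits, falls into the neutral class, which is one plausible reason neutral examples are harder for a downstream classifier to learn.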
Pages: 1234-1241
Copyright (c) 2025 Wafiq Muharnisa Haspin, Junadhi Junadhi, Susanti Susanti, Helda Yenni

This work is licensed under a Creative Commons Attribution 4.0 International License.