Sentiment Analysis on Movie Review from Rotten Tomatoes Using Modified Balanced Random Forest Method and Word2Vec
Abstract
The film industry is one of the impacts of the rapid development of technology. This causes the film industry to increase every year. In addition, technological developments also affect the public to make it easier to access various movies from various websites. With many choices of movies, people need to know the quality of various movies by knowing the reviews of these movies from other people. However, the large number of audience reviews of a movie makes it difficult for people to categorize good movies and bad movies. The solution to the problem is to perform sentiment analysis on movie reviews. In this research, the classification method used is Modified Balanced Random Forest. This method was chosen because it can overcome imbalanced data and can increase accuracy and reduce time complexity. In this research, Word2Vec is also used as feature extraction. This feature extraction was chosen because previous research explained that Word2Vec has the advantage of being able to show the contextual similarity of two words in the resulting vector. The best model produced from this research is a model built without using stemming in the preprocessing stage, using 300 dimensions in Word2Vec, and using the Modified Balanced Random Forest classification method which produces an f1-score of 84.15%.
Downloads
References
A. R. Yosafat and Y. Kurnia, “Aplikasi Prediksi Rating Film dengan Perbandingan Metode Naïve Bayes dan KNN Berbasis Website Menggunakan Framework Codeigniter,” 2019. [Online]. Available: https://jurnal.buddhidharma.ac.id/index.php/algor/index
S. A. Pratomo, S. Al Faraby, and M. D. Purbolaksono, “Analisis Sentimen Pengaruh Kombinasi Ekstraksi Fitur TF-IDF dan Lexicon Pada Ulasan Film Menggunakan Metode KNN,” eProceedings Eng., vol. 8, no. 5, pp. 10116–10126, 2021.
A. Z. Amrullah, A. Sofyan Anas, and M. A. J. Hidayat, “Analisis Sentimen Movie Review Menggunakan Naive Bayes Classifier Dengan Seleksi Fitur Chi Square,” Jurnal, vol. 2, no. 1, pp. 40–44, 2020, doi: 10.30812/bite.v2i1.804.
A. Andreyestha and A. Subekti, “Analisa Sentiment Pada Ulasan Film Dengan Optimasi Ensemble Learning,” J. Inform., vol. 7, no. 1, pp. 15–23, 2020, doi: 10.31311/ji.v7i1.6171.
D. I. Af’idah, Dairoh, S. F. Handayani, and R. W. Pratiwi, “Pengaruh Parameter Word2Vec terhadap Performa Deep Learning pada Klasifikasi Sentimen,” J. Inform. Jurunal Pengemb. IT, vol. 6, no. 3, pp. 156–161, 2021.
A. Putri, P. Wardani, and M. D. Purbolaksono, “Sentiment Analysis on Beauty Product Review Using Modified Balanced Random Forest Method and Chi-Square,” vol. 4, no. 1, pp. 1–7, 2022, doi: 10.47065/josh.v4i1.2047.
A. Fahmi Sabani, Adiwijaya, and W. Astuti, “Analisis Sentimen Review Film pada Website Rotten Tomatoes Menggunakan Metode SVM Dengan Mengimplementasikan Fitur Extraction Word2Vec,” e-Proceeding Eng., vol. 9, no. 3, p. 1800, 2022.
M. D. P. Y. Surya, S. Al Faraby, “Analisis Sentimen Terhadap Ulasan Film Menggunakan Word2Vec dan SVM,” vol. 8, no. 4, pp. 4136–4144, 2021.
M. A. A. Jihad, Adiwijaya, and W. Astuti, “Analisis sentimen terhadap ulasan film menggunakan algoritma random forest,” e-Proceeding Eng., vol. 8, no. 5, pp. 10153–10165, 2021.
F. N. Zamzami, A. Adiwijaya, and M. D. P, “Analisis Sentimen Terhadap Review Film Menggunakan Metode Modified Balanced Random Forest dan Mutual Information,” J. Media Inform. Budidarma, vol. 5, no. 2, p. 415, 2021, doi: 10.30865/mib.v5i2.2844.
R. S. Murti and S. Al-faraby, “Analisis Sentimen pada Ulasan Film Menggunakan Word2Vec dan Long Short-Term Mermory ( LSTM ) Pendahuluan Studi Terkait,” Telkom Univ., 2019.
M. B. Hamzah, “Classification of Movie Review Sentiment Analysis Using Chi-Square and Multinomial Naïve Bayes with Adaptive Boosting,” J. Adv. Inf. Syst. Technol., vol. 3, no. 1, pp. 67–74, 2021.
C. A. Putri, “Analisis Sentimen Review Film Berbahasa Inggris Dengan Pendekatan Bidirectional Encoder Representations from Transformers,” JATISI (Jurnal Tek. Inform. dan Sist. Informasi), vol. 6, no. 2, pp. 181–193, 2020, doi: 10.35957/jatisi.v6i2.206.
D. T. Hermanto, A. Setyanto, E. T. Luthfi, and U. A. Yogyakarta, “Algoritma LSTM-CNN untuk Sentimen Klasifikasi dengan Word2vec pada Media Online,” Citec J., vol. 8, pp. 64–77, 2021.
A. Salama and S. Al Faraby, “Klasifikasi Topik Ayat Al- Qur ’ an Terjemahan Berbahasa Inggris Menggunakan Metode Support Vector Machine Berbasis Vector Space Model dan Word2Vec,” vol. 6, no. 2, pp. 9133–9142, 2019.
I. G. Bagus, A. Bayu, P. Yuda, and M. Dwifebri, “Analisis Sentimen Review Film Berbahasa Inggris Menggunakan Word2Vec dan Naïve Bayes,” 2022.
S. Rizal, Adiwijaya, and M. D. Purbolaksono, “Sentiment Analysis on Movie Review from Rotten Tomatoes Using Word2Vec and Naive Bayes,” 2022 1st Int. Conf. Softw. Eng. Inf. Technol. ICoSEIT 2022, pp. 180–185, 2022, doi: 10.1109/ICoSEIT55604.2022.10030009.
A. M. D. Purbolaksono, M. I. Tantowi, A. I. Hidayat, “Perbandingan Support Vector Machine dan Modified Balanced Random Forest dalam Deteksi Pasien Penyakit Diabetes,” J. RESTI, vol. 1, no. 10, pp. 393–399, 2021.
Z. Agusta and K. Adiwijaya, “Modified balanced random forest for improving imbalanced data prediction,” Int. J. Adv. Intell. Informatics, vol. 5, pp. 58–65, 2019, doi: 10.26555/ijain.v5i1.255.
I. Prayoga and M. D. P, “Sentiment Analysis on Indonesian Movie Review Using KNN Method With the Implementation of Chi-Square Feature Selection,” J. Media Inform. Budidarma, vol. 7, pp. 369–375, 2023, doi: 10.30865/mib.v7i1.5522.
X. Deng, Q. Liu, Y. Deng, and S. Mahadevan, “An improved method to construct basic probability assignment based on the confusion matrix for classification problem,” Inf. Sci. (Ny)., 2016, doi: 10.1016/j.ins.2016.01.033.
Bila bermanfaat silahkan share artikel ini
Berikan Komentar Anda terhadap artikel Sentiment Analysis on Movie Review from Rotten Tomatoes Using Modified Balanced Random Forest Method and Word2Vec
Pages: 153−161
Copyright (c) 2023 Mohamad Rizki Nugraha, Mahendra Dwifebri Purbolaksono, Widi Astuti

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).





















