Sentiment Analysis on Movie Review from Rotten Tomatoes Using Modified Balanced Random Forest Method and Word2Vec


  • Mohamad Rizki Nugraha * Mail Telkom University, Bandung, Indonesia
  • Mahendra Dwifebri Purbolaksono Telkom University, Bandung, Indonesia
  • Widi Astuti Telkom University, Bandung, Indonesia
  • (*) Corresponding Author
Keywords: Sentiment Analysis; Movie Review; Modified Balanced Random Forest; Word2vec

Abstract

The film industry is one of the impacts of the rapid development of technology. This causes the film industry to increase every year. In addition, technological developments also affect the public to make it easier to access various movies from various websites. With many choices of movies, people need to know the quality of various movies by knowing the reviews of these movies from other people. However, the large number of audience reviews of a movie makes it difficult for people to categorize good movies and bad movies. The solution to the problem is to perform sentiment analysis on movie reviews. In this research, the classification method used is Modified Balanced Random Forest. This method was chosen because it can overcome imbalanced data and can increase accuracy and reduce time complexity. In this research, Word2Vec is also used as feature extraction. This feature extraction was chosen because previous research explained that Word2Vec has the advantage of being able to show the contextual similarity of two words in the resulting vector. The best model produced from this research is a model built without using stemming in the preprocessing stage, using 300 dimensions in Word2Vec, and using the Modified Balanced Random Forest classification method which produces an f1-score of 84.15%.

Downloads

Download data is not yet available.

References

A. R. Yosafat and Y. Kurnia, “Aplikasi Prediksi Rating Film dengan Perbandingan Metode Naïve Bayes dan KNN Berbasis Website Menggunakan Framework Codeigniter,” 2019. [Online]. Available: https://jurnal.buddhidharma.ac.id/index.php/algor/index

S. A. Pratomo, S. Al Faraby, and M. D. Purbolaksono, “Analisis Sentimen Pengaruh Kombinasi Ekstraksi Fitur TF-IDF dan Lexicon Pada Ulasan Film Menggunakan Metode KNN,” eProceedings Eng., vol. 8, no. 5, pp. 10116–10126, 2021.

A. Z. Amrullah, A. Sofyan Anas, and M. A. J. Hidayat, “Analisis Sentimen Movie Review Menggunakan Naive Bayes Classifier Dengan Seleksi Fitur Chi Square,” Jurnal, vol. 2, no. 1, pp. 40–44, 2020, doi: 10.30812/bite.v2i1.804.

A. Andreyestha and A. Subekti, “Analisa Sentiment Pada Ulasan Film Dengan Optimasi Ensemble Learning,” J. Inform., vol. 7, no. 1, pp. 15–23, 2020, doi: 10.31311/ji.v7i1.6171.

D. I. Af’idah, Dairoh, S. F. Handayani, and R. W. Pratiwi, “Pengaruh Parameter Word2Vec terhadap Performa Deep Learning pada Klasifikasi Sentimen,” J. Inform. Jurunal Pengemb. IT, vol. 6, no. 3, pp. 156–161, 2021.

A. Putri, P. Wardani, and M. D. Purbolaksono, “Sentiment Analysis on Beauty Product Review Using Modified Balanced Random Forest Method and Chi-Square,” vol. 4, no. 1, pp. 1–7, 2022, doi: 10.47065/josh.v4i1.2047.

A. Fahmi Sabani, Adiwijaya, and W. Astuti, “Analisis Sentimen Review Film pada Website Rotten Tomatoes Menggunakan Metode SVM Dengan Mengimplementasikan Fitur Extraction Word2Vec,” e-Proceeding Eng., vol. 9, no. 3, p. 1800, 2022.

M. D. P. Y. Surya, S. Al Faraby, “Analisis Sentimen Terhadap Ulasan Film Menggunakan Word2Vec dan SVM,” vol. 8, no. 4, pp. 4136–4144, 2021.

M. A. A. Jihad, Adiwijaya, and W. Astuti, “Analisis sentimen terhadap ulasan film menggunakan algoritma random forest,” e-Proceeding Eng., vol. 8, no. 5, pp. 10153–10165, 2021.

F. N. Zamzami, A. Adiwijaya, and M. D. P, “Analisis Sentimen Terhadap Review Film Menggunakan Metode Modified Balanced Random Forest dan Mutual Information,” J. Media Inform. Budidarma, vol. 5, no. 2, p. 415, 2021, doi: 10.30865/mib.v5i2.2844.

R. S. Murti and S. Al-faraby, “Analisis Sentimen pada Ulasan Film Menggunakan Word2Vec dan Long Short-Term Mermory ( LSTM ) Pendahuluan Studi Terkait,” Telkom Univ., 2019.

M. B. Hamzah, “Classification of Movie Review Sentiment Analysis Using Chi-Square and Multinomial Naïve Bayes with Adaptive Boosting,” J. Adv. Inf. Syst. Technol., vol. 3, no. 1, pp. 67–74, 2021.

C. A. Putri, “Analisis Sentimen Review Film Berbahasa Inggris Dengan Pendekatan Bidirectional Encoder Representations from Transformers,” JATISI (Jurnal Tek. Inform. dan Sist. Informasi), vol. 6, no. 2, pp. 181–193, 2020, doi: 10.35957/jatisi.v6i2.206.

D. T. Hermanto, A. Setyanto, E. T. Luthfi, and U. A. Yogyakarta, “Algoritma LSTM-CNN untuk Sentimen Klasifikasi dengan Word2vec pada Media Online,” Citec J., vol. 8, pp. 64–77, 2021.

A. Salama and S. Al Faraby, “Klasifikasi Topik Ayat Al- Qur ’ an Terjemahan Berbahasa Inggris Menggunakan Metode Support Vector Machine Berbasis Vector Space Model dan Word2Vec,” vol. 6, no. 2, pp. 9133–9142, 2019.

I. G. Bagus, A. Bayu, P. Yuda, and M. Dwifebri, “Analisis Sentimen Review Film Berbahasa Inggris Menggunakan Word2Vec dan Naïve Bayes,” 2022.

S. Rizal, Adiwijaya, and M. D. Purbolaksono, “Sentiment Analysis on Movie Review from Rotten Tomatoes Using Word2Vec and Naive Bayes,” 2022 1st Int. Conf. Softw. Eng. Inf. Technol. ICoSEIT 2022, pp. 180–185, 2022, doi: 10.1109/ICoSEIT55604.2022.10030009.

A. M. D. Purbolaksono, M. I. Tantowi, A. I. Hidayat, “Perbandingan Support Vector Machine dan Modified Balanced Random Forest dalam Deteksi Pasien Penyakit Diabetes,” J. RESTI, vol. 1, no. 10, pp. 393–399, 2021.

Z. Agusta and K. Adiwijaya, “Modified balanced random forest for improving imbalanced data prediction,” Int. J. Adv. Intell. Informatics, vol. 5, pp. 58–65, 2019, doi: 10.26555/ijain.v5i1.255.

I. Prayoga and M. D. P, “Sentiment Analysis on Indonesian Movie Review Using KNN Method With the Implementation of Chi-Square Feature Selection,” J. Media Inform. Budidarma, vol. 7, pp. 369–375, 2023, doi: 10.30865/mib.v7i1.5522.

X. Deng, Q. Liu, Y. Deng, and S. Mahadevan, “An improved method to construct basic probability assignment based on the confusion matrix for classification problem,” Inf. Sci. (Ny)., 2016, doi: 10.1016/j.ins.2016.01.033.


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Sentiment Analysis on Movie Review from Rotten Tomatoes Using Modified Balanced Random Forest Method and Word2Vec

Dimensions Badge
Article History
Submitted: 2023-06-08
Published: 2023-06-29
Abstract View: 1315 times
PDF Download: 790 times
How to Cite
Nugraha, M. R., Purbolaksono, M. D., & Astuti, W. (2023). Sentiment Analysis on Movie Review from Rotten Tomatoes Using Modified Balanced Random Forest Method and Word2Vec. Building of Informatics, Technology and Science (BITS), 5(1), 153−161. https://doi.org/10.47065/bits.v5i1.3596
Issue
Section
Articles

Most read articles by the same author(s)