Sentiment Analysis on Beauty Product Review Using Modified Balanced Random Forest Method and Chi-Square
Abstract
Internet users in Indonesia have used e-commerce services to buy various products. For example, one website that provides information services about women's beauty products is Female Daily. On the website, there are reviews of beauty products. The review feature is one feature that helps users in determining which beauty products to buy. Unfortunately, many reviews will take a long time to read, and it is almost impossible for users to read all the information. Therefore, research is needed to make it easier for users to consider products such as sentiment analysis. Sentiment analysis aims to classify opinions, namely, user reviews, into positive, neutral, and negative opinions. In this study, sentiment analysis uses the Modified Balanced Random Forest(MBRF) and Chi-square method as feature selection. The best model from this study produces an average accuracy and an average f1-score of 81.75% and 71.90%, respectively.
Downloads
References
H. Rika, “88,1 Persen Pengguna Internet Belanja dengan E-Commerce,” https://www.cnnindonesia.com/, 2021. https://www.cnnindonesia.com/ekonomi/20211111123945-78-719672/881-persen-pengguna-internet-belanja-dengan-e-commerce.
Female Daily, “Female Daily,” 2022. https://femaledaily.com/.
Z. Zhang, Q. Ye, Z. Zhang, and Y. Li, “Sentiment classification of Internet restaurant reviews written in Cantonese,” Expert Syst. Appl., vol. 38, no. 6, pp. 7674–7682, 2011, doi: 10.1016/j.eswa.2010.12.147.
B. Liu, “Sentiment Analysis and Opinion Mining,” Morgan & Claypool Publishers, 2012.
H. Ardian and S. Kosasi, “Analisis Sentimen Pada Review Produk Kosmetik Bahasa Indonesia Dengan Metode Naive Bayes,” J. ENTER, vol. 2, no. 1, pp. 306–320, 2019.
Y. Hegde and S. K. Padma, “Sentiment analysis using random forest ensemble for mobile product reviews in kannada,” in Proceedings - 7th IEEE International Advanced Computing Conference, IACC 2017, Jul. 2017, pp. 777–782, doi: 10.1109/IACC.2017.0160.
D. Irvantoro, “Feature Selection Menggunakan Chi-Square Dan N-Gram Dengan Algoritma Naive Bayes Classifier Untuk Analisis Sentimen Review Produk Elektronik,” PhD Thesis. Univ. Muhammadiyah Jember, no. 1410651199, 2019.
Z. P. Agusta and Adiwijaya, “Modified balanced random forest for improving imbalanced data prediction,” Int. J. Adv. Intell. Informatics, vol. 5, no. 1, pp. 58–65, 2019, doi: 10.26555/ijain.v5i1.255.
F. N. Zamzami and M. D. P, “Analisis Sentimen Terhadap Review Film Menggunakan Metode Modified Balanced Random Forest dan Mutual Information,” vol. 5, no. April, pp. 415–421, 2021, doi: 10.30865/mib.v5i2.2835.
T. B. Rohman, D. D. Purwanto, and J. Santoso, “Sentiment Analysis Terhadap Review Rumah Makan di Surabaya Memanfaatkan Algoritma Random Forest,” Pros. Semin. Nas. Teknol. Inf. Apl., pp. 7–11, 2018.
D. B. Satmoko, P. Sukarno, and E. M. Jadied, “Peningkatan Akurasi Pendeteksian Serangan DDoS Menggunakan Multiclassifier Ensemble Learning dan Chi-Square Pendahuluan Studi Terkait,” vol. 5, no. 3, pp. 7977–7985, 2018.
C. F. Suharno, M. A. Fauzi, and R. S. Perdana, “Klasifikasi Teks Bahasa Indonesia Pada Dokumen Pengaduan Sambat Online Menggunakan Metode K-Nearest Neighbors Dan Chi-square,” Syst. Inf. Syst. Informatics J., vol. 3, no. 1, pp. 25–32, 2017, doi: 10.29080/systemic.v3i1.191.
M. A. A. Jihad, Adiwijaya, and W. Astuti, “Analisis sentimen terhadap ulasan film menggunakan algoritma random forest,” e-Proceeding Eng., vol. 8, no. 5, pp. 10153–10165, 2021.
J. A. Septian, T. M. Fahrudin, and A. Nugroho, “Analisis Sentimen Pengguna Twitter Terhadap Polemik Persepakbolaan Indonesia Menggunakan Pembobotan TF - IDF dan K - Nearest Neighbor,” J. Intell. Syst. Comput., no. September, pp. 43–49, 2019.
N. D. Pratama, Y. A. Sari, and P. P. Adikara, “Analisis Sentimen Pada Review Konsumen Menggunakan Metode Naive Bayes Dengan Seleksi Fitur Chi Square Untuk Rekomendasi Lokasi Makanan Tradisional,” J. Pengemb. Teknol. Inf. dan Ilmu Komput. Univ. Brawijaya, vol. 2, no. 9, pp. 2982–2988, 2018.
A. K. Santra and C. J. Christy, “Genetic Algorithm and Confusion Matrix for Document Clustering,” Int. J. Comput. Sci. Issues, vol. 9, no. 1, pp. 322–328, 2012.
M. Awad and R. Khanna, Efficient learning machines: Theories, concepts, and applications for engineers and system designers, no. April. 2015.
Bila bermanfaat silahkan share artikel ini
Berikan Komentar Anda terhadap artikel Sentiment Analysis on Beauty Product Review Using Modified Balanced Random Forest Method and Chi-Square
Pages: 1-7
Copyright (c) 2022 Antika Putri Permata Wardani, Adiwijaya Adiwijaya, Mahendra Dwifebri Purbolaksono

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).






















