Optimalisasi Random Forest dan Support Vector Machine dengan Hyperparameter GridSearchCV untuk Analisis Sentimen Ulasan PrimaKu


  • Titik Misriati Universitas Bina Sarana Informatika, Jakarta, Indonesia
  • Riska Aryanti * Mail Universitas Bina Sarana Informatika, Jakarta, Indonesia
  • (*) Corresponding Author
Keywords: Random Forest; Support Vector Machine; Gridsearchcv; Hyperparameter; Sentiment Analysis

Abstract

PrimaKu App has been a pioneer in the field of digital health since 2017. Through this application, parents can regularly and continuously monitor their children’s health and development. PrimaKu also has a formal alliance with the Indonesian Pediatric Association (IDAI) to promote child health in Indonesia. This application can be downloaded through the Google Play Store. Google Play Store has a feature that allows users to review the app before downloading. Sentiment analysis is used to distinguish between positive and negative reviews by users who have provided reviews so that an evaluation of the services provided can be made. This research aims to conduct sentiment analysis of user reviews of the PrimaKu application using Random Forest (RF) and Support Vector Machine (SVM) algorithms with TF-IDF weighting. Optimization was performed using hyperparameters to improve the performance of the Random Forest and SVM algorithms. The data used consisted of the 2,293 most relevant reviews collected from the Google Play Store. The most effective models for the Random Forest and Support Vector Machine were selected by adjusting the hyperparameters using GridSearch CV. The results of this study show that Random Forest has a higher success rate in classifying PrimaKu user review data, with an accuracy of 89%, precision of 88%, recall of 81%, and F1-Score of 85%.

Downloads

Download data is not yet available.

References

V. K, P. Samuel, B. V. Krishna, and M. J, “Exploration of sentiment analysis in twitter propaganda: a deep dive,” Multimed Tools Appl, vol. 83, no. 15, pp. 44729–44751, 2024, doi: 10.1007/s11042-023-17383-6.

F. Rejeki and V. Ayumi, “Analisa Sentimen Mengenai Kenaikan Harga Bbm Menggunakan Metode Naïve Bayes Dan Support Vector Machine,” JSAI : Journal Scientific and Applied Informatics, vol. 6, no. 1, 2023, doi: 10.36085.

F. A. Larasati, D. E. Ratnawati, and B. T. Hanggara, “Analisis Sentimen Ulasan Aplikasi Dana dengan Metode Random Forest,” Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer, vol. 6, no. 9, pp. 4305–4313, 2022, [Online]. Available: http://j-ptiik.ub.ac.id

M. Tabany and M. Gueffal, “Sentiment Analysis and Fake Amazon Reviews Classification Using SVM Supervised Machine Learning Model,” Journal of Advances in Information Technology, vol. 15, no. 1, pp. 49–58, 2024, doi: 10.12720/jait.15.1.49-58.

T. Tinaliah and T. Elizabeth, “Analisis Sentimen Ulasan Aplikasi PrimaKu Menggunakan Metode Support Vector Machine,” JATISI (Jurnal Teknik Informatika dan Sistem Informasi), vol. 9, no. 4, 2022, doi: https://doi.org/10.35957/jatisi.v9i4.3586.

R. Aryanti, T. Misriati, and A. Sagiyanto, “Analisis Sentimen Aplikasi Primaku Menggunakan Algoritma Random Forest dan SMOTE untuk Mengatasi Ketidakseimbangan Data,” Journal of Computer System and Informatics (JoSYC), vol. 5, no. 1, pp. 218–227, Nov. 2023, doi: 10.47065/josyc.v5i1.4562.

Y. A. Ali, E. M. Awwad, M. Al-Razgan, and A. Maarouf, “Hyperparameter Search for Machine Learning Algorithms for Optimizing the Computational Complexity,” Processes, vol. 11, no. 349, pp. 1–21, Feb. 2023, doi: 10.3390/pr11020349.

I. Jamaleddyn, R. El Ayachi, and B. Mohamed, “Optimization of Machine Learning Algorithms Through Hyper-parameter Tuning Applied for the classiication of Arabic news,” Res Sq, 2023, doi: 10.21203/rs.3.rs-2615380/v1.

J. F. Lundell, “Tuning Support Vector Machines and Boosted Trees Using Optimization Algorithms,” Journal of Data Science, pp. 1–16, 2023, doi: 10.6339/23-jds1106.

Md. M. Rahman, A. Rahman, S. Akter, and S. A. Pinky, “Hyperparameter Tuning Based Machine Learning Classifier for Breast Cancer Prediction,” Journal of Computer and Communications, vol. 11, no. 04, pp. 149–165, 2023, doi: 10.4236/jcc.2023.114007.

G. N. Ahmad, H. Fatima, Shafiullah, A. Salah Saidi, and Imdadullah, “Efficient Medical Diagnosis of Human Heart Diseases Using Machine Learning Techniques with and Without GridSearchCV,” IEEE Access, vol. 10, pp. 80151–80173, 2022, doi: 10.1109/ACCESS.2022.3165792.

S. George and B. Sumathi, “Grid Search Tuning of Hyperparameters in Random Forest Classifier for Customer Feedback Sentiment Prediction,” 2020. [Online]. Available: www.ijacsa.thesai.org

K. Rangsiman, C. Fabrizio, and L. Sandra, “Machine Learning-Assisted Discovery of Hidden States in Expanded Free Energy Space,” Journal of Physical Chemistry Letters, vol. 13, no. 7, pp. 1797–1805, Feb. 2022, doi: 10.1021/acs.jpclett.1c04004.

S. Rasheed, G. K. Kumar, D. M. Rani, M. V. V. P. Kantipudi, and M. Anila, “Heart Disease Prediction Using GridSearchCV and Random Forest,” EAI Endorsed Trans Pervasive Health Technol, vol. 10, Jan. 2024, doi: 10.4108/eetpht.10.5523.

S. K. Jain and K. G. Anil, “Application of Random Forest Regression with Hyper-parameters Tuning to Estimate Reference Evapotranspiration,” IJACSA) International Journal of Advanced Computer Science and Applications, vol. 13, no. 5, pp. 742–750, 2022, [Online]. Available: www.ijacsa.thesai.org

E. Munson, C. Smith, B. Boehmke, and J. Freels, “Sentiment Analysis of Twitter Data (saotd),” J Open Source Softw, vol. 4, no. 34, p. 764, Feb. 2019, doi: 10.21105/joss.00764.

A. Lertpiya, T. Chalothorn, and P. Buabthong, “How to Progressively Build Thai Spelling Correction Systems?,” IEEE Access, vol. 11, pp. 72704–72716, 2023, doi: 10.1109/ACCESS.2023.3295004.

N. Babanejad, H. Davoudi, A. Agrawal, A. An, and M. Papagelis, “The Role of Preprocessing for Word Representation Learning in Affective Tasks,” IEEE Trans Affect Comput, vol. 15, no. 1, pp. 254–272, Jan. 2024, doi: 10.1109/TAFFC.2023.3270115.

A. Hanafiah, A. H. Nasution, Y. Arta, R. Wandri, H. O. Nasution, and J. Mardafora, “Sentimen Analisis Terhadap Customer Review Produk Shopee Berbasis Wordcloud Dengan Algoritma Naïve Bayes Classifier,” INTECOMS: Journal of Information Technology and Computer Science, vol. 6, no. 1, pp. 230–236, Mar. 2023, doi: 10.31539/intecoms.v6i1.5845.

F. D. Ananda and Y. Pristyanto, “Analisis Sentimen Pengguna Twitter Terhadap Layanan Internet Provider Menggunakan Algoritma Support Vector Machine,” MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer, vol. 20, no. 2, pp. 407–416, May 2021, doi: 10.30812/matrik.v20i2.1130.

M. IŞIK and H. DAĞ, “The impact of text preprocessing on the prediction of review ratings,” TURKISH JOURNAL OF ELECTRICAL ENGINEERING & COMPUTER SCIENCES, vol. 28, no. 3, pp. 1405–1421, May 2020, doi: 10.3906/elk-1907-46.

D. Mustikasari, I. Widaningrum, R. Arifin, and W. H. E. Putri, “Comparison of Effectiveness of Stemming Algorithms in Indonesian Documents,” in Proceedings of the 2nd Borobudur International Symposium on Science and Technology (BIS-STE 2020), 2021. doi: 10.2991/aer.k.210810.025.

R. Aryanti, A. Saryoko, A. Junaidi, S. Marlina, Wahyudin, and L. Nurmalia, “Comparing Classification Algorithm with Genetic Algorithm in Public Transport Analysis,” in Journal of Physics: Conference Series, IOP Publishing Ltd, Nov. 2020. doi: 10.1088/1742-6596/1641/1/012017.

T. Misriati, R. Aryanti, and A. Sagiyanto, “High Accurate Prediction of Heart Disease Classification by Support Vector Machine,” in Proceedings of the 3rd International Conference on Advanced Information Scientific Development, SCITEPRESS - Science and Technology Publications, 2023, pp. 5–9. doi: 10.5220/0012437100003848.

K. Chong and N. Shah, “Comparison of Naive Bayes and SVM Classification in Grid-Search Hyperparameter Tuned and Non-Hyperparameter Tuned Healthcare Stock Market Sentiment Analysis,” International Journal of Advanced Computer Science and Applications, vol. 13, no. 12, 2022, doi: 10.14569/IJACSA.2022.0131213.

O. Manullang, C. Prianto, and N. H. Harani, “Analisis Sentimen Untuk Memprediksi Hasil Calon Pemilu Presiden Menggunakan Lexicon Based dan Random Forest,” Jurnal Ilmiah Informatika (JIF), vol. 11, no. 2, pp. 159–169, 2023.

H. Adams, E. Farnell, and B. Story, “Support vector machines and Radon’s theorem,” Nov. 2020, [Online]. Available: http://arxiv.org/abs/2011.00617

W. W. Sukeiti and S. Surono, “Fuzzy Support Vector Machine Using Function Linear Membership and Exponential with Mahanalobis Distance,” JTAM (Jurnal Teori dan Aplikasi Matematika), vol. 6, no. 2, p. 268, Apr. 2022, doi: 10.31764/jtam.v6i2.6912.

Z. Yang, “Generating Knowledge-based Explanation for Recommendation from Review,” in Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, New York, NY, USA: ACM, Jul. 2022, pp. 3494–3494. doi: 10.1145/3477495.3531683.

V. Yepmo, G. Smits, and O. Pivert, “Anomaly explanation: A review Data and Knowledge Engineering,” Data Knowl Eng, vol. 137, p. 101946, Jan. 2022, doi: 10.1016/j.datak.2021.101946.

K. Padmanandam, S. P. V. D. S. Bheri, L. Vegesna, and K. Sruthi, “A Speech Recognized Dynamic Word Cloud Visualization for Text Summarization,” in 2021 6th International Conference on Inventive Computation Technologies (ICICT), IEEE, Jan. 2021, pp. 609–613. doi: 10.1109/ICICT50816.2021.9358693.


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Optimalisasi Random Forest dan Support Vector Machine dengan Hyperparameter GridSearchCV untuk Analisis Sentimen Ulasan PrimaKu

Dimensions Badge
Article History
Submitted: 2024-06-14
Published: 2024-07-31
Abstract View: 827 times
PDF Download: 916 times
How to Cite
Misriati, T., & Aryanti, R. (2024). Optimalisasi Random Forest dan Support Vector Machine dengan Hyperparameter GridSearchCV untuk Analisis Sentimen Ulasan PrimaKu. Journal of Information System Research (JOSH), 5(4), 1333-1341. https://doi.org/10.47065/josh.v5i4.5347
Issue
Section
Articles