Analisis Sentimen Aplikasi Primaku Menggunakan Algoritma Random Forest dan SMOTE untuk Mengatasi Ketidakseimbangan Data


  • Riska Aryanti Universitas Bina Sarana Informatika, Jakarta, Indonesia
  • Titik Misriati * Mail Universitas Bina Sarana Informatika, Jakarta, Indonesia
  • Asriyani Sagiyanto Universitas Bina Sarana Informatika, Jakarta, Indonesia
  • (*) Corresponding Author
Keywords: Text Mining; Sentiment Analysi; Stunting; Google Play Store; Primaku

Abstract

The Primaku application is an application that can be used as a tool to monitor the growth of children under five, this application can be used to collect data on the growth of children under five, apart from that this application can also provide clear information and visualization about the growth of children under five, including nutritional status and growth development In accordance with the standards that have been set, the Primaku application can help parents or health workers in routinely monitoring the growth of children under five and early detecting the potential risk of stunting. Stunting is a growth disorder that occurs in children under five due to malnutrition which is characterized by the child's height. which is shorter than the age standard. Stunting can have a long-term impact on a child's quality of life, such as disrupting physical, cognitive and social development, as well as increasing the risk of chronic disease in adulthood. The primaku application has been widely used, more than 500,000 users have downloaded this application and 44,700 reviews have been given by users to this application, however, reading all the reviews may take time, but if there are few reviews read, then the review results will be biased. Therefore, sentiment analysis aims to overcome this problem by automatically grouping user reviews into positive and negative reviews. Therefore, research on toddler growth detection to determine the public's response to the Primaku application can be of great benefit in efforts to prevent stunting in children under five in Indonesia. In this research, the random forest algorithm with the SMOTE technique was used to carry out sentiment analysis of Primaku application reviews. The random forest algorithm is a machine learning algorithm based on decision trees. The SMOTE technique is used to overcome data imbalance problems and is able to reduce overfitting while increasing the performance of the Random Forest algorithm. The data used in this research is Primaku application review data obtained from scrapping results from the Google Play Store. This data contains comments from application users, namely positive and negative. The results of this sentiment analysis show a deep understanding of user perceptions of the Primaku application. This sentiment analysis can be a basis for further improvement and development of the Primaku application, with a focus on aspects that influence user satisfaction and the research results show that the random forest algorithm with the SMOTE technique can produce quite good accuracy in sentiment analysis of the Primaku application. obtained in this study was 88%.

Downloads

Download data is not yet available.

References

L. Yanti and H. F, “Pemberdayaan Kader Posyandu Dalam Pemantauan Tumbuh Kembang Balita Menggunakan Kpsp Dan Aplikasi ‘Primaku’ Sebagai Deteksi Dini Stunting Di Desa Tanah Merah Kabupaten Kampar,” Krida Cendekia, vol. 1, no. 9, 2022.

Z. Munawar, Iswanto, D. Widhiantoro, and N. I. Putri, “Analisis Sentimen Covid-19 Pada Media Sosial Dengan Model Neural Machine Translation Analysis of Covid-19 Sentiment on Social Media with Neural Machine Translation Model,” TEMATIK (Jurnal teknologi Informasi Komunikasi), vol. 9, no. 1, pp. 15–20, 2022, doi: 10.38204/tematik.v9i1.917.

R. Safitri, N. Alfira, D. Tamitiadini, W. W. A. Dewi, and N. Febriani, Analisis Sentimen: Metode Alternatif Penelitian Big Data. Universitas Brawijaya Press, 2021.

S. Wahyuddin et al., Data Mining. Global Eksekutif Teknologi, 2023.

R. Maulana, A. Voutama, and T. Ridwan, “Analisis Sentimen Ulasan Aplikasi MyPertamina pada Google Playstore menggunakan Algoritma NBC,” Jurnal Teknologi Terpadu, vol. 9, no. 1, pp. 42–48, 2023.

T. Tinaliah and T. Elizabeth, “Analisis Sentimen Ulasan Aplikasi PrimaKu Menggunakan Metode Support Vector Machine,” JATISI (Jurnal Teknik Informatika dan Sistem Informasi), vol. 9, no. 4, 2022, doi: https://doi.org/10.35957/jatisi.v9i4.3586.

W. A. Wahyuni, S. Saepudin, and F. Sembiring, “Sentiment Analysis of Online Investment Applications on Google Play Store using Random Forest Algorithm Method,” Jurnal Mantik, vol. 5, no. 4, 2022.

F. A. Larasati, D. E. Ratnawati, and B. T. Hanggara, “Analisis Sentimen Ulasan Aplikasi Dana dengan Metode Random Forest,” Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer, vol. 6, no. 9, pp. 4305–4313, 2022, [Online]. Available: http://j-ptiik.ub.ac.id

M. Diki Hendriyanto, A. A. Ridha, and U. Enri, “Analisis Sentimen Ulasan Aplikasi Mola Pada Google Play Store Menggunakan Algoritma Support Vector Machine Sentiment Analysis of Mola Application Reviews on Google Play Store Using Support Vector Machine Algorithm,” Journal of Information Technology and Computer Science (INTECOMS), vol. 5, no. 1, 2022.

R. Aryanti et al., “The use of resampling techniques to overcome imbalance of data on the classification algorithm,” in AIP Conference Proceedings, American Institute of Physics Inc., May 2023. doi: 10.1063/5.0128424.

G. Gumelar, Q. Ain, R. Marsuciati, S. Agustanti Bambang, A. Sunyoto, and M. Syukri Mustafa, “Kombinasi Algoritma Sampling dengan Algoritma Klasifikasi untuk Meningkatkan Performa Klasifikasi Dataset Imbalance,” in SISFOTEK : Sistem Informasi dan Teknologi, 2021, pp. 250–255.

M. I. Prasetiyowati, N. U. Maulidevi, and K. Surendro, “The accuracy of Random Forest performance can be improved by conducting a feature selection with a balancing strategy,” in PeerJ Computer Science, 2022.

E. C. Gök and M. O. Olgun, “SMOTE-NC and gradient boosting imputation based random forest classifier for predicting severity level of covid-19 patients with blood samples,” Neural Comput Appl, vol. 33, no. 22, pp. 15693–15707, 2021.

A. N. Kasanah, M. Muladi, and U. Pujianto, “Penerapan Teknik SMOTE untuk Mengatasi Imbalance Class dalam Klasifikasi Objektivitas Berita Online Menggunakan Algoritma KNN,” Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi), vol. 3, no. 2. pp. 196–201, 2018. doi: https://doi.org/10.29207/resti.v3i2.945.

E. Erlin, Y. Desnelita, N. Nasution, L. Suryati, and F. Zoromi, “Dampak SMOTE terhadap Kinerja Random Forest Classifier berdasarkan Data Tidak seimbang,” MATRIK: Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer, vol. 21, no. 3, pp. 677–690, 2022, doi: https://doi.org/10.30812/matrik.v21i3.1726.

H. C. Morama, D. E. Ratnawati, and I. Arwani, “Analisis Sentimen berbasis Aspek terhadap Ulasan Hotel Tentrem Yogyakarta menggunakan Algoritma Random Forest Classifier,” Jurnal Pengembangan Teknologi Informasi Dan Ilmu Komputer, vol. 6, no. 4, pp. 1702–1708, 2022, [Online]. Available: https://j-ptiik.ub.ac.id/index.php/j-ptiik/article/view/10908

T. Asgari, A. Daneshvar, A. P. Chobar, M. Ebrahimi, and S. Abrahamyan, “Identifying key success factors for startups With sentiment analysis using text data mining,” International Journal of Engineering Business Management, vol. 14, 2022, doi: 10.1177/18479790221131612.

N. Istiqamah, B. Surarso, and B. Warsito, “Classification of customer review using random forest classifier,” AIP Conf Proc, vol. 2738, no. 1, 2023.

H. C. Morama, D. E. Ratnawati, and I. Arwani, “Analisis Sentimen berbasis Aspek terhadap Ulasan Hotel Tentrem Yogyakarta menggunakan Algoritma Random Forest Classifier,” Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer, vol. 6, no. 4, pp. 1702–1708, 2022.

Prasetya J. and Abdurakhman A., “Comparison of Smote Random Forest and Smote K-Nearest Neighbors Classification Analysis on Imbalanced Data,” Media Statistika, vol. 15, no. 2, pp. 198–208, 2023, doi: 10.14710/medstat.15.2.198-208.

R. Alshamy, M. Ghurab, S. Othman, and F. Alshami, “Intrusion Detection Model for Imbalanced Dataset Using SMOTE and Random Forest Algorithm,” in Advances in Cyber Security. ACeS 2021. Communications in Computer and Information Science, vol. 1487, N. Abdullah, S. Manickam, and M. Anbar, Eds., Singapore: Springer, 2021. doi: 10.1007/978-981-16-8059-5_22.

V. Christanti Mawardi and E. Darmaja, “Logistic Regression Method for Sentiment Analysis Application on Google Playstore,” International Journal of Application on Sciences, Technology and Engineering, vol. 1, no. 1, pp. 241–247, Feb. 2023, doi: 10.24912/ijaste.v1.i1.241-247.

A. V Saurkar, K. G. Pathare, and S. A. Gode, “An overview on web scraping techniques and tools,” International Journal on Future Revolution in Computer Science & Communication Engineering, vol. 4, no. 4, pp. 363–367, 2018.

A. Ulfah and I. Najiah, “Implementasi Web Scraping Pada Situs Jurnal Sinta Menggunakan Framework Selenium Web Driver Python,” JIKA (Jurnal Informatika, vol. 7, no. 1, pp. 29–36, 2023.

M. Djufri, “Penerapan Teknik Web Scraping Untuk Penggalian Potensi Pajak (Studi Kasus Pada Online Market Place Tokopedia, Shopee Dan Bukalapak,” Jurnal BPPK: Badan Pendidikan Dan Pelatihan Keuangan, vol. 13, no. 2, pp. 65–75, 2020.

D. P. Santoso and W. Wibowo, “Analisis Sentimen Ulasan Aplikasi Buzzbreak Menggunakan Metode Naïve Bayes Classifier pada Situs Google Play Store,” Jurnal Sains dan Seni ITS, vol. 11, no. 2, pp. 190–196, 2022.

I. Muslim Karo Karo et al., “Analisis Sentimen Ulasan Aplikasi Info BMKG di Google Play Menggunakan TF-IDF dan Support Vector Machine,” Journal of Information System Research, vol. 4, no. 4, pp. 1423–1430, 2023, doi: 10.47065/josh.v4i4.3943.

Z. Min, “Drugs Reviews Sentiment Analysis using Weakly Supervised Model*,” in International Conference on Artificial Intelligence and Computer Applications (ICAICA), 2019, pp. 332–336.

HaqueTanjim Ul, N. N. Saber, and F. M. Shah, “Sentiment Analysis on Large Scale Amazon Product Reviews,” in International Conference on Innovative Research and Developme, 2018, pp. 190–196.

S. Rabbani, D. Safitri, N. Rahmadhani, and M. K. Anam, “Perbandingan Evaluasi Kernel SVM untuk Klasifikasi Sentimen dalam Analisis Kenaikan Harga BBM: Comparative Evaluation of SVM Kernels for Sentiment Classification in Fuel Price Increase Analysis,” MALCOM: Indonesian Journal of Machine Learning and Computer Science, vol. 3, no. 2, pp. 153–160, 2023.

R. A. Supono and M. A. Suprayogi, “Perbandingan Metode TF-ABS dan TF-IDF Pada Klasifikasi Teks Helpdesk Menggunakan K-Nearest Neighbor,” Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi, vol. 5, no. 5, pp. 911–918, 2021.

A. Syukron, E. Saputro, and P. Widodo, “Penerapan Metode Smote Untuk Mengatasi Ketidakseimbangan Kelas Pada Prediksi Gagal Jantung,” 2023. [Online]. Available: https://doi.org/10/25047/jtit.v10i1.312


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Analisis Sentimen Aplikasi Primaku Menggunakan Algoritma Random Forest dan SMOTE untuk Mengatasi Ketidakseimbangan Data

Dimensions Badge
Article History
Submitted: 2023-11-10
Published: 2023-11-30
Abstract View: 2253 times
PDF Download: 1616 times
Section
Articles