Perbandingan Kinerja Metode Naïve Bayes dan Random Forest untuk Klasifikasi Penyakit Diabetes Berdasarkan Data Medis
Abstract
Diabetes mellitus merupakan penyakit tidak menular yang prevalensinya terus meningkat di Indonesia. Proses diagnosis secara konvensional sering menghadapi berbagai tantangan, seperti keterlambatan dan biaya yang tinggi. Penelitian ini bertujuan untuk membandingkan kinerja algoritma Naive Bayes dan Random Forest dalam klasifikasi diabetes dengan menggunakan dataset Pima Indians Diabetes. Untuk mengatasi ketidakseimbangan kelas, dataset diproses menggunakan teknik Synthetic Minority Over-sampling Technique (SMOTE). Evaluasi kinerja dilakukan menggunakan metrik akurasi, presisi, recall, dan F1-score. Hasil penelitian menunjukkan bahwa algoritma Random Forest memperoleh akurasi sebesar 79,5%, presisi 79,6%, recall 79,5%, dan F1-score 79,5%. Sementara itu, algoritma Naive Bayes memperoleh akurasi 76,5%, presisi 76,5%, recall 76,5%, dan F1-score 76,5%. Temuan ini menunjukkan bahwa Random Forest unggul dalam menangani data yang kompleks dengan akurasi prediksi yang lebih tinggi, sedangkan Naive Bayes tetap efektif untuk implementasi yang lebih sederhana karena efisiensi komputasinya. Studi ini berkontribusi dalam pengembangan sistem pendukung keputusan cerdas untuk deteksi dini diabetes yang lebih cepat dan akurat, sehingga dapat membantu mengurangi beban pada sistem layanan kesehatan.
Downloads
References
BPS (Badan Pusat Statistik), “Statistik Kesehatan Indonesia: Diabetes Melitus,” Jakarta, Jun. 2021. Accessed: Jun. 05, 2025. [Online]. Available: https://www.bps.go.id/id/publication/2021/12/22/0f207323902633342a1f6b01/profil-statistik-kesehatan-2021.html
Kementerian Kesehatan Republik Indonesia, “Profil Kesehatan Indonesia 2021,” Jakarta, Jun. 2021. Accessed: Jun. 05, 2025. [Online]. Available: https://kemkes.go.id/app_asset/file_content_download/Profil-Kesehatan-Indonesia-2021.pdf
American Diabetes Association, “Diabetes Mellitus: Diagnosis and Management,” Diabetes Care, vol. 44, no. Supplement_1, pp. S151–S167, Jan. 2021, doi: 10.2337/dc21-S011.
International Diabetes Federation (IDF), IDF Diabetes Atlas, 10th Edition, 10th Edition. Brussels, Belgium: International Diabetes Federation, 2021. Accessed: Jun. 05, 2025. [Online]. Available: https://diabetesatlas.org/atlas/tenth-edition/
World Health Organization, “Global Report on Diabetes,” Geneva, Jun. 2020. Accessed: Jun. 05, 2025. [Online]. Available: https://www.who.int/publications/i/item/9789241565257
D. Hermawan and A. Supriyadi, “Analisis Penggunaan Machine Learning dalam Diagnosis Diabetes,” Jurnal Teknologi Kesehatan, vol. 15, no. 4, pp. 233–240, 2022.
M. A. Alam, A. Sohel, K. M. Hasan, and M. A. Islam, “Machine Learning And Artificial Intelligence in Diabetes Prediction And Management: A Comprehensive Review of Models,” Innovatech Engineering Journal, vol. 1, no. 01, pp. 107–124, Dec. 2024, doi: 10.70937/jnes.v1i01.41.
D. ; S. H. ; S. R. A. Nugroho, “Implementasi Naive Bayes dan Random Forest dalam Klasifikasi Diabetes,” Jurnal Informatika Medis, vol. 12, no. 3, pp. 112–118, 2021, Accessed: Jun. 04, 2025. [Online]. Available: https://www.kaggle.com/datasets
H. Mansourifar and W. Shi, “Deep Synthetic Minority Over-Sampling Technique,” arXiv preprint arXiv:2003.09788, Mar. 2020, doi: 10.48550/arXiv.2003.09788.
C. Y. Huang and H. L. Dai, “Learning from class-imbalanced data: review of data driven methods and algorithm driven methods,” Data Science in Finance and Economics, vol. 1, no. 1, pp. 21–36, 2021, doi: 10.3934/DSFE.2021002.
W. Chen, K. Yang, Z. Yu, Y. Shi, and C. L. P. Chen, “A survey on imbalanced learning: latest research, applications and future directions,” Artif Intell Rev, vol. 57, no. 6, Jun. 2024, doi: 10.1007/s10462-024-10759-6.
W. Hasanah and L. C. Munggaran, “Comparison of Naïve Bayes and Random Forest Methods for Diabetes Prediction,” Int J Comput Appl, vol. 174, no. 26, pp. 1–6, Mar. 2021, Accessed: Jun. 05, 2025. [Online]. Available: https://www.ijcaonline.org/archives/volume174/number26/31837-2021921184/
K. Huda and M. Ula, “Penerapan Naive Bayes, Regresi Logistik, Random Forest, Svm, Dan Knn Untuk Prediksi Diabetes,” SENASTIKA Universitas Malikussaleh, Nov. 2024. Accessed: Jun. 05, 2025. [Online]. Available: https://proceedings.unimal.ac.id/senastika/article/view/853
Y. , L. W. , Z. T. Chen, “Evaluating the impact of different features on the performance of machine learning models in predicting diabetes,” Journal of Data Science, vol. 18, no. 4, pp. 489–499, 2020.
I. H. Sarker, “Machine learning in healthcare: An introduction and review,” J Healthc Eng, pp. 1–17, 2021, doi: 10.1155/2021/9952417.
Scikit-learn Developers, “Naive Bayes classifier,” Scikit-learn Documentation. Accessed: Jun. 05, 2025. [Online]. Available: https://scikit-learn.org/stable/modules/naive_bayes.html
GeeksforGeeks, “Naive Bayes Classifiers.” Accessed: Jun. 05, 2025. [Online]. Available: https://www.geeksforgeeks.org/naive-bayes-classifiers/
Sunil Ray, “Naive Bayes Classifier Explained With Practical Problems.” Accessed: Jun. 05, 2025. [Online]. Available: https://www.analyticsvidhya.com/blog/2017/09/naive-bayes-explained/
M. Shahhosseini and G. Hu, “Improved weighted random forest for classification problems,” ArXiv, 2020.
T. Fulazzaky, A. Saefuddin, and A. M. Soleh, “Evaluating Ensemble Learning Techniques for Class Imbalance in Machine Learning: A Comparative Analysis of Balanced Random Forest, SMOTE-RF, SMOTEBoost, and RUSBoost,” Scientific Journal of Informatics, vol. 11, no. 4, pp. 969–980, Dec. 2024, doi: 10.15294/sji.v11i4.15937.
Bila bermanfaat silahkan share artikel ini
Berikan Komentar Anda terhadap artikel Perbandingan Kinerja Metode Naïve Bayes dan Random Forest untuk Klasifikasi Penyakit Diabetes Berdasarkan Data Medis
Pages: 431-443
Copyright (c) 2025 Rendy Risqi Pradana, Yani Parti Astuti

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).





















