Analisa Kinerja Algoritma Random Forest dan XGBoost dalam Klasifikasi Penyakit Cacar Monyet (Monkeypox)
Abstract
Monkeypox is a contagious disease that requires prompt and accurate handling, particularly in the diagnostic process. However, identifying symptomps manually often takes time and is prone to error. In response to this challenge, this study aims to develop a machine learning based classification model to support a more efficient diagnosis process. This research applies two machine learning algorithms XGBoost regression and Random Forest regression to classify patients as infected or uninfected with monkeypox based on clinical symptoms. The study focuses on assessing how well each algorithm can distinguish between positive and negative cases, especially when dealing with imbalanced data or overlapping features. The dataset used consists of 25.000 entries sourced from Kaggle, each containing clinical indicators related to monkeypox. Before modeling, the data underwent exploratory data analysis (EDA) and preprocessing, including handling missing values. Furthermore, cross-validation and parameter tuning techniques were implemented to optimize model performance. The results indicate that XGBoost outperformed Random Forest, achieving 68% accuracy, 69% precision, 89% recall, and a 78% F1-score. In contrast, Random Forest yielded slightly lower scores. Both models were evaluated using the ROC curve, where each reached an AUC values of 0.60. This suggests that wile both models show potential, their ability to clearly distinguish between classes positive and negative remains limited and can be improves in future work.
Downloads
References
J. Kwong, K. C. McNabb, J. G. Voss, A. Bergman, K. McGee, and J. Farley, “Monkeypox Virus Outbreak 2022: Key Epidemiologic, Clinical, Diagnostic, and Prevention Considerations,” J. Assoc. Nurses AIDS Care, vol. 33, no. 6, pp. 657–667, 2022, doi: 10.1097/JNC.0000000000000365.
E. Sherwood et al., “Invasive group A streptococcal disease in pregnant women and young children: a systematic review and meta-analysis,” Lancet Infect. Dis., vol. 22, no. 7, pp. 1076–1088, Jul. 2022, doi: 10.1016/S1473-3099(21)00672-1.
A. R. A. Saied, M. Dhawan, A. A. Metwally, M. L. Fahrni, P. Choudhary, and O. P. Choudhary, “Disease History, Pathogenesis, Diagnostics, and Therapeutics for Human Monkeypox Disease: A Comprehensive Review,” MDPI, Vol 10, No 12, doi: 10.3390/vaccines10122091.
F. Wei et al., “Study and prediction of the 2022 global monkeypox epidemic,” J. Biosaf. Biosecurity, vol. 4, no. 2, pp. 158–162, Dec. 2022, doi: 10.1016/j.jobb.2022.12.001.
J. Lu et al., “Mpox (formerly monkeypox): pathogenesis, prevention, and treatment,” Dec. 27, 2023, Springer Nature. doi: 10.1038/s41392-023-01675-2.
H. Harapan et al., “Monkeypox: A Comprehensive Review,” Sep. 29, 2022, MDPI. doi: 10.3390/v14102155.
D. L. Fink et al., “Clinical features and management of individuals admitted to hospital with monkeypox and associated complications across the UK: a retrospective cohort study,” Lancet Infect. Dis., vol. 23, no. 5, pp. 589–597, May 2023, doi: 10.1016/S1473-3099(22)00806-4.
F. Aldi, I. Nozomi, R. B. Sentosa, and A. Junaidi, “Machine Learning to Identify Monkey Pox Disease,” Sinkron, vol. 8, no. 3, pp. 1335–1347, Jul. 2023, doi: 10.33395/sinkron.v8i3.12524.
sehatnegeriku.kemkes.go.id, “Kasus monkeypox pertama di Indonesia terkonfirmasi. Sehat Negeriku,” https://sehatnegeriku.kemkes.go.id/baca/rilis-media/20220820/3140968/kasus-monkeypox-pertama-di-indonesia-terkonfirmasi-2/ .
M. M. Ahsan, M. R. Uddin, and S. A. Luna, “Monkeypox Image Data collection,” arxiv, Jun. 2022, doi: https://doi.org/10.48550/arxiv.2206.01774.
A. Wijoyo, A. Y. Saputra, S. Ristanti, R. Sya’ban, M. Amalia, and R. Febriansyah, “Pembelajaran Machine Learning,” OKTAL, vol. 3, pp. 375–380, Feb. 2024, Accessed: Feb. 05, 2024. [Online]. Available: https://journal.mediapublikasi.id/index.php/oktal/article/view/2305
P. Singh, N. Singh, K. K. Singh, and A. Singh, “Diagnosing of disease using machine learning,” in Machine Learning and the Internet of Medical Things in Healthcare, Elsevier, 2021, pp. 89–111. doi: 10.1016/B978-0-12-821229-5.00003-3.
S. Tufail, H. Riggs, M. Tariq, and A. I. Sarwat, “Advancements and Challenges in Machine Learning: A Comprehensive Review of Models, Libraries, Applications, and Algorithms,” MDPI, Vol 12, No 8, 2023, MDPI. doi: 10.3390/electronics12081789.
N. Nyoman, P. Pinata, M. Sukarsa, N. Kadek, and D. Rusjayanthi, “Prediksi Kecelakaan Lalu Lintas di Bali dengan XGBoost pada Python,” J. Ilm. MERPATI, vol. 8, pp. 188–196, Dec. 2020, doi: 10.24843/jim.2020.v08.i03.p04,2020.
F. ANISHA, Dodi Vionanda, Nonong amalita, and Zilrahmi, “Application of Random Forest for The Classification Diabetes Mellitus Disease in RSUP Dr. M. Jamil Padang,” UNP J. Stat. Data Sci., vol. 1, no. 2, pp. 45–52, Mar. 2023, doi: 10.24036/ujsds/vol1-iss2/30.
L. Hoang Huong, N. Hoang Khang, L. Nhat Quynh, L. Huu Thang, D. Minh Canh, and H. Phuoc Sang, “A Proposed Approach for Monkeypox Classification,” International Journal of Advanced Computer Science and Applications(IJACSA), Vol 14, No 8, 2023. doi: http://dx.doi.org/10.14569/IJACSA.2023.0140871.
M. E. Haque, M. R. Ahmed, R. S. Nila, and S. Islam, “Classification of Human Monkeypox Disease Using Deep Learning Models and Attention Mechanisms,” arxiv, Nov. 2022, doi: https://doi.org/10.48550/arXiv.2211.15459.
A. Khairunnisa, “Perbandingan Model Random Forest Dan Xgboost Untuk Prediksi Kejahatan Kesusilaan Di Provinsi Jawa Barat,” JIKO (Jurnal Inform. dan Komputer), vol. 7, no. 2, p. 202, Sep. 2023, doi: 10.26798/jiko.v7i2.799.
W. Hong et al., “A Comparison of XGBoost, Random Forest, and Nomograph for the Prediction of Disease Severity in Patients With COVID-19 Pneumonia: Implications of Cytokine and Immune Cell Profile,” Front. Cell. Infect. Microbiol., vol. 12, Apr. 2022, doi: 10.3389/fcimb.2022.819267.
M. Ahmed, “Monkey-Pox PATIENTS Dataset.,” https://doi.org/10.34740/KAGGLE/DSV/4271503.
H. Faisal, A. Febriandirza, and F. N. Hasan, “Analisis Sentimen Terkait Ulasan Pada Aplikasi PLN Mobile Menggunakan Metode Support Vector Machine,” KESATRIA J. Penerapan Sist. Inf. (Komputer Manajemen), vol. 5, no. 1, pp. 303–312, Jan. 2024, doi: https://doi.org/10.30645/kesatria.v5i1.339.
S. Desmalia, A. Mutoi Siregar, K. A. Baihaqi, and T. Rohana, “Comparison Model Optimal Machine Learning Model With Feature Extraction for Heart Attack Disease Classification,” Sci. J. Informatics, vol. 11, no. 2, pp. 485–492, Mar. 2024, doi: 10.15294/sji.v11i2.4561.
K. Nugroho and F. N. Hasan, “Analisis Sentimen Masyarakat Mengenai RUU Perampasan Aset Di Twitter Menggunakan Metode Naïve Bayes,” SMATIKA J., vol. 13, no. 02, pp. 273–283, Dec. 2023, doi: 10.32664/smatika.v13i02.899.
D. Baharudin, Pembelajaran Machine Learning. 2024. doi: https://books.google.co.id/books?id=bdouEQAAQBAJ&lpg=PP1&lr&pg=PP1#v=onepage&q&f=false.
R. Chairunisa, Adiwijaya, and W. Astuti, “Perbandingan CART dan Random Forest untuk Deteksi Kanker berbasis Klasifikasi Data Microarray,” Jurnal Resti, , vol. 4, no. 5, pp. 805–812, Oct. 2020, doi: 10.29207/resti.v4i5.2083.
F. Parsakh Nursyamsyi and F. Noor Hasan, “KLIK: Kajian Ilmiah Informatika dan Komputer Klasifikasi Sentimen Terhadap Aplikasi Identitas Kependudukan Digital Menggunakan Algoritma Naïve Bayes dan SVM,” Media Online, vol. 4, no. 3, pp. 1788–1798, Dec. 2023, doi: 10.30865/klik.v4i3.1517.
Muslih, A. Hilda Meutia, and M. Elly Jafar, “PETIR: Jurnal Pengkajian dan Penerapan Teknik Informatika Metode Klasifikasi Support Vector Machine (SVM) Untuk Analisis Sentimen Aplikasi Bing: Chat with AI & GPT-4 Di Google Play Store,” PETIR J. Pengkaj. dan Penerapan Tek. Inform., vol. 17, pp. 68–76, Jun. 2024, doi: 10.33322/petir.v17i1.2283.
A. P. Kirana, F. Dimas, and N. H. Firman, “Implementation of Data Mining to Predict Student Study Period with Decision Tree Algorithm (C4.5),” J. Sisfokom (Sistem Inf. dan Komputer), vol. 13, no. 1, pp. 31–39, Feb. 2024, doi: 10.32736/sisfokom.v13i1.1943.
S. Ramadani and N. H. Firman, “Analisis Sentimen Terhadap Program Makan Siang & Susu Gratis Menggunakan Algoritma Naive Bayes,” J. Teknol. Dan Sist. Inf. Bisnis, vol. 6, no. 3, pp. 411–419, Jul. 2024, doi: 10.47233/jteksis.v6i3.1378.
S. Rampogu, “A review on the use of machine learning techniques in monkeypox disease prediction,” Elsevier B.V, Sep. 23, 2023, doi: 10.1016/j.soh.2023.100040.
Hozairi, Anwari, and S. Alim, “Implementasi orange data mining untuk klasifikasi kelulusan mahasiswa dengan model K-Nearest Neighbor, Decision Tree serta Naive Bayes,”Jurnal Ilmiah Nero, Vol 6, No 2, 2021, doi: 10.21107/nero.v6i2.237.
Bila bermanfaat silahkan share artikel ini
Berikan Komentar Anda terhadap artikel Analisa Kinerja Algoritma Random Forest dan XGBoost dalam Klasifikasi Penyakit Cacar Monyet (Monkeypox)
Pages: 1757-1766
Copyright (c) 2025 Mohammad Dito Dwi Krisna, Firman Noor Hasan

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).






















