Analisa Kinerja Algoritma Random Forest dan XGBoost dalam Klasifikasi Penyakit Cacar Monyet (Monkeypox)


  • Mohammad Dito Dwi Krisna Universitas Muhammadiyah Prof. Dr. Hamka, Jakarta, Indonesia
  • Firman Noor Hasan * Mail Universitas Muhammadiyah Prof. Dr. Hamka, Jakarta, Indonesia https://orcid.org/0000-0002-1246-3462
  • (*) Corresponding Author
Keywords: Classification; Random Forest; XGBoost; Machine Learning; Monkeypox

Abstract

Monkeypox is a contagious disease that requires prompt and accurate handling, particularly in the diagnostic process. However, identifying symptomps manually often takes time and is prone to error. In response to this challenge, this study aims to develop a machine learning based classification model to support a more efficient diagnosis process. This research applies two machine learning algorithms XGBoost regression and Random Forest regression to classify patients as infected or uninfected with monkeypox based on clinical symptoms. The study focuses on assessing how well each algorithm can distinguish between positive and negative cases, especially when dealing with imbalanced data or overlapping features. The dataset used consists of 25.000 entries sourced from Kaggle, each containing clinical indicators related to monkeypox. Before modeling, the data underwent exploratory data analysis (EDA) and preprocessing, including handling missing values. Furthermore, cross-validation and parameter tuning techniques were implemented to optimize model performance. The results indicate that XGBoost outperformed Random Forest, achieving 68% accuracy, 69% precision, 89% recall, and a 78% F1-score. In contrast, Random Forest yielded slightly lower scores. Both models were evaluated using the ROC curve, where each reached an AUC values of 0.60. This suggests that wile both models show potential, their ability to clearly distinguish between classes positive and negative remains limited and can be improves in future work.

Downloads

Download data is not yet available.

Author Biographies

Mohammad Dito Dwi Krisna, Universitas Muhammadiyah Prof. Dr. Hamka, Jakarta

Program Studi Teknik Informatika

Firman Noor Hasan, Universitas Muhammadiyah Prof. Dr. Hamka, Jakarta

Program Studi Teknik Informatika

References

J. Kwong, K. C. McNabb, J. G. Voss, A. Bergman, K. McGee, and J. Farley, “Monkeypox Virus Outbreak 2022: Key Epidemiologic, Clinical, Diagnostic, and Prevention Considerations,” J. Assoc. Nurses AIDS Care, vol. 33, no. 6, pp. 657–667, 2022, doi: 10.1097/JNC.0000000000000365.

E. Sherwood et al., “Invasive group A streptococcal disease in pregnant women and young children: a systematic review and meta-analysis,” Lancet Infect. Dis., vol. 22, no. 7, pp. 1076–1088, Jul. 2022, doi: 10.1016/S1473-3099(21)00672-1.

A. R. A. Saied, M. Dhawan, A. A. Metwally, M. L. Fahrni, P. Choudhary, and O. P. Choudhary, “Disease History, Pathogenesis, Diagnostics, and Therapeutics for Human Monkeypox Disease: A Comprehensive Review,” MDPI, Vol 10, No 12, doi: 10.3390/vaccines10122091.

F. Wei et al., “Study and prediction of the 2022 global monkeypox epidemic,” J. Biosaf. Biosecurity, vol. 4, no. 2, pp. 158–162, Dec. 2022, doi: 10.1016/j.jobb.2022.12.001.

J. Lu et al., “Mpox (formerly monkeypox): pathogenesis, prevention, and treatment,” Dec. 27, 2023, Springer Nature. doi: 10.1038/s41392-023-01675-2.

H. Harapan et al., “Monkeypox: A Comprehensive Review,” Sep. 29, 2022, MDPI. doi: 10.3390/v14102155.

D. L. Fink et al., “Clinical features and management of individuals admitted to hospital with monkeypox and associated complications across the UK: a retrospective cohort study,” Lancet Infect. Dis., vol. 23, no. 5, pp. 589–597, May 2023, doi: 10.1016/S1473-3099(22)00806-4.

F. Aldi, I. Nozomi, R. B. Sentosa, and A. Junaidi, “Machine Learning to Identify Monkey Pox Disease,” Sinkron, vol. 8, no. 3, pp. 1335–1347, Jul. 2023, doi: 10.33395/sinkron.v8i3.12524.

sehatnegeriku.kemkes.go.id, “Kasus monkeypox pertama di Indonesia terkonfirmasi. Sehat Negeriku,” https://sehatnegeriku.kemkes.go.id/baca/rilis-media/20220820/3140968/kasus-monkeypox-pertama-di-indonesia-terkonfirmasi-2/ .

M. M. Ahsan, M. R. Uddin, and S. A. Luna, “Monkeypox Image Data collection,” arxiv, Jun. 2022, doi: https://doi.org/10.48550/arxiv.2206.01774.

A. Wijoyo, A. Y. Saputra, S. Ristanti, R. Sya’ban, M. Amalia, and R. Febriansyah, “Pembelajaran Machine Learning,” OKTAL, vol. 3, pp. 375–380, Feb. 2024, Accessed: Feb. 05, 2024. [Online]. Available: https://journal.mediapublikasi.id/index.php/oktal/article/view/2305

P. Singh, N. Singh, K. K. Singh, and A. Singh, “Diagnosing of disease using machine learning,” in Machine Learning and the Internet of Medical Things in Healthcare, Elsevier, 2021, pp. 89–111. doi: 10.1016/B978-0-12-821229-5.00003-3.

S. Tufail, H. Riggs, M. Tariq, and A. I. Sarwat, “Advancements and Challenges in Machine Learning: A Comprehensive Review of Models, Libraries, Applications, and Algorithms,” MDPI, Vol 12, No 8, 2023, MDPI. doi: 10.3390/electronics12081789.

N. Nyoman, P. Pinata, M. Sukarsa, N. Kadek, and D. Rusjayanthi, “Prediksi Kecelakaan Lalu Lintas di Bali dengan XGBoost pada Python,” J. Ilm. MERPATI, vol. 8, pp. 188–196, Dec. 2020, doi: 10.24843/jim.2020.v08.i03.p04,2020.

F. ANISHA, Dodi Vionanda, Nonong amalita, and Zilrahmi, “Application of Random Forest for The Classification Diabetes Mellitus Disease in RSUP Dr. M. Jamil Padang,” UNP J. Stat. Data Sci., vol. 1, no. 2, pp. 45–52, Mar. 2023, doi: 10.24036/ujsds/vol1-iss2/30.

L. Hoang Huong, N. Hoang Khang, L. Nhat Quynh, L. Huu Thang, D. Minh Canh, and H. Phuoc Sang, “A Proposed Approach for Monkeypox Classification,” International Journal of Advanced Computer Science and Applications(IJACSA), Vol 14, No 8, 2023. doi: http://dx.doi.org/10.14569/IJACSA.2023.0140871.

M. E. Haque, M. R. Ahmed, R. S. Nila, and S. Islam, “Classification of Human Monkeypox Disease Using Deep Learning Models and Attention Mechanisms,” arxiv, Nov. 2022, doi: https://doi.org/10.48550/arXiv.2211.15459.

A. Khairunnisa, “Perbandingan Model Random Forest Dan Xgboost Untuk Prediksi Kejahatan Kesusilaan Di Provinsi Jawa Barat,” JIKO (Jurnal Inform. dan Komputer), vol. 7, no. 2, p. 202, Sep. 2023, doi: 10.26798/jiko.v7i2.799.

W. Hong et al., “A Comparison of XGBoost, Random Forest, and Nomograph for the Prediction of Disease Severity in Patients With COVID-19 Pneumonia: Implications of Cytokine and Immune Cell Profile,” Front. Cell. Infect. Microbiol., vol. 12, Apr. 2022, doi: 10.3389/fcimb.2022.819267.

M. Ahmed, “Monkey-Pox PATIENTS Dataset.,” https://doi.org/10.34740/KAGGLE/DSV/4271503.

H. Faisal, A. Febriandirza, and F. N. Hasan, “Analisis Sentimen Terkait Ulasan Pada Aplikasi PLN Mobile Menggunakan Metode Support Vector Machine,” KESATRIA J. Penerapan Sist. Inf. (Komputer Manajemen), vol. 5, no. 1, pp. 303–312, Jan. 2024, doi: https://doi.org/10.30645/kesatria.v5i1.339.

S. Desmalia, A. Mutoi Siregar, K. A. Baihaqi, and T. Rohana, “Comparison Model Optimal Machine Learning Model With Feature Extraction for Heart Attack Disease Classification,” Sci. J. Informatics, vol. 11, no. 2, pp. 485–492, Mar. 2024, doi: 10.15294/sji.v11i2.4561.

K. Nugroho and F. N. Hasan, “Analisis Sentimen Masyarakat Mengenai RUU Perampasan Aset Di Twitter Menggunakan Metode Naïve Bayes,” SMATIKA J., vol. 13, no. 02, pp. 273–283, Dec. 2023, doi: 10.32664/smatika.v13i02.899.

D. Baharudin, Pembelajaran Machine Learning. 2024. doi: https://books.google.co.id/books?id=bdouEQAAQBAJ&lpg=PP1&lr&pg=PP1#v=onepage&q&f=false.

R. Chairunisa, Adiwijaya, and W. Astuti, “Perbandingan CART dan Random Forest untuk Deteksi Kanker berbasis Klasifikasi Data Microarray,” Jurnal Resti, , vol. 4, no. 5, pp. 805–812, Oct. 2020, doi: 10.29207/resti.v4i5.2083.

F. Parsakh Nursyamsyi and F. Noor Hasan, “KLIK: Kajian Ilmiah Informatika dan Komputer Klasifikasi Sentimen Terhadap Aplikasi Identitas Kependudukan Digital Menggunakan Algoritma Naïve Bayes dan SVM,” Media Online, vol. 4, no. 3, pp. 1788–1798, Dec. 2023, doi: 10.30865/klik.v4i3.1517.

Muslih, A. Hilda Meutia, and M. Elly Jafar, “PETIR: Jurnal Pengkajian dan Penerapan Teknik Informatika Metode Klasifikasi Support Vector Machine (SVM) Untuk Analisis Sentimen Aplikasi Bing: Chat with AI & GPT-4 Di Google Play Store,” PETIR J. Pengkaj. dan Penerapan Tek. Inform., vol. 17, pp. 68–76, Jun. 2024, doi: 10.33322/petir.v17i1.2283.

A. P. Kirana, F. Dimas, and N. H. Firman, “Implementation of Data Mining to Predict Student Study Period with Decision Tree Algorithm (C4.5),” J. Sisfokom (Sistem Inf. dan Komputer), vol. 13, no. 1, pp. 31–39, Feb. 2024, doi: 10.32736/sisfokom.v13i1.1943.

S. Ramadani and N. H. Firman, “Analisis Sentimen Terhadap Program Makan Siang & Susu Gratis Menggunakan Algoritma Naive Bayes,” J. Teknol. Dan Sist. Inf. Bisnis, vol. 6, no. 3, pp. 411–419, Jul. 2024, doi: 10.47233/jteksis.v6i3.1378.

S. Rampogu, “A review on the use of machine learning techniques in monkeypox disease prediction,” Elsevier B.V, Sep. 23, 2023, doi: 10.1016/j.soh.2023.100040.

Hozairi, Anwari, and S. Alim, “Implementasi orange data mining untuk klasifikasi kelulusan mahasiswa dengan model K-Nearest Neighbor, Decision Tree serta Naive Bayes,”Jurnal Ilmiah Nero, Vol 6, No 2, 2021, doi: 10.21107/nero.v6i2.237.


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Analisa Kinerja Algoritma Random Forest dan XGBoost dalam Klasifikasi Penyakit Cacar Monyet (Monkeypox)

Dimensions Badge
Article History
Submitted: 2025-04-01
Published: 2025-04-30
Abstract View: 633 times
PDF Download: 409 times
How to Cite
Krisna, M. D. D., & Hasan, F. N. (2025). Analisa Kinerja Algoritma Random Forest dan XGBoost dalam Klasifikasi Penyakit Cacar Monyet (Monkeypox). Journal of Information System Research (JOSH), 6(3), 1757-1766. https://doi.org/10.47065/josh.v6i3.7167
Issue
Section
Articles