Model Prediksi Penyakit Jantung dengan Penanganan Outlier Menggunakan Interquartile Range dan Extreme Gradient Boosting
Abstract
Heart disease remains one of the leading causes of death worldwide, with increasing prevalence rates, including in Indonesia. Delayed detection and diagnosis are the main challenges in treating this disease, as most cases are only identified after patients experience serious symptoms or heart attacks. Medical data often containing outliers and noise adds to the complexity of developing accurate predictive models. This study aims to develop a heart disease prediction model using a combination of the Interquartile Range (IQR) method for outlier handling and the Extreme Gradient Boosting (XGBoost) algorithm for predictive modeling. The IQR method is applied at the pre-processing stage to identify and eliminate outliers robustly without reducing data integrity, while XGBoost is used to build an efficient prediction model through an ensemble learning approach. The results showed significant improvements in model performance, with accuracy increasing from 75.41% to 89.47% and AUC-ROC from 0.8615 to 0.9450. The model demonstrates balanced predictive capabilities with precision of 95.24% and recall of 80.00% for cases without disease, and precision of 86.11% and recall of 96.88% for cases with disease. The developed model makes significant contributions by improving data quality through robust outlier handling using the IQR method, building a more accurate prediction model by leveraging the advantages of the XGBoost algorithm in the ensemble learning approach.
Downloads
References
W. L. N. Husain, S. Buraena, R. F. Syamsu, N. Nurmadilla, and A. F. Arsal, “Gambaran Faktor Risiko Penyakit Jantung Koroner Akut Di RSUD Aloe Saboe Gorontalo,” Indones. J. Heal., vol. 2, no. 03, pp. 162–173, 2022, doi: 10.33368/inajoh.v2i03.75.
S. N. Tarmizi, “Kenali Gejala Jantung Sejak Dini,” Kemetrian Kesehatan. [Online]. Available: https://kemkes.go.id/id/rilis-kesehatan/kenali-gejala-jantung-sejak-dini
M. Melyani, L. N. Tambunan, and E. P. Baringbing, “Hubungan Usia dengan Kejadian Penyakit Jantung Koroner pada Pasien Rawat Jalan di RSUD dr. Doris Sylvanus Provinsi Kalimantan Tengah,” J. Surya Med., vol. 9, no. 1, pp. 119–125, 2023, doi: 10.33084/jsm.v9i1.5158.
K. Astle, “Experiencing pain after a heart attack may predict long-term survival,” American Heart Association. [Online]. Available: https://newsroom.heart.org/news/experiencing-pain-after-a-heart-attack-may-predict-long-term-survival
A. Yogianto, A. Homaidi, and Z. Fatah, “Implementasi Metode K-Nearest Neighbors (KNN) untuk Klasifikasi Penyakit Jantung,” G-Tech J. Teknol. Terap., vol. 8, no. 3, pp. 1720–1728, 2024, doi: 10.33379/gtech.v8i3.4495.
A. Sepharni, I. E. Hendrawan, and C. Rozikin, “Klasifikasi Penyakit Jantung dengan Menggunakan Algoritma C4.5,” STRING (Satuan Tulisan Ris. dan Inov. Teknol., vol. 7, no. 2, p. 117, 2022, doi: 10.30998/string.v7i2.12012.
D. Cahya Putri Buani, “Penerapan Algoritma Naïve Bayes dengan Seleksi Fitur Algoritma Genetika Untuk Prediksi Gagal Jantung,” Evolusi J. Sains dan Manaj., vol. 9, no. 2, pp. 43–48, 2021, doi: 10.31294/evolusi.v9i2.11141.
E. Edric and S. P. Tamba, “Prediksi Penyakit Gagal Jantung Dengan Menggunakan Random Forest,” J. Sist. Inf. dan Ilmu Komput. Prima(JUSIKOM PRIMA), vol. 5, no. 2, pp. 176–181, 2022, doi: 10.34012/jurnalsisteminformasidanilmukomputer.v5i2.2445.
A. Putranto, N. L. Azizah, and A. I. Ratna Ika, “Sistem Prediksi Penyakit Jantung Berbasis Web Menggunakan Metode SVM dan Framework Streamlit,” J. Penerapan Sist. Inf. (Komputer Manajemen), vol. 4, no. 2, pp. 442–452, 2023, [Online]. Available: https://archive.ics.uci.edu/ml/datasets/heart+disease
A. Razaki, Y. H. Chrisnanto, and M. Melina, “Penanganan Outlier Pada Metode Algoritma K- Nearest Neighbors (KNN) Dengan Metode Kernel Density Estimation Pada Kasus Penyakit Diabetes,” INTECOMS J. Inf. Technol. Comput. Sci., vol. 7, no. 4, pp. 1177–1188, 2024, doi: 10.31539/intecoms.v7i4.10866.
R. Efendi, A. Junaidi, and A. M. Rizki, “Penentuan Pusat Klaster Secara Otomatis Pada Algoritma Density Peaks Clustering Berbasis Metode Inter Quartile Range,” J. Inform. dan Tek. Elektro Terap., vol. 12, no. 3, 2024, doi: 10.23960/jitet.v12i3.4997.
M. Nijhuis and I. van Lelyveld, “Outlier Detection with Reinforcement Learning for Costly to Verify Data,” Entropy, vol. 25, no. 6, pp. 1–17, 2023, doi: 10.3390/e25060842.
V. Magar, D. Ruikar, S. Bhoite, and R. Mente, “Innovative Inter Quartile Range-based Outlier Detection and Removal Technique for Teaching Staff Performance Feedback Analysis,” J. Eng. Educ. Transform., vol. 37, no. 3, pp. 176–184, 2024, doi: 10.16920/jeet/2024/v37i3/24013.
M. D. Maulana, A. I. Hadiana, and F. R. Umbara, “Algoritma Xgboost Untuk Klasifikasi Kualitas Air Minum,” JATI (Jurnal Mhs. Tek. Inform., vol. 7, no. 5, pp. 3251–3256, 2024, doi: 10.36040/jati.v7i5.7308.
F. A. P. Prasetya and P. H. P. Rosa, “Klasifikasi Kegagalan Pembayaran Kredit Nasabah Bank dengan Algoritma XGBoost,” in Seminar Nasional Informatika Bela Negara (SANTIKA), 2024, pp. 110–115.
D. T. Murdiansyah, “Prediksi Stroke Menggunakan Extreme Gradient Boosting,” JIKO (Jurnal Inform. dan Komputer), vol. 8, no. 2, p. 419, 2024, doi: 10.26798/jiko.v8i2.1295.
M. Salsabil, N. Lutvi, and A. Eviyanti, “Implementasi Data Mining dalam Melakukan Prediksi Penyakit Diabetes Menggunakan Metode Random Forest dan XGBoost,” J. Ilm. KOMPUTASI, vol. 23, no. 1, pp. 51–58, 2024.
D. Kurnia, M. Itqan Mazdadi, D. Kartini, R. Adi Nugroho, and F. Abadi, “Seleksi Fitur dengan Particle Swarm Optimization pada Klasifikasi Penyakit Parkinson Menggunakan XGBoost,” J. Teknol. Inf. dan Ilmu Komput., vol. 10, no. 5, pp. 1083–1094, 2023, doi: 10.25126/jtiik.20231057252.
R. I. Borman, R. Napianto, N. Nugroho, D. Pasha, Y. Rahmanto, and Y. E. P. Yudoutomo, “Implementation of PCA and KNN Algorithms in the Classification of Indonesian Medicinal Plants,” in International Conference on Computer Science, Information Technology and Electrical Engineering (ICOMITEE), IEEE, 2021, pp. 46–50.
M. Yasser, “Heart Disease Dataset,” Kaggle. [Online]. Available: https://www.kaggle.com/datasets/yasserh/heart-disease-dataset
R. I. Borman and M. Wati, “Penerapan Data Maining Dalam Klasifikasi Data Anggota Kopdit Sejahtera Bandarlampung Dengan Algoritma Naïve Bayes,” J. Ilm. Fak. Ilmu Komput., vol. 9, no. 1, pp. 25–34, 2020.
R. I. Borman, F. Rossi, D. Alamsyah, R. Nuraini, and Y. Jusman, “Classification of Medicinal Wild Plants Using Radial Basis Function Neural Network with Least Mean Square,” in International Conference on Electronic and Electrical Engineering and Intelligent System (ICE3IS), IEEE, 2022.
Fredianto and D. A. P. Putri, “Comparison of the interquartile range algorithm and local outlier factor on Australian weather data sets,” AIP Conf. Proc., vol. 2727, no. 1, p. 40010, Jun. 2023, doi: 10.1063/5.0141897.
M. Ridwansyah and H. Zakaria, “Implementasi Algortima Gradient Boosting Pada Aplikasi Hutang Piutang Perorangan Secara Berbasis Web Untuk Meningkatan Akurasi Prediksi Pelunasan Hutang (Studi Kasus : PT Naila Kreasi Mandiri),” JURIHUM J. Inov. dan Hum., vol. 1, no. 4, pp. 440–451, 2023.
A. F. L. Ptr, M. M. Siregar, and I. Daniel, “Analysis of Gradient Boosting, XGBoost, and CatBoost on Mobile Phone Classification,” J. Comput. Networks, Archit. High Perform. Comput., vol. 6, no. 2, pp. 661–670, 2024.
Bila bermanfaat silahkan share artikel ini
Berikan Komentar Anda terhadap artikel Model Prediksi Penyakit Jantung dengan Penanganan Outlier Menggunakan Interquartile Range dan Extreme Gradient Boosting
Pages: 1398−1406
Copyright (c) 2025 Lukman Azhari, Novi Wulandari, Feru Adiningrat, Allan Desi Alexander

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).






















