Analisis Kinerja Algoritma Machine Learning Untuk Klasifikasi Potensi FRAUD Klaim Layanan Kesehatan Rumah Sakit
Abstract
Fraud in healthcare claims represents a critical challenge that undermines the efficiency and sustainability of Indonesia's National Health Insurance (JKN) system. This study contributes a large-scale comparative evaluation of five machine learning algorithms for classifying potential fraud in BPJS Kesehatan claims, namely Artificial Neural Network (ANN), Support Vector Machine (SVM), Random Forest (RF), XGBoost + SMOTE, and Logistic Regression (LR). A novelty of this study lies in applying the SMOTE technique in conjunction with XGBoost to address class imbalance in fraud datasets. The dataset consists of over 200,000 claim entries, which have undergone data cleaning, normalization, and feature selection. Performance was assessed using precision, recall on fraud class (positive), f1-score, accuracy, and confusion matrix visualizations to capture classification error distribution. Results demonstrate that ANN and XGBoost + SMOTE are superior in detecting fraudulent claims with high recall, while SVM achieves the most balanced performance in terms of precision and sensitivity. Random Forest and Logistic Regression serve as moderate baselines but are less effective in identifying complex fraud patterns. This study contributes to the development of a more adaptive and efficient fraud detection system based on machine learning, with practical implications for strengthening the automatic verification system used by BPJS Kesehatan.
Downloads
References
P. Gunadi, S. Hasan, I. N. Arda, E. D. Rahayu, and Z. Andika, “Implementasi Sistem Verifikasi Digital Untuk Pencegahan Fraud Pada Program Jkn (Studi Kasus Rs Syariah Jakarta),” Media Riset Bisnis Ekonomi Sains dan Terapan, vol. 2, no. 4, pp. 1–8, Jan. 2025, doi: 10.71312/mrbest.v2i4.208.
B. Santoso, J. Hendrartini, B. U. Djoko Rianto, and L. Trisnantoro, “System For Detection Of National Healthcare Insurance Fraud Based On Computer Application,” Public Health of Indonesia, vol. 4, no. 2, pp. 46–56, Jun. 2018, doi: 10.36685/phi.v4i2.199.
R. Annisa, S. Winda, E. Dwisaputro, and K. N. Isnaini, “Mengatasi Defisit Dana Jaminan Sosial Kesehatan Melalui Perbaikan Tata Kelola,” INTEGRITAS: Jurnal Antikorupsi, vol. 6, no. 2, pp. 209–224, 2020, doi: 10.32697/integritas.v6i2.664.
I. Sugiarti, I. Masturoh, and F. Fadly, “Menelusuri Potensi Fraud dalam Jaminan Kesehatan Nasional melalui Rekam Medis di Rumah Sakit,” Jurnal Kesehatan Vokasional, vol. 7, no. 1, p. 42, Feb. 2022, doi: 10.22146/jkesvo.69056.
M. S. M. S. Humasak Tommy Argo Simanjuntak, “Deteksi Fraud Pada Klaim Layanan Rumah Sakit Menggunakan Model Neural Network,” Journal of Applied Technology and Informatics Indonesia , vol. vol 1, 2021, doi: 10.54074/jati.v1i1.30.
A. C. Nugraha and M. I. Irawan, “Komparasi Deteksi Kecurangan pada Data Klaim Asuransi Pelayanan Kesehatan Menggunakan Metode Support Vector Machine (SVM) dan Extreme Gradient Boosting (XGBoost),” Jurnal Sains dan Seni ITS, vol. 12, no. 1, May 2023, doi: 10.12962/j23373520.v12i1.107032.
E. Nabrawi and A. Alanazi, “Fraud Detection in Healthcare Insurance Claims Using Machine Learning,” Risks, vol. 11, no. 9, Sep. 2023, doi: 10.3390/risks11090160.
C. E. D. Vanegas, J. C. G. Mejía, F. A. V. Agudelo, and D. E. S. Duran, “A Representation Based on Essence for the CRISP-DM Methodology,” Computacion y Sistemas, vol. 27, no. 3, pp. 675–689, 2023, doi: 10.13053/CyS-27-3-3446.
R. Winurputra and D. E. Ratnawati, “Peramalan Penjualan Produk Menggunakan Extreme Gradient Boosting ( Xgboost ) Dan Kerangka Kerja Crisp-Dm Untuk Pengoptimalan Manajemen Persediaan ( Studi Kasus : Ub Mart ) Product Sales Forecasting Using Extreme Gradient Boosting ( Xgboost ) And Crisp-Dm ,” vol. 12, no. 2, pp. 417–428, 2025, doi: 10.25126/jtiik.2025129451.
J. T. Hancock, R. A. Bauder, H. Wang, and T. M. Khoshgoftaar, “Explainable machine learning models for Medicare fraud detection,” J Big Data, vol. 10, no. 1, p. 154, 2023, doi: 10.1186/s40537-023-00821-5.
C. Li, “Preprocessing Methods and Pipelines of Data Mining: An Overview,” Jun. 2019, doi: https://doi.org/10.48550/arXiv.1906.08510.
H. N. Rofiq, “Deteksi Inefisiensi pada Klaim BPJS Kesehatan dengan menggunakan Machine Learning,” Jurnal Jaminan Kesehatan Nasional, vol. 3, no. 1, Jun. 2023, doi: 10.53756/jjkn.v3i1.134.
L. N. Hapsari and N. Rokhman, “Anomaly Detection of Hospital Claim Using Support Vector Regression,” IJCCS (Indonesian Journal of Computing and Cybernetics Systems), vol. 18, no. 1, p. 1, 2024, doi: 10.22146/ijccs.91857.
E. Nabrawi and A. Alanazi, “Fraud Detection in Healthcare Insurance Claims Using Machine Learning,” Risks, vol. 11, no. 9, Sep. 2023, doi: 10.3390/risks11090160.
A. Karim, “Perbandingan Prediksi Kemiskinan di Indonesia Menggunakan Support Vector Machine (SVM) dengan Regresi Linear,” Jurnal Sains Matematika dan Statistika, vol. 6, no. 1, p. 107, 2020, doi: 10.24014/jsms.v6i1.9259.
H. A. Salman, A. Kalakech, and A. Steiti, “Random Forest Algorithm Overview,” Babylonian Journal of Machine Learning, vol. 2024, pp. 69–79, 2024, doi: 10.58496/bjml/2024/007.
S. Parthasarathy, A. Raj Lakshminarayanan, A. Abdul Azeez Khan, K. Javubar Sathick, and V. Jayaraman, “Detection of Health Insurance Fraud using Bayesian Optimized XGBoost,” International Journal of Safety and Security Engineering, vol. 13, no. 5, pp. 853–861, Nov. 2023, doi: 10.18280/ijsse.130509.
R. Tyasnurita and A. Y. M. Pamungkas, “Deteksi Diabetik Retinopati menggunakan Regresi Logistik,” ILKOM Jurnal Ilmiah, vol. 12, no. 2, pp. 130–135, Aug. 2020, doi: 10.33096/ilkom.v12i2.578.130-135.
R. Sahila, T. Widiharih, and I. T. Utami, “Analisis Klasifikasi Menggunakan Regresi Logistik Biner Dan Algoritma Naïve Bayes Classifier Pada Penyakit Hipertensi,” Jurnal Gaussian, vol. 13, no. 2, pp. 319–327, Nov. 2024, doi: 10.14710/j.gauss.13.2.319-327.
F. R. Suprihati, “Analisis Klasifikasi SMS Spam Menggunakan Logistic Regression,” Jurnal Sistem Cerdas, vol. 4, no. 3, pp. 155–160, 2021, doi: 10.37396/jsc.v4i3.166.
A. M. Syahbani, W. Firdaus, and K. A. Musodo, “A Comparative Study of Data Mining Algorithms for Fraud Detection in Financial Transactions,” Sinkron, vol. 9, no. 2, pp. 814–821, Apr. 2025, doi: 10.33395/sinkron.v9i2.14645.
Bila bermanfaat silahkan share artikel ini
Berikan Komentar Anda terhadap artikel Analisis Kinerja Algoritma Machine Learning Untuk Klasifikasi Potensi FRAUD Klaim Layanan Kesehatan Rumah Sakit
Pages: 296-307
Copyright (c) 2025 Imanullah Ali Ubed, Iwan Syarif, Ferry Astika Saputra

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).





















