Penerapan Algoritma K-Nearest Neighbor (KNN) Untuk Klasifikasi Resiko Penyakit Jantung


  • Aprillia Wulan Nanda Dari * Mail Universitas Amikom Yogyakarta, Yogyakarta, Indonesia
  • Ika Nur Fajri Universitas Amikom Yogyakarta, Yogyakarta, Indonesia
  • (*) Corresponding Author
Keywords: Data Mining; K-Nearest Neighbor; Risk Classification; Evaluation Model; Heart Disease

Abstract

Heart disease is one of the deadliest diseases in the world, where there is a disruption in the function of the heart and blood vessels that causes chest pain, irregular heartbeat, and difficulty breathing. According to data from the World Health Organization (WHO), there are 17.9 million deaths each year due to heart disease. The difficulty in classifying heart disease accurately and quickly is a significant problem. From this problem, researchers conducted data mining research using the KNN algorithm to classify the risk of heart disease by taking data from the official Kaggle website. In this study, there are 4 stages, namely data collection, model formation, mode evaluation, and prediction interface. By using the KNN algorithm, the analysis results obtained an accuracy of 83%, precision 0.88, recall 0.77 and f1-score 0.82. With the results of the model evaluation data, it shows that the classification of heart disease risk using the KNN algorithm has quite good performance. The results of the modeling are then presented in the form of a website by deploying the model.

Downloads

Download data is not yet available.

References

S. F. apt. Yasmin Azhar, “Penyakit Jantung.” https://www.klikdokter.com/penyakit/masalah-jantung-dan-pembuluh-darah/penyakit-jantung (accessed Sep. 30, 2024).

Humas Fakultas Kedokteran Universitas Brawijaya, “World Heart Day 2023: Use Heart Know Heart,” Prasetya Online, 2023. https://prasetya.ub.ac.id/world-heart-day-2023-use-heart-know-heart/ (accessed Sep. 30, 2024).

R. Setiawan, “Apa itu Data Mining dan Bagaimana Metodenya?” https://www.dicoding.com/blog/apa-itu-data-mining/ (accessed Jun. 13, 2024).

U. Nijunnihayah and S. S. Hilabi, “Implementation of the K-Nearest Neighbor Algorithm to Predict Sales of Medical Devices in Medical Devices Implementasi Algoritma K-Nearest Neighbor untuk Prediksi Penjualan Alat Kesehatan pada Media Alkes,” vol. 4, no. April, pp. 695–701, 2024.

R. Rismala, I. Ali, and A. Rizki Rinaldi, “Penerapan Metode K-Nearest Neighbor Untuk Prediksi Penjualan Sepeda Motor Terlaris,” JATI (Jurnal Mhs. Tek. Inform., vol. 7, no. 1, pp. 585–590, 2023, doi: 10.36040/jati.v7i1.6419.

M. Fansyuri, “Analisa algoritma klasifikasi k-nearest neighbor dalam menentukan nilai akurasi terhadap kepuasan pelanggan (study kasus pt. Trigatra komunikatama),” Humanika J. Ilmu Sos. Pendidikan, dan Hum., vol. 3, no. 1, pp. 29–33, 2020.

M. Heydarian, T. E. Doyle, and R. Samavi, “MLCM: Multi-Label Confusion Matrix,” IEEE Access, vol. 10, pp. 19083–19095, 2022, doi: 10.1109/ACCESS.2022.3151048.

N. Hafidhoh, A. P. Atmaja, G. N. Syaifuddiin, I. B. Sumafta, S. M. Pratama, and H. N. Khasanah, “Machine Learning untuk Prediksi Kegagalan Mesin dalam Predictive Maintenance System,” J. Masy. Inform., vol. 15, no. 1, pp. 56–66, 2024, doi: 10.14710/jmasif.15.1.63641.

D. Lapp, “Heart Disease Dataset,” 2019. https://www.kaggle.com/datasets/johnsmith88/heart-disease-dataset (accessed May. 2, 2024).

Syahril Dwi Prasetyo, Shofa Shofiah Hilabi, and Fitri Nurapriani, “Analisis Sentimen Relokasi Ibukota Nusantara Menggunakan Algoritma Naïve Bayes dan KNN,” J. KomtekInfo, vol. 10, pp. 1–7, 2023, doi: 10.35134/komtekinfo.v10i1.330.

J. Muliawan and E. Dazki, “Sentiment Analysis of Indonesia’S Capital City Relocation Using Three Algorithms: Naïve Bayes, Knn, and Random Forest,” J. Tek. Inform., vol. 4, no. 5, pp. 1227–1236, 2023, doi: 10.52436/1.jutif.2023.4.5.1436.

Fritz Al, “Pra-pemrosesan dan Visualisasi Data untuk Model Machine Learning,” ICHI.PRO, Feb. 12, 2020. https://ichi.pro/id/pra-pemrosesan-dan-visualisasi-data-untuk-model-machine-learning-156112039253027# (accessed Jun. 12, 2024). .

M. Ramdhani et al., “Prediksi Capaian Bulanan Pajak Daerah Kabupaten Bandung Barat Menggunakan Metode Logistic Regression,” J. Inf. Syst. Res., vol. 5, no. 4, pp. 881–890, 2024, doi: 10.47065/josh.v5i4.5330.

P. R. Sihombing and A. M. Arsani, “Comparison of Machine Learning Methods in Classifying Poverty in Indonesia in 2018,” J. Tek. Inform., vol. 2, no. 1, pp. 51–56, 2021, doi: 10.20884/1.jutif.2021.2.1.52.

D. Cahyanti, A. Rahmayani, and S. A. Husniar, “Analisis performa metode Knn pada Dataset pasien pengidap Kanker Payudara,” Indones. J. Data Sci., vol. 1, no. 2, pp. 39–43, 2020, doi: 10.33096/ijodas.v1i2.13.

R. Y. Parapat, E. Sandjaya, S. A. Nurfadhilah, M. M. Fetok, N. Hikmah, and Salafffudin, “Scientica Scientica,” Eval. Keselam. Kerja Di PT. Timah Ind. Dengan Menggunakan Metod. HIRARC, vol. 2, pp. 251–255, 2024.

M. Gamma, A. Hakim, and F. Irwiensyah, “Analisis Sentimen Terhadap Ulasan Pengguna Pada Aplikasi BCA Mobile Menggunakan Metode Naïve Bayes,” J. Inf. Syst. Res., vol. 5, no. 4, pp. 911–921, 2024, doi: 10.47065/josh.v5i4.5343.

M. Tam, “Analisis Penerimaan Pengguna E-Wallet DANA Menggunakan,” vol. 5, no. 4, pp. 891–900, 2024, doi: 10.47065/josh.v5i4.5334.

R. Rifaldi, J. Indra, A. R. Pratama, and A. R. Juwita, “Analisis Sentimen Pemboikotan Produk dengan Pendekatan Algoritma Naïve Bayes Media Sosial X,” J. Inf. Syst. Res., vol. 5, no. 4, pp. 940–946, 2024, doi: 10.47065/josh.v5i4.5420.

T. Handayani, S. Bahri, and Kasliono, “Implementasi Metode K-Medoids Dalam Pengelompokan Kepuasan Masyarakat Terhadap Pelayanan Rumah Sakit,” J. Inf. Syst. Res., vol. 5, no. 4, pp. 1006–1017, 2024, doi: 10.47065/josh.v5i4.5331.

M. Istifarsari, L. T. Ningrum, and L. Utari, “Implementasi Algoritma Apriori Menggunakan Cross-Industry Standar Process for Data-Mining Untuk Menentukan Pola Pembelian Obat,” J. Inf. Syst. Res., vol. 5, no. 4, pp. 1063–1075, 2024, doi: 10.47065/josh.v5i4.5263.

R. Harahap, M. Irpan, M. A. Dinata, L. Efrizoni, and Rahmaddeni, “Perbandingan Algoritma Random Forest Dan Xgboost Untuk Klasifikasi Penyakit Paru-Paru Berdasarkan Data Demografi Pasien,” J. Ilm. Betrik, vol. 15, no. 02, pp. 130–141, 2024.

S. Anggraini, M. Akbar, A. Wijaya, H. Syaputra, and M. Sobri, “Klasifikasi Gejala Penyakit Coronavirus Disease 19 (COVID-19) Menggunakan Machine Learning,” J. Softw. Eng. Ampera, vol. 2, no. 1, pp. 57–68, 2021, doi: 10.51519/journalsea.v2i1.105.

A. Handika Permana, F. Rakhmat Umbara, and F. Kasyidi, “Klasifikasi Penyakit Jantung Tipe Kardiovaskular Menggunakan Adaptive Synthetic Sampling dan Algoritma Extreme Gradient Boosting,” Build. Informatics, Technol. Sci., vol. 6, no. 1, pp. 499–508, 2024, doi: 10.47065/bits.v6i1.5421.

J. Sihombing, “Klasifikasi Data Antroprometri Individu Menggunakan Algoritma Naïve Bayes Classifier,” BIOS J. Teknol. Inf. dan Rekayasa Komput., vol. 2, no. 1, pp. 1–10, 2021, doi: 10.37148/bios.v2i1.15.

M. C. Yustina, I. N. Ichsan, and G. M. Suranegara, “Implementasi Algoritma Genetika Proses Mutasi Differential Evolution Pada Sistem Penjadwalan Mata Pelajaran,” KLIK Kaji. Ilm. Inform. dan Komput., vol. 5, no. 1, pp. 116–130, 2024, doi: 10.30865/klik.v5i1.2109.


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Penerapan Algoritma K-Nearest Neighbor (KNN) Untuk Klasifikasi Resiko Penyakit Jantung

Dimensions Badge
Article History
Submitted: 2024-10-08
Published: 2024-10-23
Abstract View: 612 times
PDF Download: 623 times
How to Cite
Dari, A., & Fajri, I. (2024). Penerapan Algoritma K-Nearest Neighbor (KNN) Untuk Klasifikasi Resiko Penyakit Jantung. Journal of Information System Research (JOSH), 6(1), 417-425. https://doi.org/10.47065/josh.v6i1.6038
Section
Articles