Prediksi Kelulusan Mahasiswa Menggunakan Algoritma Decision Tree C4.5 Berbasis Data Akademik dengan Validasi 10-Fold


  • Nurhasanah Nurhasanah * Mail Universitas Pamulang, Tangerang Selatan, Indonesia
  • Risky Dwi Setiyawan Universitas Pamulang, Tangerang Selatan, Indonesia
  • Doni Hermawan Universitas Pamulang, Tangerang Selatan, Indonesia
  • Oki Herdiyanto Universitas Pamulang, Tangerang Selatan, Indonesia
  • (*) Corresponding Author
Keywords: Data Mining; Decision Tree; C4.5 Algorithm; Student Graduation Prediction; Validasi 10-Fold Cross Validation; Academic Data

Abstract

Predicting student graduation outcomes is an important indicator for evaluating academic quality and the effectiveness of learning processes in higher education. This study aims to analyze and predict student graduation status based on academic data using the C4.5 Decision Tree algorithm. The dataset consists of 100 students from the Informatics Study Program at Universitas Pamulang, with five main attributes: Grade Point Average (GPA), attendance percentage, assignment scores, midterm examination scores, and final examination scores. The research stages include data cleaning, data transformation, model construction, and model evaluation using the 10-fold cross-validation technique to ensure performance stability. The experimental results show an accuracy of 88.74%, precision of 91.79%, recall of 95.34%, and an AUC value of 0.94, indicating that the model demonstrates strong discriminatory ability in classifying graduation outcomes. GPA and final examination scores were identified as the most influential attributes in determining graduation predictions. The resulting predictive model is expected to serve as a foundation for developing an early warning system that enables universities to identify at-risk students and support data-driven decision-making to improve educational quality.

Downloads

Download data is not yet available.

References

Alturki, S., & Alturki, N. (2021). Using Educational Data Mining to Predict Students’ Academic Performance for Applying Early Interventions. Journal of Information Technology Education: Innovations in Practice, 20, 121–137. https://doi.org/10.28945/4835

Alvian Setiono, S., & Purwanto, E. (2025). Prediksi Kelulusan Mahasiswa Menggunakan Algoritma Decision Tree. Prosiding Seminar Nasional Teknologi Informasi Dan Bisnis, 401–406. https://doi.org/10.47701/4q3z9j41

Ammellia Putri, Z., Nurina Sari, B., & Maulana, I. (2024). Prediksi Kelulusan Mahasiswa Fasilkom Unsika Menggunakan Algoritma C4.5. JATI (Jurnal Mahasiswa Teknik Informatika), 8(4), 7727–7736. https://doi.org/10.36040/jati.v8i4.10464

Anggraiwan, Y., & Siregar, B. (2022). Klasifikasi Harga Mobil Menggunakan Metode Decision Tree Algoritma C4.5. Computatio: Journal of Computer Science and Information Systems, 6(2), 70–79. https://doi.org/10.24912/computatio.v6i2.19994

Cahyani, N., Irsyada, R., Firman, A., Inayaturohmat, F., & Pramesti, R. F. (2025). Profit Prediction for Skincare Resellers Using the Exponential Smoothing Method. Brilliance: Research of Artificial Intelligence, 5(2), 628–635. https://doi.org/10.47709/brilliance.v5i2.6585

Damanik, S. F., Wanto, A., & Gunawan, I. (2022). Penerapan Algoritma Decision Tree C4.5 untuk Klasifikasi Tingkat Kesejahteraan Keluarga pada Desa Tiga Dolok. Jurnal Krisnadana, 1(2), 21–32. https://doi.org/10.58982/krisnadana.v1i2.108

Dengen, C. N., Kusrini, K., & Luthfi, E. T. (2020). Implementasi Decision Tree Untuk Prediksi Kelulusan Mahasiswa Tepat Waktu. SISFOTENIKA, 10(1), 1. https://doi.org/10.30700/jst.v10i1.484

Frazier, A., Silva, J., Meilak, R., Sahoo, I., Broda, M., & Chan, D. (2023). Decision Tree-Based Predictive Models for Academic Achievement Using College Students’ Support Networks. Journal of Data Science, 557–577. https://doi.org/10.6339/21-JDS1033

Guanin-Fajardo, J. H., Guaña-Moya, J., & Casillas, J. (2024). Predicting Academic Success of College Students Using Machine Learning Techniques. Data, 9(4), 60. https://doi.org/10.3390/data9040060

Hartati, S., Ramdhan, N. A., & SAN, H. A. (2022). Prediksi Kelulusan Mahasiswa Dengan Naïve Bayes Dan Feature Selection Information Gain. Jurnal Ilmiah Intech : Information Technology Journal of UMUS, 4(02), 223–234. https://doi.org/10.46772/intech.v4i02.889

Hassan, M. A., Muse, A. H., & Nadarajah, S. (2024). Predicting Student Dropout Rates Using Supervised Machine Learning: Insights from the 2022 National Education Accessibility Survey in Somaliland. Applied Sciences, 14(17), 7593. https://doi.org/10.3390/app14177593

Kalita, E., Alfarwan, A. M., el Aouifi, H., Kukkar, A., Hussain, S., Ali, T., & Gaftandzhieva, S. (2025). Predicting student academic performance using Bi-LSTM: a deep learning framework with SHAP-based interpretability and statistical validation. Frontiers in Education, 10. https://doi.org/10.3389/feduc.2025.1581247

Muzakir, A., Syaputra, H., & Panjaitan, F. (2022). A Comparative Analysis of Classification Algorithms for Cyberbullying Crime Detection: An Experimental Study of Twitter Social Media in Indonesia. Scientific Journal of Informatics, 9(2), 133–138. https://doi.org/10.15294/sji.v9i2.35149

Pratiwi, U. M., & Ibad, M. (2022). Klasifikasi Faktor Yang Berpengaruh Dalam Kehamilan Tidak Diinginkan Menggunakan Metode Algoritma Decision Tree. Jurnal Lebesgue: Jurnal Ilmiah Pendidikan Matematika, Matematika Dan Statistika, 3(2), 406–416. https://doi.org/10.46306/lb.v3i2.129

Suyanto, R. V. A., Eduard Rusdianto, & Ernawati. (2024). Penerapan Algoritma Decision Tree C4.5 dan Metode AdaBoost Untuk Prediksi Kelulusan Mahasiswa. Jurnal Informatika Atma Jogja, 5(1), 75–86. https://doi.org/10.24002/jiaj.v5i1.8646

Uci Suriani. (2023). Penerapan Data Mining untuk Memprediksi Tingkat Kelulusan Mahasiswa Menggunakan Algoritma Decision Tree C4.5. Journal of Computer and Information Systems Ampera, Vol. 4 No. 2 (2023): Journal of Computer and Information Systems Ampera, 1. https://doi.org/10.51519/journalcisa.v4i2.393

Villar, A., & de Andrade, C. R. V. (2024). Supervised machine learning algorithms for predicting student dropout and academic success: a comparative study. Discover Artificial Intelligence, 4(1), 2. https://doi.org/10.1007/s44163-023-00079-z

Yağcı, M. (2022). Educational data mining: prediction of students’ academic performance using machine learning algorithms. Smart Learning Environments, 9(1), 11. https://doi.org/10.1186/s40561-022-00192-z

Yatimah, M. N. (2021). Implementasi Data Mining untuk Prediksi Kelulusan Tepat Waktu Mahasiswa STIMIK ESQ Menggunakan Decision Tree C4.5. JUMANJI (Jurnal Masyarakat Informatika Unjani), 5(2), 89. https://doi.org/10.26874/jumanji.v5i2.95

Zou, W., Zhong, W., Du, J., & Yuan, L. (2025). Prediction of Student Academic Performance Utilizing a Multi-Model Fusion Approach in the Realm of Machine Learning. Applied Sciences, 15(7), 3550. https://doi.org/10.3390/app15073550


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Prediksi Kelulusan Mahasiswa Menggunakan Algoritma Decision Tree C4.5 Berbasis Data Akademik dengan Validasi 10-Fold

Dimensions Badge
Article History
Published: 2025-11-25
Abstract View: 0 times
PDF Download: 0 times
Section
Articles