Prediksi Kinerja Akademik Siswa Bimbingan Belajar Menggunakan Algoritma Extreme Gradient Boosting (XGBoost)


  • Muhammad Bayu Ardi Alfarizi * Mail Universitas Jenderal Achmad Yani, Cimahi, Indonesia
  • Wina Witanti Universitas Jenderal Achmad Yani, Cimahi, Indonesia
  • Agus Komarudin Universitas Jenderal Achmad Yani, Cimahi, Indonesia
  • (*) Corresponding Author
Keywords: Academic Performance Prediction; Hyperparameter; SMOTE; Data Mining; Machine Learning

Abstract

Improving the quality of education has become a primary focus in addressing the increasingly complex challenges of the educational landscape. One promising approach to support data-driven decision-making is the prediction of students' academic performance using machine learning algorithms. This study aims to develop a classification model for predicting students' academic performance by leveraging the Extreme Gradient Boosting (XGBoost) algorithm. The dataset used was obtained from SMPN 1 Gunung Halu and includes both academic and non-academic attributes of students. Five key features were selected: initial grades, midterm grades, final grades, student behavior, and attendance. Data preprocessing involved feature selection, handling missing values, transforming categorical variables using label encoding, and balancing the classes using the SMOTE method. The XGBoost model was then trained using an 80:20 data split and hyperparameter tuning was performed using Grid Search. Evaluation results showed that the model achieved an accuracy of 84% with balanced F1-scores across all classes. The model outperformed other algorithms such as Bagging and Random Forest. With its strong accuracy and stability, the XGBoost model has the potential to serve as a tool for identifying students who require academic intervention. This study makes a significant contribution to the development of AI-based education systems and provides a foundation for the application of machine learning in improving the quality of secondary-level learning.

Downloads

Download data is not yet available.

References

T. Gori, A. Sunyoto, and H. Al Fatta, “Preprocessing Data dan Klasifikasi untuk Prediksi Kinerja Akademik Siswa,” J. Teknol. Inf. dan Ilmu Komput., vol. 11, no. 1, pp. 215–224, 2024, doi: 10.25126/jtiik.20241118074.

O. Ojajuni et al., “Predicting Student Academic Performance Using Machine Learning,” in Lecture Notes in Computer Science, Springer International Publishing, 2021, pp. 481–491. doi: https://doi.org/10.1007/978-3-030-87013-3_36.

P. Septiana Rizky, R. Haiban Hirzi, and U. Hidayaturrohman, “Perbandingan Metode LightGBM dan XGBoost dalam Menangani Data dengan Kelas Tidak Seimbang,” J Stat. J. Ilm. Teor. dan Apl. Stat., vol. 15, no. 2, pp. 228–236, 2022, doi: 10.36456/jstat.vol15.no2.a5548.

G. Dwilestari, “Prediksi Adopsi Hewan Peliharaan Menggunakan Metode Xgboost,” JATI (Jurnal Mhs. Tek. Inform., vol. 8, no. 4, pp. 7470–7477, 2024, doi: 10.36040/jati.v8i4.10337.

A. A. Nababan, M. Jannah, M. Aulina, and D. Andrian, “Prediksi Kualitas Udara Menggunakan Xgboost Dengan Synthetic Minority Oversampling Technique (Smote) Berdasarkan Indeks Standar Pencemaran Udara (Ispu),” JTIK (Jurnal Tek. Inform. Kaputama), vol. 7, no. 1, pp. 214–219, 2023, doi: 10.59697/jtik.v7i1.66.

B. P. Salsabila, P. Belva, C. Trana, N. Ramadhani, A. P. Sari, and N. Bayes, “Penerapan Algoritma Naive Bayes Terhadap Kualitas Udara Di Jakarta dan Rekomendasi Aktivitas Masyarakat,” JATI (Jurnal Mhs. Tek. Inform., vol. 8, no. 6, pp. 11732–11738, 2024, doi: 10.36040/jati.v8i6.11592.

E. J. Sudarman and S. Budi, “Pengembangan Model Kecerdasan Mesin Extreme Gradient Boosting untuk Prediksi Keberhasilan Studi Mahasiswa,” J. Strateg., vol. 5, no. 2, pp. 297–314, 2023.

M. R. Givari, M. R. Sulaeman, and Y. Umaidah, “Perbandingan Algoritma SVM, Random Forest Dan XGBoost Untuk Penentuan Persetujuan Pengajuan Kredit,” J. Nuansa Inform., vol. 16, no. 1, pp. 141–149, 2022, doi: 10.25134/nuansa.v16i1.5406.

C. E. Sukmawati, A. Fitri, N. Masruriyah, and A. R. Juwita, “Efektivitas Algoritma AdaBoost dan XGBoost pada Dataset Obesitas Populasi Dewasa,” Jambura J. Informatics, vol. 6, no. 2, pp. 101–111, 2024, doi: 10.37905/jji.

M. T. I. Rahmayani, “Analisis Clustering Tingkat Keparahan Penyakit Pasien Menggunakan Algoritma K-Means,” J. Inov. Tek. Inform., vol. 1, no. 2, pp. 40–44, 2018.

E. Muningsih and S. Kiswati, “Penerapan Metode K-Means Untuk Clustering Produk Online Shop Dalam Penentuan Stok Barang,” J. Bianglala Inform., vol. 1, no. 3, pp. 10–17, 2015.

C. Romero and S. Ventura, “Educational Data Mining and Learning Analytics: An Updated Survey,” Wiley Interdiscip. Rev. Data Min. Knowl. Discov. Min. Knowl. Discov., vol. 10, no. 3, pp. 1–21, 2020, doi: 10.1002/widm.1355.

G. N. Sihombing, “Optimasi Sistem Pemantauan Akademik Siswa dengan Collaborative Filtering dan Decision Tree,” J. Sains Student Res., vol. 2, no. 6, pp. 369–375, 2024, doi: https://doi.org/10.61722/jssr.v2i6.2982.

K. Aditya, A. Wisnu, and A. M. A. Rahim, “Analisis Perbandingan Algoritma XGBoost Dan Algoritma Random Forest Untuk Klasifikasi Data Kesehatan Mental,” Log. J. Ilmu Komput. dan Pendidik., vol. 2, no. 5, pp. 808–818, 2024.

Y. N. Sukmaningtyas, R. M. Akbar, and G. Rohma, “Penerapan Predictive Analytics untuk Analisis Faktor-faktor yang Mempengaruhi Performa Akademik Siswa,” Arcitech J. Comput. Sci. Artif. Intell., vol. 4, no. 2, pp. 127–145, 2024, doi: http://dx.doi.org/10.29240/arcitech.v4i2.12048.

K. D. K. Wardhani and M. Akbar, “Diabetes Risk Prediction Using Extreme Gradient Boosting (XGBoost),” J. Online Inform., vol. 7, no. 2, pp. 244–250, 2022, doi: 10.15575/join.v7i2.970.

D. A. Anggoro and S. S. Mukti, “Performance Comparison of Grid Search and Random Search Methods for Hyperparameter Tuning in Extreme Gradient Boosting Algorithm to Predict Chronic Kidney Failure,” Int. J. Intell. Eng. Syst., vol. 14, no. 6, pp. 198–207, 2021, doi: 10.22266/ijies2021.1231.19.

A. Charu, Data Mining The Textbook. Springer, 2016. doi: 10.1007/978-3-319-14142-8 ISBN.

N. Syahfitri, E. Budianita, A. Nazir, and I. Afrianty, “Pengelompokan Produk Berdasarkan Data Persediaan Barang Menggunakan Metode Elbow dan K-Medoid,” KLIK Kaji. Ilm. Inform. dan Komput., vol. 4, no. 3, pp. 1668–1675, 2023, doi: 10.30865/klik.v4i3.1525.

K. Boros and Z. Kmetty, “Identifying Missing Data Handling Methods withText Mining,” Int. J. Data Sci. Anal., 2024, doi: 10.1007/s41060-024-00582-1.

M. Buda, A. Maki, and M. A. Mazurowski, “A systematic Study of the Class Imbalance Problem in Convolutional Neural Networks,” Neural Networks, vol. 106, pp. 249–259, 2018, doi: 10.1016/j.neunet.2018.07.011.

A. I. Pradana et al., “Perbandingan Data Untuk Memprediksi Ketepatan Studi Berdasarkan Atribut Keluarga Menggunakan Machine Learning,” JIKA (Jurnal Informatics) Univ. Muhammadiyah Tangerang, vol. 8, no. 2, pp. 221–228, 2024, doi: 10.31000/jika.v8i2.10752.

M. R. Santoso and P. Musa, “Rekomendasi Kesehatan Janin Dengan Penerapan Algoritma C5.0 Menggunakan Classifying Cardiotocography Dataset,” J. Simantec, vol. 9, no. 2, pp. 65–76, 2021, doi: 10.21107/simantec.v9i2.10730.

M. Ardianti, O. D. Nurhayati, and B. Warsito, “Model Prediksi Kinerja Siswa Berdasarkan Data Log LMS Menggunakan Ensemble Machine Learning,” J. Sains dan Teknol., vol. 12, no. 3, pp. 562–571, 2024, doi: 10.23887/jstundiksha.v12i3.59816.

M. Yağcı, “Educational Data Mining: Prediction of Students’ Academic Performance Using Machine Learning Algorithms,” Yağcı Smart Learn. Environ., vol. 9, no. 1, pp. 1–19, 2022, doi: 10.1186/s40561-022-00192-z.


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Prediksi Kinerja Akademik Siswa Bimbingan Belajar Menggunakan Algoritma Extreme Gradient Boosting (XGBoost)

Dimensions Badge
Article History
Submitted: 2025-05-19
Published: 2025-06-25
Abstract View: 13 times
PDF Download: 4 times
How to Cite
Alfarizi, M. B., Witanti, W., & Komarudin, A. (2025). Prediksi Kinerja Akademik Siswa Bimbingan Belajar Menggunakan Algoritma Extreme Gradient Boosting (XGBoost). Building of Informatics, Technology and Science (BITS), 7(1), 584-594. https://doi.org/10.47065/bits.v7i1.7387
Section
Articles