Prediksi Kinerja Akademik Siswa Bimbingan Belajar Menggunakan Algoritma Extreme Gradient Boosting (XGBoost)
Abstract
Improving the quality of education has become a primary focus in addressing the increasingly complex challenges of the educational landscape. One promising approach to support data-driven decision-making is the prediction of students' academic performance using machine learning algorithms. This study aims to develop a classification model for predicting students' academic performance by leveraging the Extreme Gradient Boosting (XGBoost) algorithm. The dataset used was obtained from SMPN 1 Gunung Halu and includes both academic and non-academic attributes of students. Five key features were selected: initial grades, midterm grades, final grades, student behavior, and attendance. Data preprocessing involved feature selection, handling missing values, transforming categorical variables using label encoding, and balancing the classes using the SMOTE method. The XGBoost model was then trained using an 80:20 data split and hyperparameter tuning was performed using Grid Search. Evaluation results showed that the model achieved an accuracy of 84% with balanced F1-scores across all classes. The model outperformed other algorithms such as Bagging and Random Forest. With its strong accuracy and stability, the XGBoost model has the potential to serve as a tool for identifying students who require academic intervention. This study makes a significant contribution to the development of AI-based education systems and provides a foundation for the application of machine learning in improving the quality of secondary-level learning.
Downloads
References
T. Gori, A. Sunyoto, and H. Al Fatta, “Preprocessing Data dan Klasifikasi untuk Prediksi Kinerja Akademik Siswa,” J. Teknol. Inf. dan Ilmu Komput., vol. 11, no. 1, pp. 215–224, 2024, doi: 10.25126/jtiik.20241118074.
O. Ojajuni et al., “Predicting Student Academic Performance Using Machine Learning,” in Lecture Notes in Computer Science, Springer International Publishing, 2021, pp. 481–491. doi: https://doi.org/10.1007/978-3-030-87013-3_36.
P. Septiana Rizky, R. Haiban Hirzi, and U. Hidayaturrohman, “Perbandingan Metode LightGBM dan XGBoost dalam Menangani Data dengan Kelas Tidak Seimbang,” J Stat. J. Ilm. Teor. dan Apl. Stat., vol. 15, no. 2, pp. 228–236, 2022, doi: 10.36456/jstat.vol15.no2.a5548.
G. Dwilestari, “Prediksi Adopsi Hewan Peliharaan Menggunakan Metode Xgboost,” JATI (Jurnal Mhs. Tek. Inform., vol. 8, no. 4, pp. 7470–7477, 2024, doi: 10.36040/jati.v8i4.10337.
A. A. Nababan, M. Jannah, M. Aulina, and D. Andrian, “Prediksi Kualitas Udara Menggunakan Xgboost Dengan Synthetic Minority Oversampling Technique (Smote) Berdasarkan Indeks Standar Pencemaran Udara (Ispu),” JTIK (Jurnal Tek. Inform. Kaputama), vol. 7, no. 1, pp. 214–219, 2023, doi: 10.59697/jtik.v7i1.66.
B. P. Salsabila, P. Belva, C. Trana, N. Ramadhani, A. P. Sari, and N. Bayes, “Penerapan Algoritma Naive Bayes Terhadap Kualitas Udara Di Jakarta dan Rekomendasi Aktivitas Masyarakat,” JATI (Jurnal Mhs. Tek. Inform., vol. 8, no. 6, pp. 11732–11738, 2024, doi: 10.36040/jati.v8i6.11592.
E. J. Sudarman and S. Budi, “Pengembangan Model Kecerdasan Mesin Extreme Gradient Boosting untuk Prediksi Keberhasilan Studi Mahasiswa,” J. Strateg., vol. 5, no. 2, pp. 297–314, 2023.
M. R. Givari, M. R. Sulaeman, and Y. Umaidah, “Perbandingan Algoritma SVM, Random Forest Dan XGBoost Untuk Penentuan Persetujuan Pengajuan Kredit,” J. Nuansa Inform., vol. 16, no. 1, pp. 141–149, 2022, doi: 10.25134/nuansa.v16i1.5406.
C. E. Sukmawati, A. Fitri, N. Masruriyah, and A. R. Juwita, “Efektivitas Algoritma AdaBoost dan XGBoost pada Dataset Obesitas Populasi Dewasa,” Jambura J. Informatics, vol. 6, no. 2, pp. 101–111, 2024, doi: 10.37905/jji.
M. T. I. Rahmayani, “Analisis Clustering Tingkat Keparahan Penyakit Pasien Menggunakan Algoritma K-Means,” J. Inov. Tek. Inform., vol. 1, no. 2, pp. 40–44, 2018.
E. Muningsih and S. Kiswati, “Penerapan Metode K-Means Untuk Clustering Produk Online Shop Dalam Penentuan Stok Barang,” J. Bianglala Inform., vol. 1, no. 3, pp. 10–17, 2015.
C. Romero and S. Ventura, “Educational Data Mining and Learning Analytics: An Updated Survey,” Wiley Interdiscip. Rev. Data Min. Knowl. Discov. Min. Knowl. Discov., vol. 10, no. 3, pp. 1–21, 2020, doi: 10.1002/widm.1355.
G. N. Sihombing, “Optimasi Sistem Pemantauan Akademik Siswa dengan Collaborative Filtering dan Decision Tree,” J. Sains Student Res., vol. 2, no. 6, pp. 369–375, 2024, doi: https://doi.org/10.61722/jssr.v2i6.2982.
K. Aditya, A. Wisnu, and A. M. A. Rahim, “Analisis Perbandingan Algoritma XGBoost Dan Algoritma Random Forest Untuk Klasifikasi Data Kesehatan Mental,” Log. J. Ilmu Komput. dan Pendidik., vol. 2, no. 5, pp. 808–818, 2024.
Y. N. Sukmaningtyas, R. M. Akbar, and G. Rohma, “Penerapan Predictive Analytics untuk Analisis Faktor-faktor yang Mempengaruhi Performa Akademik Siswa,” Arcitech J. Comput. Sci. Artif. Intell., vol. 4, no. 2, pp. 127–145, 2024, doi: http://dx.doi.org/10.29240/arcitech.v4i2.12048.
K. D. K. Wardhani and M. Akbar, “Diabetes Risk Prediction Using Extreme Gradient Boosting (XGBoost),” J. Online Inform., vol. 7, no. 2, pp. 244–250, 2022, doi: 10.15575/join.v7i2.970.
D. A. Anggoro and S. S. Mukti, “Performance Comparison of Grid Search and Random Search Methods for Hyperparameter Tuning in Extreme Gradient Boosting Algorithm to Predict Chronic Kidney Failure,” Int. J. Intell. Eng. Syst., vol. 14, no. 6, pp. 198–207, 2021, doi: 10.22266/ijies2021.1231.19.
A. Charu, Data Mining The Textbook. Springer, 2016. doi: 10.1007/978-3-319-14142-8 ISBN.
N. Syahfitri, E. Budianita, A. Nazir, and I. Afrianty, “Pengelompokan Produk Berdasarkan Data Persediaan Barang Menggunakan Metode Elbow dan K-Medoid,” KLIK Kaji. Ilm. Inform. dan Komput., vol. 4, no. 3, pp. 1668–1675, 2023, doi: 10.30865/klik.v4i3.1525.
K. Boros and Z. Kmetty, “Identifying Missing Data Handling Methods withText Mining,” Int. J. Data Sci. Anal., 2024, doi: 10.1007/s41060-024-00582-1.
M. Buda, A. Maki, and M. A. Mazurowski, “A systematic Study of the Class Imbalance Problem in Convolutional Neural Networks,” Neural Networks, vol. 106, pp. 249–259, 2018, doi: 10.1016/j.neunet.2018.07.011.
A. I. Pradana et al., “Perbandingan Data Untuk Memprediksi Ketepatan Studi Berdasarkan Atribut Keluarga Menggunakan Machine Learning,” JIKA (Jurnal Informatics) Univ. Muhammadiyah Tangerang, vol. 8, no. 2, pp. 221–228, 2024, doi: 10.31000/jika.v8i2.10752.
M. R. Santoso and P. Musa, “Rekomendasi Kesehatan Janin Dengan Penerapan Algoritma C5.0 Menggunakan Classifying Cardiotocography Dataset,” J. Simantec, vol. 9, no. 2, pp. 65–76, 2021, doi: 10.21107/simantec.v9i2.10730.
M. Ardianti, O. D. Nurhayati, and B. Warsito, “Model Prediksi Kinerja Siswa Berdasarkan Data Log LMS Menggunakan Ensemble Machine Learning,” J. Sains dan Teknol., vol. 12, no. 3, pp. 562–571, 2024, doi: 10.23887/jstundiksha.v12i3.59816.
M. Yağcı, “Educational Data Mining: Prediction of Students’ Academic Performance Using Machine Learning Algorithms,” Yağcı Smart Learn. Environ., vol. 9, no. 1, pp. 1–19, 2022, doi: 10.1186/s40561-022-00192-z.
Bila bermanfaat silahkan share artikel ini
Berikan Komentar Anda terhadap artikel Prediksi Kinerja Akademik Siswa Bimbingan Belajar Menggunakan Algoritma Extreme Gradient Boosting (XGBoost)
Pages: 584-594
Copyright (c) 2025 Muhammad Bayu Ardi Alfarizi, Wina Witanti, Agus Komarudin

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).





















