Implementasi Algoritma Random Forest untuk Prediksi Waktu Penyelesaian Hafalan Al-Qur’an Berbasis Website
Abstract
Manual monitoring of Quranic memorization (tahfizh) in Islamic boarding schools faces efficiency challenges due to large student populations and paper-based record keeping. This study aims to implement the Random Forest algorithm to predict the estimated completion time of Quranic memorization in a web-based monitoring system at Madrasah Aliyah Jam’iyyah Islamiyyah, Tangerang Selatan, Indonesia. The dataset consists of 12,458 memorization logs from 271 students during March 1 to May 3, 2026. Feature engineering produced 15 features covering Quranic text complexity, student memorization history, and temporal patterns; Spearman correlation feature selection reduced these to 13 significant features. The model was optimized using GridSearchCV and evaluated with MAE, RMSE, R², MAPE, and 5-fold cross-validation. Random Forest achieves R²=0.8966, MAE=0.6141, and MAPE=6.98% on the 70:30 split, outperforming Decision Tree (R²=0.8879) and matching XGBoost (R²=0.8964). Cross-validation yields CV R²=0.9004, confirming stable generalization. Feature importance analysis indicates that student learning habits are stronger predictors than Quranic text complexity. As a practical contribution, the model is integrated into a web-based monitoring system enabling teachers to track all students’ progress centrally and receive automated memorization completion estimates, enhancing the effectiveness of guidance in tahfizh institutions.
Downloads
References
Adiwisastra, M. F., Darmawan, I., & Nurjanah, D. (2024). Dataset development for Quran memorizers: A step towards data-driven personalized learning path. Indonesian Journal of Electrical Engineering and Informatics (IJEEI), 14(1). https://doi.org/10.52549/ijeei.v14i1.7343
Ahmed, E. (2024). Student performance prediction using machine learning algorithms. Applied Computational Intelligence and Soft Computing, 2024, Article 4067721. https://doi.org/10.1155/2024/4067721
Chen, M., & Liu, Z. (2024). Predicting performance of students by optimizing tree components of random forest using genetic algorithm. Heliyon, 10(12), e32570. https://doi.org/10.1016/j.heliyon.2024.e32570
Chen, Y., & Jin, K. (2024). Educational performance prediction with Random Forest and innovative optimizers: A data mining approach. International Journal of Advanced Computer Science and Applications (IJACSA), 15(3). https://doi.org/10.14569/IJACSA.2024.0150308
Hariyanto, F., Budiman, T., Yulianto, A. B., & Yasin, V. (2025). Designing a web-based information system for monitoring final projects. International Journal of Engineering, Science and Information Technology (IJESTY), 5(2), 142–153. https://doi.org/10.52088/ijesty.v5i2.799
Haryono, K., Rajagede, R. A., & Negara, M. U. A. S. (2023). Quran memorization technologies and methods: Literature review. International Journal on Informatics for Development (IJID), 11(1), 192–201. https://doi.org/10.14421/ijid.2022.3746
Jiang, J., Zhang, X., & Yuan, Z. (2024). Feature selection for classification with Spearman's rank correlation coefficient-based self-information in divergence-based fuzzy rough sets. Expert Systems with Applications, 249, 123633. https://doi.org/10.1016/j.eswa.2024.123633
Khairy, D., Alharbi, N., Amasha, M. A., Areed, M. F., Alkhalaf, S., & Abougalala, R. A. (2024). Prediction of student exam performance using data mining classification algorithms. Education and Information Technologies, 29, 21621–21645. https://doi.org/10.1007/s10639-024-12619-w
Kumar, M., Singh, N., Wadhwa, J., Singh, P., Kumar, G., & Qtaishat, A. (2024). Utilizing Random Forest and XGBoost data mining algorithms for anticipating students' academic performance. International Journal of Modern Education and Computer Science (IJMECS), 16(2), 29–44. https://doi.org/10.5815/ijmecs.2024.02.03
Nalenz, M., Rodemann, T., & Augustin, T. (2024). Learning de-biased regression trees and forests from complex samples. Machine Learning, 113, 3379–3398. https://doi.org/10.1007/s10994-023-06439-1
Noviandy, T. R., Zahriah, Z., Yandri, E., Jalil, Z., Yusuf, M., Yusof, N. I. S. M., Lala, A., & Idroes, R. (2024). Machine learning for early detection of dropout risks and academic excellence: A stacked classifier approach. Journal of Educational Management and Learning, 2(1), 28–34. https://doi.org/10.60084/jeml.v2i1.191
Nurdin, M., & Fauziah, F. (2024). Analytical study forecasting students using Random Forest and linear regression algorithms. Sinkron: Jurnal dan Penelitian Teknik Informatika, 8(4), 2369–2378. https://doi.org/10.33395/sinkron.v8i4.13886
Shantal, M., Othman, Z., & Abu Bakar, A. (2025). Missing data imputation using correlation coefficient and min-max normalization weighting. Intelligent Data Analysis, 29(1). https://doi.org/10.3233/IDA-230140
Villar, A., & de Andrade, C. R. V. (2024). Supervised machine learning algorithms for predicting student dropout and academic success: A comparative study. Discover Artificial Intelligence, 4(1), 2. https://doi.org/10.1007/s44163-023-00079-z
Zhao, Y., Zhao, F., Liu, S., & Zhang, J. (2024). Research on student performance prediction based on Random Forest algorithm. Proceedings of the 2024 International Symposium on Artificial Intelligence for Education (SAIE 2024), 8–13. https://doi.org/10.1145/3700297.3700385
Ahmad, A. I., Nugroho, D. A., Aliyu, S. A., & Abdullahi, A. M. (2025). Predicting student dropout in e-learning using simple machine learning and explainable data analysis. LogicLink: Journal of Artificial Intelligence and Multimedia in Informatics, 2(2), 138–148. https://doi.org/10.28918/logiclink.v2i2.13116
Waheed, H., Hassan, S.-U., Aljohani, N. R., Hardman, J., Alelyani, S., & Nawaz, R. (2023). Predicting academic performance of students from VLE big data using deep learning models. Computers in Human Behavior, 142, 107704. https://doi.org/10.1016/j.chb.2023.107704
Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32. https://doi.org/10.1023/A:1010933404324
Géron, A. (2022). Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow (3rd ed.). O'Reilly Media. https://www.oreilly.com/library/view/hands-on-machine-learning/9781098125967/
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., & Duchesnay, É. (2011). Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12, 2825–2830. https://www.jmlr.org/papers/v12/pedregosa11a.html
Bila bermanfaat silahkan share artikel ini
Berikan Komentar Anda terhadap artikel Implementasi Algoritma Random Forest untuk Prediksi Waktu Penyelesaian Hafalan Al-Qur’an Berbasis Website
Pages: 2257-2266
Copyright (c) 2026 Muchtar Ali Anwar, Sholihin Sholihin, Muhammad Nur Fajriansyah, Wisnu Chairin

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).













