Implementasi Algoritma Random Forest untuk Prediksi Waktu Penyelesaian Hafalan Al-Qur’an Berbasis Website


  • Muchtar Ali Anwar * Mail Universitas Pamulang, Tangerang Selatan, Indonesia
  • Sholihin Sholihin Universitas Pamulang, Tangerang Selatan, Indonesia
  • Muhammad Nur Fajriansyah Universitas Pamulang, Tangerang Selatan, Indonesia
  • Wisnu Chairin Universitas Pamulang, Tangerang Selatan, Indonesia
  • (*) Corresponding Author
Keywords: Random Forest; Quran Memorization; Monitoring System; Prediction; Machine Learning

Abstract

Manual monitoring of Quranic memorization (tahfizh) in Islamic boarding schools faces efficiency challenges due to large student populations and paper-based record keeping. This study aims to implement the Random Forest algorithm to predict the estimated completion time of Quranic memorization in a web-based monitoring system at Madrasah Aliyah Jam’iyyah Islamiyyah, Tangerang Selatan, Indonesia. The dataset consists of 12,458 memorization logs from 271 students during March 1 to May 3, 2026. Feature engineering produced 15 features covering Quranic text complexity, student memorization history, and temporal patterns; Spearman correlation feature selection reduced these to 13 significant features. The model was optimized using GridSearchCV and evaluated with MAE, RMSE, R², MAPE, and 5-fold cross-validation. Random Forest achieves R²=0.8966, MAE=0.6141, and MAPE=6.98% on the 70:30 split, outperforming Decision Tree (R²=0.8879) and matching XGBoost (R²=0.8964). Cross-validation yields CV R²=0.9004, confirming stable generalization. Feature importance analysis indicates that student learning habits are stronger predictors than Quranic text complexity. As a practical contribution, the model is integrated into a web-based monitoring system enabling teachers to track all students’ progress centrally and receive automated memorization completion estimates, enhancing the effectiveness of guidance in tahfizh institutions.

Downloads

Download data is not yet available.

References

Adiwisastra, M. F., Darmawan, I., & Nurjanah, D. (2024). Dataset development for Quran memorizers: A step towards data-driven personalized learning path. Indonesian Journal of Electrical Engineering and Informatics (IJEEI), 14(1). https://doi.org/10.52549/ijeei.v14i1.7343

Ahmed, E. (2024). Student performance prediction using machine learning algorithms. Applied Computational Intelligence and Soft Computing, 2024, Article 4067721. https://doi.org/10.1155/2024/4067721

Chen, M., & Liu, Z. (2024). Predicting performance of students by optimizing tree components of random forest using genetic algorithm. Heliyon, 10(12), e32570. https://doi.org/10.1016/j.heliyon.2024.e32570

Chen, Y., & Jin, K. (2024). Educational performance prediction with Random Forest and innovative optimizers: A data mining approach. International Journal of Advanced Computer Science and Applications (IJACSA), 15(3). https://doi.org/10.14569/IJACSA.2024.0150308

Hariyanto, F., Budiman, T., Yulianto, A. B., & Yasin, V. (2025). Designing a web-based information system for monitoring final projects. International Journal of Engineering, Science and Information Technology (IJESTY), 5(2), 142–153. https://doi.org/10.52088/ijesty.v5i2.799

Haryono, K., Rajagede, R. A., & Negara, M. U. A. S. (2023). Quran memorization technologies and methods: Literature review. International Journal on Informatics for Development (IJID), 11(1), 192–201. https://doi.org/10.14421/ijid.2022.3746

Jiang, J., Zhang, X., & Yuan, Z. (2024). Feature selection for classification with Spearman's rank correlation coefficient-based self-information in divergence-based fuzzy rough sets. Expert Systems with Applications, 249, 123633. https://doi.org/10.1016/j.eswa.2024.123633

Khairy, D., Alharbi, N., Amasha, M. A., Areed, M. F., Alkhalaf, S., & Abougalala, R. A. (2024). Prediction of student exam performance using data mining classification algorithms. Education and Information Technologies, 29, 21621–21645. https://doi.org/10.1007/s10639-024-12619-w

Kumar, M., Singh, N., Wadhwa, J., Singh, P., Kumar, G., & Qtaishat, A. (2024). Utilizing Random Forest and XGBoost data mining algorithms for anticipating students' academic performance. International Journal of Modern Education and Computer Science (IJMECS), 16(2), 29–44. https://doi.org/10.5815/ijmecs.2024.02.03

Nalenz, M., Rodemann, T., & Augustin, T. (2024). Learning de-biased regression trees and forests from complex samples. Machine Learning, 113, 3379–3398. https://doi.org/10.1007/s10994-023-06439-1

Noviandy, T. R., Zahriah, Z., Yandri, E., Jalil, Z., Yusuf, M., Yusof, N. I. S. M., Lala, A., & Idroes, R. (2024). Machine learning for early detection of dropout risks and academic excellence: A stacked classifier approach. Journal of Educational Management and Learning, 2(1), 28–34. https://doi.org/10.60084/jeml.v2i1.191

Nurdin, M., & Fauziah, F. (2024). Analytical study forecasting students using Random Forest and linear regression algorithms. Sinkron: Jurnal dan Penelitian Teknik Informatika, 8(4), 2369–2378. https://doi.org/10.33395/sinkron.v8i4.13886

Shantal, M., Othman, Z., & Abu Bakar, A. (2025). Missing data imputation using correlation coefficient and min-max normalization weighting. Intelligent Data Analysis, 29(1). https://doi.org/10.3233/IDA-230140

Villar, A., & de Andrade, C. R. V. (2024). Supervised machine learning algorithms for predicting student dropout and academic success: A comparative study. Discover Artificial Intelligence, 4(1), 2. https://doi.org/10.1007/s44163-023-00079-z

Zhao, Y., Zhao, F., Liu, S., & Zhang, J. (2024). Research on student performance prediction based on Random Forest algorithm. Proceedings of the 2024 International Symposium on Artificial Intelligence for Education (SAIE 2024), 8–13. https://doi.org/10.1145/3700297.3700385

Ahmad, A. I., Nugroho, D. A., Aliyu, S. A., & Abdullahi, A. M. (2025). Predicting student dropout in e-learning using simple machine learning and explainable data analysis. LogicLink: Journal of Artificial Intelligence and Multimedia in Informatics, 2(2), 138–148. https://doi.org/10.28918/logiclink.v2i2.13116

Waheed, H., Hassan, S.-U., Aljohani, N. R., Hardman, J., Alelyani, S., & Nawaz, R. (2023). Predicting academic performance of students from VLE big data using deep learning models. Computers in Human Behavior, 142, 107704. https://doi.org/10.1016/j.chb.2023.107704

Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32. https://doi.org/10.1023/A:1010933404324

Géron, A. (2022). Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow (3rd ed.). O'Reilly Media. https://www.oreilly.com/library/view/hands-on-machine-learning/9781098125967/

Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., & Duchesnay, É. (2011). Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12, 2825–2830. https://www.jmlr.org/papers/v12/pedregosa11a.html


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Implementasi Algoritma Random Forest untuk Prediksi Waktu Penyelesaian Hafalan Al-Qur’an Berbasis Website

Dimensions Badge
Article History
Published: 2026-05-25
Abstract View: 12 times
PDF Download: 4 times
Issue
Section
Articles