Predictive Modeling of National University Rankings Using Ensemble Machine Learning and Multi-Dimensional Institutional Performance Indicators: Evidence from Japan
Abstract
The global higher education landscape is becoming increasingly competitive in attracting outstanding students, qualified faculty, and international research collaborations. University ranking systems serve as strategic instruments for assessing institutional performance and as a basis for public policy. However, traditional ranking approaches employing linear aggregate scores often oversimplify the complex relationships among indicators such as research, internationalization, and graduate outcomes. This study develops a data-driven predictive model to map the non-linear relationships among university performance indicators. The research employs a quantitative predictive analytics approach using a dataset of 52 Japanese universities from the 2024–2026 period, encompassing the variables Research_Impact_Score, Employment_Rate, Intl_Student_Ratio, Institution_Age, Institution_Type, and Region, with National_Rank as the target variable. The research stages include data preprocessing (handling missing values, encoding, scaling), feature engineering (including Institutional Age), regression model development (Linear, Ridge, Lasso, SVR) as well as ensemble models (Random Forest and Gradient Boosting), evaluation using RMSE, MAE, and R², and explainable analysis based on feature importance. The results indicate that the Gradient Boosting model delivers the best performance with an RMSE of 1.175117, MAE of 1.087856, and R² of 0.994988, followed by Random Forest with an RMSE of 1.436536 and R² of 0.992510. Traditional linear regression models demonstrate significantly lower performance (R² 0.657519), confirming the superiority of non-linear approaches in modeling complex relationships among indicators. Stability testing using K-Fold Cross Validation yields an average RMSE of 1.1045 with a difference of 0.4493 between folds, indicating model consistency. Feature contribution analysis reveals that Research_Impact_Score is the dominant factor with a contribution of 97.94%, followed by Employment_Rate at 1.81%, while internationalization indicators and geographical factors contribute minimally. These findings confirm that research performance constitutes the primary determinant of university rankings, whereas employability and internationalization serve as supporting factors. This study demonstrates that ensemble-based machine learning models are effective in predicting national rankings accurately and interpretably. This approach offers a multidimensional evaluation framework that is more representative than linear aggregate scores, and provides policy implications for enhancing research quality, curriculum relevance, and internationalization strategies of higher education institutions.
Downloads
References
J. M. Candilasa and K. T. Onahon, “Global University Rankings: Characterization of Higher Education Institution’s Competitiveness,” Int. J. Res. Innov. Soc. Sci., vol. 8, no. 12, pp. 3267–3279, 2024, doi: 10.47772/IJRISS.2024.8120270.
T. Teixeira and C. T. Picinin, “University rankings: Proposal for a future research agenda through a systematic literature review,” Sustainability, vol. 16, no. 7, p. 3043, 2024, doi: 10.3390/su16073043.
N. H. Ling, C. J. Chen, C. S. Teh, D. S. John, L. C. Ch’ng, and Y. F. Lay, “Global Trends of Educational Data Mining in Online Learning,” Int. J. Technol. Educ., vol. 6, no. 4, pp. 656–680, 2023, doi: 10.46328/ijte.558.
A. Welch and E. Aziz, “Higher Education in Indonesia,” in International Handbook on Education in South East Asia, Springer, Singapore, 2023, pp. 1–30. doi: 10.1007/978-981-16-8136-3_41-2.
L. Prasojo, L. Yuliana, and L. Ary Prihandoko, “Research Performance in Higher Education: A PLS-SEM Analysis of Research Atmosphere, Collaboration, Funding, Competence, and Output, Especially for Science and Engineering Facilities in Indonesian Universities,” ASEAN J. Sci. Eng., vol. 5, no. 1, pp. 123–144, Jan. 2025, doi: 10.17509/ajse.v5i1.81224.
A. R. Dina, N. Alifah, and L. Paz, “Leveraging big data for student success and institutional growth: Memanfaatkan big data untuk kesuksesan mahasiswa dan pertumbuhan institusi,” J. MENTARI Manajemen, Pendidik. dan Teknol. Inf., vol. 3, no. 2, pp. 147–156, 2025, doi: 10.33050/mentari.v3i2.746.
L. J. Wardley, E. Rajabi, S. H. Amin, and M. Ramesh, “A machine learning approach feature to forecast the future performance of the universities in canada,” Mach. Learn. with Appl., vol. 16, p. 100548, 2024, doi: 10.1016/j.mlwa.2024.100548.
W. Y. Leong, “Leveraging Artificial Intelligence to Predict Future Trends in University Rankings,” Educ. Innov. Emerg. Technol., vol. 5, no. 1, pp. 1–11, 2025, doi: 10.35745/eiet2025v05.01.0004.
E. López-Meneses, P. C. Mellado-Moreno, C. Gallardo Herrerías, and N. Pelícano-Piris, “Educational Data Mining and Predictive Modeling in the Age of Artificial Intelligence: An In-Depth Analysis of Research Dynamics,” Computers, vol. 14, no. 2, p. 68, 2025, doi: 10.3390/computers14020068.
A. Rushiti, A. Luma, Y. Januzaj, A. Aliu, H. Snopçe, and A. Sefidanoski, “The Republic of North Macedonia’s Research Ranking Platform for Academic Staff and Universities,” SAR J., vol. 7, no. 1, pp. 3–11, 2024, doi: 10.18421/SAR71-01.
A. Yonezawa, “Japan’s higher education policies under global challenges,” Asian Econ. Policy Rev., vol. 18, no. 2, pp. 220–237, 2023, doi: 10.1111/aepr.12421.
K. C. Gonugunta and K. Leo, “Role of data-driven decision making in enhancing higher education performance: A comprehensive analysis of analytics in institutional management,” Int. J. Acta Inform., vol. 3, no. 1, pp. 149–159, 2024, doi: https://www.yuktabpublisher.com/index.php/IJAI/article/view/236.
F. Hu, L. Qiu, S. Wei, H. Zhou, I. A. Bathuure, and H. Hu, “The spatiotemporal evolution of global innovation networks and the changing position of China: a social network analysis based on cooperative patents,” R&D Manag., vol. 54, no. 3, pp. 574–589, 2024, doi: 10.1111/radm.12662.
B. Ćudić, P. Alešnik, and D. Hazemali, “Factors impacting university–industry collaboration in European countries,” J. Innov. Entrep., vol. 11, no. 1, p. 33, 2022, doi: 10.1186/s13731-022-00226-3.
S. Sihono, M. F. Isbah, and P. Pangestuti, “Komparasi Standar Penilaian Pendidikan di Negara-negara Maju:(Studi Kasus Finlandia, Jepang, dan Singapura),” Cetta J. Ilmu Pendidik., vol. 8, no. 1, pp. 388–401, 2025, doi: 10.37329/cetta.v8i1.3830.
G. Feng, M. Fan, and Y. Chen, “Analysis and prediction of students’ academic performance based on educational data mining,” IEEE Access, vol. 10, pp. 19558–19571, 2022, doi: 10.1109/ACCESS.2022.3151652.
M. Arunkumar, K. Rajkumar, W. R. Jeyaseelan, and N. A. Natraj, “Data Mining, Machine Learning, and Statistical Modeling for Predictive Analytics with Behavioral Big Data,” Teh. Vjesn., vol. 32, no. 1, pp. 72–77, 2025, doi: 10.17559/TV-20231102001073.
M. Gul, W. Abbasi, M. Babar, A. Aljohani, and M. Arif, “Data driven decisions in education using a comprehensive machine learning framework for student performance prediction,” Discov. Comput., vol. 28, no. 1, Jul. 2025, doi: 10.1007/s10791-025-09585-3.
S. khan et al., “Predictive analytics in education- enhancing student achievement through machine learning,” Soc. Sci. Humanit. Open, vol. 12, p. 101824, 2025, doi: 10.1016/j.ssaho.2025.101824.
M. Bhushan, U. Verma, C. Garg, and A. Negi, “Machine Learning-Based Academic Result Prediction System,” Int. J. Softw. Innov., vol. 12, no. 1, 2024, doi: 10.4018/IJSI.334715.
P. Koukaras and C. Tjortjis, “Data Preprocessing and Feature Engineering for Data Mining: Techniques, Tools, and Best Practices,” AI, vol. 6, no. 10, p. 257, 2025, doi: 10.3390/ai6100257.
Bila bermanfaat silahkan share artikel ini
Berikan Komentar Anda terhadap artikel Predictive Modeling of National University Rankings Using Ensemble Machine Learning and Multi-Dimensional Institutional Performance Indicators: Evidence from Japan
Pages: 2715-2726
Copyright (c) 2026 Bernadus Gunawan Sudarsono, Raditya Galih Whendasmoro

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).





















