Perancangan Skema Evaluasi untuk Sistem Rekomendasi Berita Menggunakan Metrik Precision, Recall, dan F1‑Score
Abstract
Standardization of evaluation for news recommendation systems remains minimal, despite the importance of these systems in addressing information overload in the digital era. This research was designed to develop a comprehensive evaluation scheme for content-based news recommendation systems using five primary evaluation metrics: Precision, Recall, F1-Score, Hit Rate, and Mean Reciprocal Rank (MRR). The study utilized the News Category Dataset from HuffPost, which contains 209,527 news articles across 41 categories. Evaluation was conducted by simulating user feedback through three approaches: random baseline as a comparison reference, content-based filtering with TF-IDF, and Approximate Nearest Neighbor (ANN) based on Faiss. For the final evaluation, 10,000 randomly selected articles were used. Results demonstrate that TF-IDF achieved Precision@10 of 20.20%, Recall@10 of 0.57%, F1-Score@10 of 1.10%, and Hit Rate@10 of 69%, while ANN yielded Precision@10 of 11.50%, Recall@10 of 0.33%, F1-Score@10 of 0.63%, and Hit Rate@10 of 43%. The Hit Rate@10 metric shows that TF-IDF successfully provides at least one relevant article in 69% of queries, compared to ANN which achieves 43% and Random Baseline which only achieves 27%. TF-IDF surpasses ANN by 1.76 times in terms of Precision@10 (20.20% vs 11.50%) and 1.73 times in terms of Recall@10 (0.57% vs 0.33%). In terms of computational efficiency, TF-IDF achieves a runtime of 0.0100 seconds per recommendation, only 1.04 times faster than ANN which achieves 0.0104 seconds, showing a very minimal difference. The primary contribution of this research is a structured evaluation scheme using five complementary metrics that can be applied to various news recommendation systems and provides a framework for objective comparison among different methods.
Downloads
References
Bobadilla, J., Ortega, F., Hernando, A., & Gutiérrez, A. (2013). Recommender Systems Survey. Knowledge-Based Systems, 46, 109–132. https://doi.org/10.1016/j.knosys.2013.03.012
Boka, T. F., Niu, Z., & Neupane, R. B. (2024). A Survey of Sequential Recommendation Systems: Techniques, Evaluation, And Future Directions. Information Systems, 125. https://doi.org/10.1016/J.IS.2024.102427
Burke, R. (2002). Hybrid Recommender Systems: Survey And Experiments. User Modelling and User-Adapted Interaction, 12(4), 331–370. https://doi.org/10.1023/A:1021240730564
Gomez-Uribe, C. A., & Hunt, N. (2015). The Netflix Recommender System: Algorithms, Business Value, And Innovation. ACM Transactions on Management Information Systems, 6(4), 1. https://doi.org/10.1145/2843948
He, X., Deng, K., Wang, X., Li, Y., Zhang, Y. D., & Wang, M. (2020). LightGCN: Simplifying and Powering Graph Convolution Network for Recommendation. SIGIR 2020 - Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 639–648. https://doi.org/10.1145/3397271.3401063
Hou, Y., Hu, B., Zhang, Z., & Zhao, W. X. (2022). CORE: Simple And Effective Session-Based Recommendation Within Consistent Representation Space. SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, 1796–1801. https://doi.org/10.1145/3477495.3531955
Johnson, J., Douze, M., & Jegou, H. (2017). Billion-Scale Similarity Search with GPUs. IEEE Transactions on Big Data, 7(3), 535–547. https://doi.org/10.1109/TBDATA.2019.2921572
Koren, Y., Bell, R., & Volinsky, C. (2009). Matrix Factorization Techniques for Recommender Systems. Computer, 42(8), 30–37. https://doi.org/10.1109/MC.2009.263
Le, Q., & Mikolov, T. (2014). Distributed Representations of Sentences and Documents. Proceedings of the 31 St International Conference on Machine Learning, 1188–1196. https://proceedings.mlr.press/v32/le14.html
Liu, T., Xu, *, Qiao, Yuxin, Jiang, & Chen. (2024). News Recommendation with Attention Mechanism. Journal of Industrial Engineering and Applied Science, 2(1). https://doi.org/10.5281/zenodo.10635481
Lops, P., de Gemmis, M., & Semeraro, G. (2011). Content-based Recommender Systems: State of the Art and Trends. Recommender Systems Handbook, 73–105. https://doi.org/10.1007/978-0-387-85820-3_3
Malkov, Y. A., & Yashunin, D. A. (2016). Efficient and Robust Approximate Nearest Neighbor Search Using Hierarchical Navigable Small World Graphs. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(4), 824–836. https://doi.org/10.1109/TPAMI.2018.2889473
Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013). Efficient Estimation of Word Representations in Vector Space. International Conference on Learning Representations.
Misra, R. (2022). News Category Dataset. Https://Www.Huffpost.Com/. https://www.kaggle.com/datasets/rmisra/news-category-dataset
Rajaraman, A., & Ullman, J. D. (2012). Mining of Massive Datasets. In Journal of Data Analysis and Information Processing (Vol. 9781107015357). Cambridge University Press. https://doi.org/10.1017/CBO9781139058452
Raza, S., & Ding, C. (2021). News Recommender System: A Review of Recent Progress, Challenges, And Opportunities. Artificial Intelligence Review, 55(1), 749–800. https://doi.org/10.1007/S10462-021-10043-X
Rong, Z., Yuan, L., & Yang, L. (2024). Enhanced Knowledge Graph Recommendation Algorithm Based on Multi-Level Contrastive Learning. Scientific Reports, 14(1). https://doi.org/10.1038/S41598-024-74516-Z
Walker, J., Zhou, F., Baagyere, E. Y., Ahene, E., & Zhang, F. (2022). Implicit Optimal Variational Collaborative Filtering. Complex and Intelligent Systems, 8(5), 4369–4384. https://doi.org/10.1007/S40747-022-00696-8
Wu, C., Huang, J., Wu, F., Huang, Y., An, M., & Xie, X. (2019). NPA: Neural News Recommendation with Personalized Attention. Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2576–2584. https://doi.org/10.1145/3292500.3330665
Wu, S., Sun, F., Zhang, W., Xie, X., & Cui, B. (2023). Graph Neural Networks in Recommender Systems: A Survey. ACM Computing Surveys, 55(5). https://doi.org/10.1145/3535101/ASSET/66843D8A-EBB9-44EC-8B10-E3C9B6900D75/ASSETS/IMAGES/LARGE/CSUR-2021-0234-F09.JPG
Zhang, S., Yao, L., Sun, A., & Tay, Y. (2020). Deep Learning Based Recommender System: A Survey and New Perspectives. ACM Computing Surveys, 52(1). https://doi.org/10.1145/3285029
Bila bermanfaat silahkan share artikel ini
Berikan Komentar Anda terhadap artikel Perancangan Skema Evaluasi untuk Sistem Rekomendasi Berita Menggunakan Metrik Precision, Recall, dan F1‑Score
Pages: 1351-1360
Copyright (c) 2026 Irwan Kurnia Phan, Yuricha Yuricha

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).













