Segmentation-Aware Recommendation with Cluster-Specific Item Graphs Using Pointwise Mutual Information for Market Basket Analysis


  • Khalifatur Rauf (*), Universitas Teknologi Yogyakarta, Sleman, Indonesia
  • Arief Hermawan, Universitas Teknologi Yogyakarta, Sleman, Indonesia
  • Donny Avianto, Universitas Teknologi Yogyakarta, Sleman, Indonesia
  • (*) Corresponding Author
Keywords: Graph-Based Recommendation; Node2Vec; Customer Segmentation; Association Rule Mining; E-Commerce Transactions

Abstract

Traditional association rule-based recommendation methods often exhibit limited coverage and high redundancy when applied to sparse transactional data, which constrains their effectiveness for product discovery in e-commerce systems. This study proposes a hybrid recommendation framework that integrates customer behavioral segmentation with graph-based item representation learning to address these limitations. Customers are first grouped into behaviorally homogeneous clusters using features derived from their historical transactions. For each cluster, an item co-occurrence graph is constructed and its edges are weighted using pointwise mutual information (PMI) to mitigate sparsity bias and emphasize informative associations. Node2Vec is then applied to each graph to generate low-dimensional product embeddings that capture both local structural proximity and higher-order relational patterns. The framework explicitly restricts the candidate item space to the Top 100 most frequent products within each behavioral cluster, so the recommendation task targets localized discovery within high-frequency product segments rather than global catalog exploration. The objective is to assess whether segmentation-aware graph embeddings can outperform traditional FP-Growth association rules under a strict temporal split between the Historical Training Set and the Hold-out Evaluation Set, which ensures a realistic and leakage-free evaluation. Model performance is measured on the Hold-out Evaluation Set using precision, recall, normalized discounted cumulative gain (NDCG), and intra-list diversity. Experimental results indicate that, compared with FP-Growth, the proposed graph-based approach improves ranking quality and diversity within the constrained Top 100 item space.
These results suggest that graph-based embeddings are more robust to sparse behavioral patterns within high-frequency product segments and better suited to exploratory recommendation within dense product subsets. The proposed framework offers a scalable and temporally valid foundation for knowledge-driven recommender systems.
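
As an illustration of the graph-construction step described in the abstract, the following is a minimal sketch of how pointwise mutual information weights could be computed for the item pairs of a single customer cluster. It is not the authors' implementation. The function name `pmi_edges`, the toy baskets, the base-2 logarithm, the basket-level probability estimates, and the choice to keep only positive-PMI pairs as edges are all assumptions made for this example. The sketch uses PMI(a, b) = log2(P(a, b) / (P(a) P(b))) and restricts candidates to the most frequent items, mirroring the Top 100 constraint. The clustering and Node2Vec steps are not shown.

```python
import math
from collections import Counter
from itertools import combinations


def pmi_edges(baskets, top_n=100):
    """Return {(item_a, item_b): PMI} for the baskets of one cluster.

    Hypothetical helper: only the top_n most frequent items are kept as
    candidates, and only pairs with positive PMI become edges (an
    illustrative choice, not taken from the paper).
    """
    baskets = [set(b) for b in baskets]
    n = len(baskets)

    # Number of baskets containing each item.
    item_counts = Counter(item for b in baskets for item in b)
    candidates = {item for item, _ in item_counts.most_common(top_n)}

    # Number of baskets containing each unordered candidate pair.
    pair_counts = Counter()
    for b in baskets:
        kept = sorted(b & candidates)  # sorting gives a canonical pair order
        pair_counts.update(combinations(kept, 2))

    edges = {}
    for (a, b), n_ab in pair_counts.items():
        # P(a, b) / (P(a) * P(b)) with basket-level estimates simplifies
        # to n_ab * n / (n_a * n_b).
        pmi = math.log2(n_ab * n / (item_counts[a] * item_counts[b]))
        if pmi > 0:
            edges[(a, b)] = pmi
    return edges


# Toy baskets for one cluster (made-up data).
baskets = [
    ["milk", "bread"],
    ["milk", "bread", "eggs"],
    ["milk", "eggs"],
    ["bananas", "yogurt"],
    ["bananas", "yogurt", "milk"],
    ["bread", "bananas"],
]

for pair, weight in sorted(pmi_edges(baskets).items()):
    print(pair, round(weight, 3))
```

On this toy data only two pairs survive the positive-PMI filter. The pair bread and milk co-occurs as often as any other pair, but it receives a PMI of zero because both items are individually frequent, which is the kind of reweighting that raw co-occurrence counts do not provide. The resulting weighted edges could then be passed to a Node2Vec implementation to learn the product embeddings.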


Article History
Submitted: 2026-04-20
Published: 2026-06-05
How to Cite
Rauf, K., Hermawan, A., & Avianto, D. (2026). Segmentation-Aware Recommendation with Cluster-Specific Item Graphs Using Pointwise Mutual Information for Market Basket Analysis. Building of Informatics, Technology and Science (BITS), 8(1), 83-94. https://doi.org/10.47065/bits.v8i1.9707
Section
Articles