Agglomerative Hierarchical Clustering (AHC) Method for Data Mining Sales Product Clustering


  • Ridha Maya Faza Lubis * Mail Southern Taiwan University of Science and Technology, Taiwan, Province of China
  • Jen-Peng Huang Southern Taiwan University of Science and Technology, Taiwan, Province of China
  • Pai-Chou Wang Southern Taiwan University of Science and Technology, Taiwan, Province of China
  • Kiki Khoifin Southern Taiwan University of Science and Technology, Taiwan, Province of China
  • Yuli Elvina Southern Taiwan University of Science and Technology, Taiwan, Province of China
  • Dyah Ayu Kusumaningtyas Southern Taiwan University of Science and Technology, Taiwan, Province of China
  • (*) Corresponding Author
Keywords: Data Mining; Clustering; AHC Method; Sales Products

Abstract

Supermarkets are Indonesian terms that refer to large stores or supermarkets that offer a variety of daily needs such as food, drinks, cleaning products, household appliances, clothing, and so on. In contrast to stalls or small shops, supermarkets have a larger size and provide a variety of products. Because of this, many people prefer to shop for their daily needs at the supermarket rather than at the nearest shop because the existence of the supermarket makes it easier for consumers to buy various products in one place without having to move to another store. However, sales in supermarkets also pose a problem, namely how to sort or group products that are not selling well so they can be replaced with products that are selling better or reduce the number of suppliers. This is where data mining or data analysis techniques that use business intelligence are needed. The research was conducted to classify the best-selling products in supermarkets using the Agglomerative Hierarchical Clustering (AHC) method, in which alternatives with the same matrix or distance are grouped into certain clusters. In applying the AHC method, the number of clusters formed is 3. There are three different clusters, namely cluster 0, cluster 1, and cluster 2, each with a different alternative group. Each cluster has a different number of products and a different percentage. Cluster 0 is the cluster with the highest number of products and the largest percentage, namely 45% with a total of 9 products, followed by cluster 2, and cluster 1 has the smallest number of products and percentage, namely 0.30% with a total of 6 products and 0 .25% with a total of 5 products. In addition, sales data for several products each month are grouped based on certain price ranges

Downloads

Download data is not yet available.

References

G. Gunadi and D. I. Sensuse, “Penerapan Metode Data Mining Market Basket Analysis Terhadap Data Penjualan Produk Buku Dengan Menggunakan Algoritma Apriori Dan Frequent Pattern Growth ( Fp-Growth ) :,” Telematika, vol. 4, no. 1, pp. 118–132, 2012.

B. D. Mudzakkir, “Pengelompokan Data Penjualan Produk Pada Pt Advanta Seeds Indonesia Menggunakan Metode K-Means,” J. Mhs. Tek. Inform., vol. 2, no. 2, pp. 34–40, 2018.

G. Gustientiedina, M. H. Adiya, and Y. Desnelita, “Penerapan Algoritma K-Means Untuk Clustering Data Obat-Obatan,” J. Nas. Teknol. Dan Sist. Inf., vol. 5, no. 1, pp. 17–24, 2019.

S. Al Syahdan and A. Sindar, “Data Mining Penjualan Produk Dengan Metode Apriori Pada Indomaret Galang Kota,” J. Nas. Komputasi dan Teknol. Inf., vol. 1, no. 2, 2018, doi: 10.32672/jnkti.v1i2.771.

Z. Nabila, A. Rahman Isnain, and Z. Abidin, “Analisis Data Mining Untuk Clustering Kasus Covid-19 Di Provinsi Lampung Dengan Algoritma K-Means,” J. Teknol. dan Sist. Inf., vol. 2, no. 2, p. 100, 2021, [Online]. Available: http://jim.teknokrat.ac.id/index.php/JTSI

A. Aditya, I. Jovian, and B. N. Sari, “Implementasi K-Means Clustering Ujian Nasional Sekolah Menengah Pertama di Indonesia Tahun 2018/2019,” J. Media Inform. Budidarma, vol. 4, no. 1, pp. 51–58, 2020.

H. Al Rasyid, B. F. K. Soebari, and D. S. Y. Kartika, “IMPLEMENTASI ALGORITMA K-MEANS CLUSTERING UNTUK PENGELOMPOKAN PENJUALAN PRODUK PADA ONLINE SHOP TOKO GIZI,” in Prosiding Seminar Nasional Teknologi dan Sistem Informasi, 2022, vol. 2, no. 1, pp. 242–248.

M. Dahria, R. Gunawan, and Z. Lubis, “Implementasi K-Means Untuk Pengelompokan Produk Terbaik PT. Koko Pelli,” in Seminar Nasional Sains dan Teknologi Informasi (SENSASI), 2019, vol. 2, no. 1.

B. H. T. S. As and L. Zahrotun, “Penerapan Penerapan Data Mining dalam Mengelompokkan Data Riwayat Akademik Sebelum Kuliah dan Data Kelulusan Mahasiswa menggunakan Metode Agglomerative Hierarchical Clustering (AHC),” J. Teknol. Informasi, Komputer, dan Apl., vol. 3, no. 1, pp. 62–71, 2021.

A. Z. Siregar, “Implementasi Metode Regresi Linier Berganda Dalam Estimasi Tingkat Pendaftaran Mahasiswa Baru,” Kesatria J. Penerapan Sist. Inf. (Komputer dan Manajemen), vol. 2, no. 3, pp. 133–137, 2021, [Online]. Available: https://tunasbangsa.ac.id/pkm/index.php/kesatria/article/view/73

S. S. S, A. T. Purba, V. Marudut, M. Siregar, T. Komputer, and P. B. Indonesia, “SISTEM PENDUKUNG KEPUTUSAN KELAYAKAN PEMBERIAN PINJAMAN,” vol. 3, pp. 25–30, 2020, doi: 10.37600/tekinkom.v3i1.131.

B. S. Pranata and D. P. Utomo, “Penerapan Data Mining Algoritma FP-Growth Untuk Persediaan Sparepart Pada Bengkel Motor (Study Kasus Bengkel Sinar Service),” Bull. Inf. Technol., vol. 1, no. 2, pp. 83–91, 2020.

F. O. Lusiana, I. Fatma, and A. P. Windarto, “Estimasi Laju Pertumbuhan Penduduk Menggunakan Metode Regresi Linier Berganda Pada BPS Simalungun,” J. Informatics Manag. Inf. Technol., vol. 1, no. 2, pp. 79–84, 2021, [Online]. Available: https://hostjournals.com/

A. Wanto et al., Data Mining: Algoritma dan Implementasi. Yayasan kita menulis, 2020.

M. Azhari, Z. Situmorang, and R. Rosnelly, “Perbandingan Akurasi, Recall, dan Presisi Klasifikasi pada Algoritma C4.5, Random Forest, SVM dan Naive Bayes,” J. Media Inform. Budidarma, vol. 5, no. 2, p. 640, 2021, doi: 10.30865/mib.v5i2.2937.

S. Widaningsih, “Perbandingan Metode Data Mining Untuk Prediksi Nilai Dan Waktu Kelulusan Mahasiswa Prodi Teknik Informatika Dengan Algoritma C4,5, Naïve Bayes, Knn Dan Svm,” J. Tekno Insentif, vol. 13, no. 1, pp. 16–25, 2019, doi: 10.36787/jti.v13i1.78.

H. Maulidiya and A. Jananto, “Asosiasi Data Mining Menggunakan Algoritma Apriori dan FP-Growth sebagai Dasar Pertimbangan Penentuan Paket Sembako,” Proceeding SENDIU 2020, vol. 6, pp. 36–42, 2020.

A. S. L. T. T. H. Hafizah, “Data Mining Estimasi Biaya Produksi Ikan Kembung Rebus Dengan Regresi Linier Berganda,” J. Sist. Inf. Triguna Dharma (JURSI TGD), no. Vol 1, No 6 (2022): EDISI NOVEMBER 2022, pp. 888–897, 2022, [Online]. Available: https://ojs.trigunadharma.ac.id/index.php/jsi/article/view/5732/1938

Y. L. Nainel, E. Buulolo, and I. Lubis, “Penerapan Data Mining Untuk Estimasi Penjualan Obat Berdasarkan Pengaruh Brand Image Dengan Algoritma Expectation Maximization (Studi Kasus: PT. Pyridam Farma Tbk),” JURIKOM (Jurnal Ris. Komputer), vol. 7, no. 2, p. 214, 2020, doi: 10.30865/jurikom.v7i2.2097.

F. Harahap, “Perbandingan Algoritma K Means dan K Medoids Untuk Clustering Kelas Siswa Tunagrahita,” TIN Terap. Inform. Nusant., vol. 2, no. 4, pp. 191–197, 2021.

B. Harli Trimulya Suandi As and L. Zahrotun, “PENERAPAN DATA MINING DALAM MENGELOMPOKKAN DATA RIWAYAT AKADEMIK SEBELUM KULIAH DAN DATA KELULUSAN MAHASISWA MENGGUNAKAN METODE AGGLOMERATIVE HIERARCHICAL CLUSTERING (Implementation Of Data Mining In Grouping Academic History Data Before Students And Stud,” J. Teknol. Informasi, Komput. dan Apl., vol. 3, no. 1, pp. 62–71, 2021, [Online]. Available: http://jtika.if.unram.ac.id/index.php/JTIKA/

R. A. Setyawan and R. M. Fadilla, “Klasterisasi media pembelajaran daring di era pandemi COVID-19 menggunakan metode Agglomerative,” Inf. Interaktif, vol. 5, no. 3, 2020, [Online]. Available: http://www.e-journal.janabadra.ac.id/index.php/informasiinteraktif/article/view/1305%0Ahttps://www.e-journal.janabadra.ac.id/index.php/informasiinteraktif/article/download/1305/890

Marjiyono, “Penerapan Algoritma Ahc Algorithm Dalam Aplikasi Ppembagian Kelas Siswa Baru,” Semin. Nas. Teknol. Inf. dan Multimed. 2015, pp. 6–8, 2015.

P. Govender and V. Sivakumar, “Application of k-means and hierarchical clustering techniques for analysis of air pollution: A review (1980–2019),” Atmos. Pollut. Res., vol. 11, no. 1, pp. 40–56, 2020.

C. Briggs, Z. Fan, and P. Andras, “Federated learning with hierarchical clustering of local updates to improve training on non-IID data,” in 2020 International Joint Conference on Neural Networks (IJCNN), 2020, pp. 1–9.

K. Zeng, M. Ning, Y. Wang, and Y. Guo, “Hierarchical clustering with hard-batch triplet loss for person re-identification,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020, pp. 13657–13665.

N. K. Zuhal, “Study Comparison K-Means Clustering dengan Algoritma Hierarchical Clustering,” Pros. Semin. Nas. Teknol. dan Sains, vol. 1, pp. 200–205, 2022, [Online]. Available: https://jurnal.dharmawangsa.ac.id/index.php/djtechno/article/view/966/867


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Agglomerative Hierarchical Clustering (AHC) Method for Data Mining Sales Product Clustering

Dimensions Badge
Article History
Submitted: 2023-06-04
Published: 2023-06-29
Abstract View: 12 times
PDF Download: 3 times
How to Cite
Lubis, R. M. F., Huang, J.-P., Wang, P.-C., Khoifin, K., Elvina, Y., & Kusumaningtyas, D. A. (2023). Agglomerative Hierarchical Clustering (AHC) Method for Data Mining Sales Product Clustering. Building of Informatics, Technology and Science (BITS), 5(1), 285−294. https://doi.org/10.47065/bits.v5i1.3569
Issue
Section
Articles