A Multi-Label Classification of Al-Quran Verses Using Ensemble Method and Naïve Bayes

  • Muhammad Rizqi Choirulfikri Telkom University, Bandung, Indonesia
  • Kemas Muslim Lhaksamana * Mail Telkom University, Bandung, Indonesia
  • Said Al Faraby Telkom University, Bandung, Indonesia
  • (*) Corresponding Author
Keywords: Al-Quran; Multi-label; Classification; Ensemble Method; Naïve Bayes


Al-Quran is the holy book as a guide and also a source of law for muslims. Thus, understanding and studying Al-Quran is very important for muslims. To make it easier for muslims to understand and study the Qur'an, it is necessary to classify the verses of the Al-Qur'an. This study built a system that can perform multi-label classification of Al-Quran verses. Multi-label means that the classification will divide each verse of the Al-Quran into more than 1 topic. The model is built using the ensemble method by combining several Naïve Bayes algorithms. The ensemble method was chosen because research with different datasets can obtain good performance. The naïve Bayes algorithm was also chosen because it has a simple calculation so it requires a fairly short computation time. The preprocessing step is also carried out to see the comparison of performance results. To measure the performance of the system that has been built, the calculation of hamming loss is used. Based on the experimental results with several testing scenarios, the best performance results are obtained by combining Multinomial NB and Bernoulli NB with a hamming loss value of 0.1167. Thus, the use of the ensemble method can improve performance compared to without the ensemble method. This research can also of course build a multi-label classification model for the verses of Al-Quran with the ensemble method


Download data is not yet available.


N. S. Huda, M. S. Mubarok, and Adiwijaya, “A multi-label classification on topics of quranic verses (english translation) using backpropagation neural network with stochastic gradient descent and adam optimizer,” 2019 7th Int. Conf. Inf. Commun. Technol. ICoICT 2019, hal. 1–5, 2019, doi: 10.1109/ICoICT.2019.8835362.

R. A. Pane, M. S. Mubarok, N. S. Huda, and Adiwijaya, “A multi-lable classification on topics of Quranic verses in English translation using multinomial naive bayes,” 2018 6th Int. Conf. Inf. Commun. Technol. ICoICT 2018, no. April 2019, hal. 481–484, 2018, doi: 10.1109/ICoICT.2018.8528777.

A. M. K. Izzaty, M. S. Mubarok, N. S. Huda, and Adiwijaya, “A multi-label classification on topics of quranic verses in English translation using Tree Augmented Naïve Bayes,” 2018 6th Int. Conf. Inf. Commun. Technol. ICoICT 2018, vol. 0, no. c, hal. 103–106, 2018, doi: 10.1109/ICoICT.2018.8528802.

G. I. Ulumudin, A. Adiwijaya, and M. S. Mubarok, “A multilabel classification on topics of qur’anic verses in English translation using K-Nearest Neighbor method with Weighted TF-IDF,” J. Phys. Conf. Ser., vol. 1192, no. 1, 2019, doi: 10.1088/1742-6596/1192/1/012026.

S. Xu, “Bayesian Naïve Bayes classifiers to text classification,” J. Inf. Sci., vol. 44, no. 1, hal. 48–59, 2018, doi: 10.1177/0165551516677946.

F. Gurcan, “Multi-Class Classification of Turkish Texts with Machine Learning Algotirthms,” ISMSIT 2018 - 2nd Int. Symp. Multidiscip. Stud. Innov. Technol. Proc., hal. 18–22, 2018, doi: 10.1109/ISMSIT.2018.8567307.

A. O. Adeleke, N. A. Samsudin, A. Mustapha, and N. M. Nawi, “Comparative analysis of text classification algorithms for automated labelling of Quranic verses,” Int. J. Adv. Sci. Eng. Inf. Technol., vol. 7, no. 4, hal. 1419–1427, 2017, doi: 10.18517/ijaseit.7.4.2198.

M. Y. Abu Bakar, Adiwijaya, and S. Al Faraby, “Multi-Label Topic Classification of Hadith of Bukhari (Indonesian Language Translation)Using Information Gain and Backpropagation Neural Network,” Proc. 2018 Int. Conf. Asian Lang. Process. IALP 2018, hal. 344–350, 2019, doi: 10.1109/IALP.2018.8629263.

H. Yin, Z. R. Yang, and R. Everson, “Preface,” Lect. Notes Comput. Sci. (including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics), vol. 3177, no. May, 2004, doi: 10.1007/978-3-540-28651-6.

M. H. D. M. Ribeiro and L. dos Santos Coelho, “Ensemble approach based on bagging, boosting and stacking for short-term prediction in agribusiness time series,” Appl. Soft Comput. J., vol. 86, hal. 105837, 2020, doi: 10.1016/j.asoc.2019.105837.

A. R. Arrahimi, M. K. Ihsan, D. Kartini, M. R. Faisal, and F. Indriani, “Teknik Bagging And Boosting Pada Algoritma CART Untuk Klasifikasi Masa Studi Mahasiswa,” J. Sains and Inform., vol. 5, no. 1, hal. 21–30, 2019, doi: 10.34128/jsi.v5i1.171.

A. W. Romadon, K. M. Lhaksmana, I. Kurniawan, and D. Richasdy, “Analyzing TF-IDF and Word Embedding for Implementing Automation in Job Interview Grading,” 2020 8th Int. Conf. Inf. Commun. Technol. ICoICT 2020, hal. 1–4, 2020, doi: 10.1109/ICoICT49345.2020.9166364.

Adiwijaya, Said Al Faraby, and Mohamad Syahrul Mubarok. 2021. Indonesian Translation of the Holy Quran (Multi-label), Dataverse Telkom University, DOI : https://doi.org/10.34820/FK2/XQCNPN

T. B. Chandra, K. Verma, B. K. Singh, D. Jain, and S. S. Netam, “Coronavirus disease (COVID-19) detection in Chest X-Ray images using majority voting based classifier ensemble,” Expert Syst. Appl., vol. 165, no. May 2020, hal. 113909, 2021, doi: 10.1016/j.eswa.2020.113909.

M. Hosni, I. Abnane, A. Idri, J. M. Carrillo de Gea, and J. L. Fernández Alemán, “Reviewing ensemble classification methods in breast cancer,” Comput. Methods Programs Biomed., vol. 177, hal. 89–112, 2019, doi: 10.1016/j.cmpb.2019.05.019.

Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel A Multi-Label Classification of Al-Quran Verses Using Ensemble Method and Naïve Bayes

Dimensions Badge
Article History
Submitted: 2022-02-02
Published: 2022-03-31
Abstract View: 292 times
PDF Download: 332 times
How to Cite
Choirulfikri, M., Lhaksamana, K., & Faraby, S. (2022). A Multi-Label Classification of Al-Quran Verses Using Ensemble Method and Naïve Bayes. Building of Informatics, Technology and Science (BITS), 3(4), 473-479. https://doi.org/10.47065/bits.v3i4.1287