Perbandingan Kinerja Random forest dan SVM Pada Klasifikasi Tingkat Kekumuhan Permukiman Menggunakan SMOTE


  • Nurika Dwi Wahyuni Universitas Islam Negeri Sultan Syarif Kasim Riau, Pekanbaru, Indonesia
  • Fadhilah Syafria * Mail Universitas Islam Negeri Sultan Syarif Kasim Riau, Pekanbaru, Indonesia
  • Novi Yanti Universitas Islam Negeri Sultan Syarif Kasim Riau, Pekanbaru, Indonesia
  • Surya Agustian Universitas Islam Negeri Sultan Syarif Kasim Riau, Pekanbaru, Indonesia
  • (*) Corresponding Author
Keywords: Imbalanced Data; Slum Classification; Random Forest; Indicator Scoring; SMOTE; Support Vector Machine

Abstract

Classifying slum levels is essential for a structured, data-driven analysis of settlement conditions. This study compares the performance of Random forest and Support vector machine (SVM) in classifying slum levels in Pekanbaru City across two scenarios with and without SMOTE using slum indicator scoring data. Its contributions include analyzing SMOTE's impact on model performance and evaluating the top 10 features against the full feature set. The dataset comprises 992 RT-level records from Disperkim Pekanbaru City (2020, 2021, and 2023) featuring 16 slum indicator scores based on PUPR Ministerial Regulation No. 14/2018, categorized into three classes: Non-Slum, Low Slum, and Moderate Slum. Following the KDD process (selection, preprocessing, transformation, data mining, evaluation, and analysis), the data was split 80:20 using stratified sampling and evaluated based on accuracy, precision, recall, F1-score, and confusion matrix. Results show that the Linear SVM without SMOTE achieved perfect evaluation metrics (1.0000); however, this is interpreted cautiously as the class labels derive from strict regulatory scoring rules, making class boundaries inherently linear. Random forest saw its F1-score rise from 0.9660 to 0.9700 after SMOTE, while the most significant improvement occurred in SVM RBF, jumping from 0.9214 to 0.9779. Testing the top 10 features led to a decreased F1-score across models, indicating that utilizing all 16 features remains optimal for this dataset.

Downloads

Download data is not yet available.

References

BPS Kota Pekanbaru, “Kota Pekanbaru Dalam Angka 2025,” Badan Pusat Statistik Kota Pekanbaru. Accessed: Apr. 29, 2026. [Online]. Available: https://pekanbarukota.bps.go.id/id/publication/2025/02/28/782f2589686f3095440a4005/kota-pekanbaru-dalam-angka-2025.html

Z. Hasan, S. B. N. Arisaputri, F. Alicia, F. Sabila, and Z. Fuady, “Environmental Quality Assessment of Planned Residential Areas in a Peri-Urban Zone Under Urban Sprawl Pressure: A Case Study of Ingin Jaya Subdistrict, Indonesia,” Elkawnie: Journal of Islamic Science and Technology, vol. 11, no. 2, pp. 171–188, Dec. 2025, doi: 10.22373/ekw.v11i2.31691.

A. R. Sari and M. A. Ridlo, “Studi Literature : Identifikasi Faktor Penyebab Terjadinya Permukiman Kumuh Di Kawasan Perkotaan,” Jurnal Kajian Ruang, vol. 1, no. 2, pp. 160–176, Sep. 2021, doi: https://dx.doi.org/10.30659/jkr.v1i2.20022.

A. R. Nasution and S. M. Sihombing, “Evaluasi Program Kota Tanpa Kumuh (KOTAKU) dalam Penanganan Kawasan Kumuh di Kabupaten Karo,” Jurnal Manajemen dan Ilmu Administrasi Publik (JMIAP), vol. 6, no. 2, pp. 223–234, May 2024, doi: 10.24036/jmiap.v6i2.772.

W. Z. Dela Lathifah A.R. and Z. Rusli, “Strategi Pengembangan dan Penataan Kawasan Permukiman Kumuh Kota Pekanbaru,” Journal of Comprehensive Science, vol. 3, no. 11, pp. 4950–4968, Nov. 2024, doi: https://doi.org/10.59188/jcs.v3i11.2709.

Kementerian Pekerjaan Umum dan Perumahan Rakyat, Peraturan Menteri Pekerjaan Umum Dan Perumahan Rakyat Republik Indonesia Nomor 14/Prt/M/2018 Tentang Pencegahan Dan Peningkatan Kualitas Terhadap Perumahan Kumuh Dan Permukiman Kumuh. 2018. Accessed: Apr. 29, 2026. [Online]. Available: https://peraturan.bpk.go.id/Details/104649/permen-pupr-no-14prtm2018-tahun-2018

E. Banjarnahor, R. Belferik, W. Cendana, Y. Adi, and S. Abraham, “Analisis Implementasi Support vector machine dan Random forest untuk Prediksi Kategori Indeks Kualitas Udara Jakarta,” Instek, vol. 10, no. 1, pp. 175–184, Apr. 2025, doi: https://doi.org/10.24252/instek.v10i1.56477.

M. Kasahun and A. Legesse, “Machine learning for urban land use/ cover mapping: Comparison of artificial neural network, random forest and support vector machine, a case study of Dilla town,” Heliyon, vol. 10, Oct. 2024, doi: 10.1016/j.heliyon.2024.e39146.

P. Widayani, A. Fadilah, I. Z. Irawan, and K. Ghosh, “Implementing Support vector machine Algorithm for Early Slums Identification in Yogyakarta City, Indonesia Using Pleiades Images,” Forum Geografi, vol. 37, no. 1, pp. 88–97, Jul. 2023, doi: 10.23917/forgeo.v37i1.15248.

V. Oktaviani, N. Rosmawarni, and M. Panji Muslim, “Perbandingan Kinerja Random forest Dan Smote Random forest Dalam Mendeteksi Dan Mengukur Tingkat Stres Pada Mahasiswa Tingkat Akhir,” IFTK Jurnal Informatik, vol. 20, no. 1, pp. 43–49, Apr. 2024, doi: https://doi.org/10.52958/iftk.v20i1.9158.

I. G. A. N. Lestari and K. A. A. Aryanto, “Peningkatan Akurasi Klasifikasi Kualitas Udara melalui Oversampling dengan Metode Support vector machine dan Random forest,” Jurnal Sistem Dan Informatika (JSI), vol. 18, no. 1, pp. 1–9, Nov. 2023, doi: https://doi.org/10.30864/jsi.v18i1.596.

M. A. G. Muttaqin and G. Alfa Trisnapradika, “Optimasi Algoritma SVM dengan Teknik SMOTE dan Tuning Parameter pada Klasifikasi Balita Stunting,” Building of Informatics, Technology and Science (BITS), vol. 7, no. 3, pp. 1547–1556, Dec. 2025, doi: 10.47065/bits.v7i3.8330.

D. W. Y. Rahayu, K. Umam, and M. R. Handayani, “Performance of Machine Learning Algorithms on Imbalanced Sentiment Datasets Without Balancing Techniques,” Journal of Applied Informatics and Computing (JAIC), vol. 9, no. 3, pp. 998–1005, Jun. 2025, doi: https://doi.org/10.30871/jaic.v9i3.9584.

E. Erlin, Y. Desnelita, N. Nasution, L. Suryati, and F. Zoromi, “Dampak SMOTE terhadap Kinerja Random forest Classifier berdasarkan Data Tidak seimbang,” MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer, vol. 21, no. 3, pp. 677–690, Jul. 2022, doi: 10.30812/matrik.v21i3.1726.

M. Sofyan Alfandi and Z. Fatah, “Penerapan Data mining Menggunakan Metode K-Means Clustering Untuk Analisa Penjualan Toko Umama Hijab Kaliwates Jember,” Jurnal Riset Sistem Informasi, vol. 1, no. 4, pp. 94–102, Dec. 2024, doi: https://doi.org/10.69714/3ty90586.

A. Géron, Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems, 2nd ed. O’Reilly Media, 2019.

N. I. Chaerunnisa, M. Yosep, and T. Sulistyono, “Machine Learning-Based Teacher Performance Classification Using Administrative and Credit Point Assessment (PAK) Data: A Comparative Study of Decision Tree and Naive Bayes,” Journal of Applied Informatics and Computing (JAIC), vol. 10, no. 2, pp. 1853–1863, Apr. 2026, doi: https://doi.org/10.30871/jaic.v10i2.12377.

W. Wijiyanto, A. I. Pradana, S. Sopingi, and V. Atina, “Teknik K-Fold Cross Validation untuk Mengevaluasi Kinerja Mahasiswa,” Jurnal Algoritma, vol. 21, no. 1, pp. 239–248, May 2024, doi: 10.33364/algoritma/v.21-1.1618.

B. Or, “Improving Requirements Classification with SMOTE-Tomek Preprocessing,” arXiv preprint arXiv:2501.06491, Dec. 2025, [Online]. Available: http://arxiv.org/abs/2501.06491

R. S. Andarujaya and R. R. Suryono, “Perbandingan Kinerja Algoritma Random forest, KNN, dan SVM dalam Analisis Sentimen Cryptocurrency,” Building of Informatics, Technology and Science (BITS), vol. 6, no. 4, pp. 2288–2299, Mar. 2025, doi: 10.47065/bits.v6i4.6572.

N. A. Arifuddin et al., Machine Learning. Padang Pariaman: Lingkar Edukasi Indonesia, 2025. [Online]. Available: www.lingkaredukasiindonesia.com


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Perbandingan Kinerja Random forest dan SVM Pada Klasifikasi Tingkat Kekumuhan Permukiman Menggunakan SMOTE

Dimensions Badge
Article History
Submitted: 2026-05-29
Published: 2026-06-23
Abstract View: 0 times
PDF Download: 0 times
How to Cite
Wahyuni, N., Syafria, F., Yanti, N., & Agustian, S. (2026). Perbandingan Kinerja Random forest dan SVM Pada Klasifikasi Tingkat Kekumuhan Permukiman Menggunakan SMOTE. Building of Informatics, Technology and Science (BITS), 8(1), 374-385. https://doi.org/10.47065/bits.v8i1.10101
Issue
Section
Articles

Most read articles by the same author(s)

1 2 3 > >>