Algoritma Stemming Teks Bahasa Batak Angkola Berbasis Aturan Tata Bahasa
Abstract
The Angkola Batak language is a variety of Batak languages, to be precise in the southern Tapanuli area, which is still used and maintained as an everyday language. Until now, the resources of the Angkola Batak language are not yet available in digital form that can be used by researchers in the analytical stages of human natural language processing. Natural language processing (NLP Taks) for the Angkola Batak language must follow the stages of text processing starting from tokenization, lexical analysis, syntax, semantics, and phragmatics. This study conducted natural language processing in the first stage, namely lexical analysis. At the lexical analysis stage, one of the most important NLP tasks is stemming. Stemming is the process of determining root words from affixed words. In this research, an analysis and design of the Angkola Batak stemming algorithm have been carried out based on grammar rules. The stages in this research are starting from collecting the grammar rules of the Angkola Batak language, collecting basic words in the Angkola Batak language as a database dictionary, and removing affixes from root words. The output of this research is the stemmer of the Angkola Batak language in the form of PHP. Based on tests conducted on 450 words originating from the Batak Angkola folklore, 448 test words were correct (99.56%) and 2 test words were wrong (0.44%). The wrong test word is obtained because the root word is not found in the dictionary.
Downloads
References
Asrif, “Pembinaan dan Pengembangan Bahasa Daerah dalam Memantapkan Kedudukan dan Fungsi Bahasa Indonesia,” Mabasan, vol. 4, no. 1, hal. 11–23, 2010.
N. W. Putri, “Pergeseran Bahasa Daerah Lampung Pada Masyarakat Kota Bandar Lampung,” J. Penelit. Hum., vol. 19, no. 14, hal. 77–86, 2018.
A. F. A. Batubara dan M. L. Anggapuspa, “Perancangan Pop-Up Book Ilustrasi Etnis Batak sebagai Media Interaktif untuk Anak Usia 9-10 Tahun,” J. Barik, vol. 2, no. 2, hal. 108–120, 2021.
T. H. Dongoran, J. Naibaho, P. Sihombing, M. Sinaga, dan R. Tampubolon, Fonologi Bahasa Angkola. Jakarta : Pusat Pembinaan dan Pengembangan Bahasa, 1997.
N. Indurkhya dan F. J. Damerau, Handbook of Natural Language Processing, 2 ed. Chapman & Hall/CRC, 2010.
H. R. Pramudita, “Penerapan Algoritma Stemming Nazief & Adriani dan Similarity pada Penerimaan Judul Tesis,” J. Ilm. DASI, vol. 15, no. 04, hal. 15–19, 2014.
L. Agusta, “Perbandingan Algoritma Stemming Porter dengan Algoritma Nazief & Adriani untuk Stemming Dokumen Teks Bahasa Indonesia,” Konf. Nas. Sist. dan Inform., hal. 196–201, 2009.
W. B. Frankes dan R. Baeza-Yates, Information Retrieval: Data Structures & Algorithms, 1992 ed. Prentice-Hall, 1992.
W. Prasetyo, “Algoritma Stemming Teks Bahasa Massenrempulu Berbasis Aturan Tata Bahasa,” Universitas Islam Negeri Sultan Syarif Kasim Riau, 2019.
Made Agus Putra Subali, “Pengembangan Metode Stemmer untuk Bahasa Bali dengan Pendekatan Rule-Based dan N-Gram Stemming,” Institut Teknologi Sepuluh Nopembe, Surabaya, 2019.
Yusra, M. Fikry, dan Hendri, “Stemmer Bahasa Melayu Riau Berdasarkan Aturan Morfologi,” Semin. Nas. Teknol. Informasi, Komun. dan Ind. 13, no. November, hal. 118–124, 2021.
S. Megi, “Algoritma Stemming Teks Bahasa Karo Berdasarkan Aturan TataBahasa,” Universitas Islam Negeri Sultan Syarif Kasim Riau, 2021.
F. Alfajri, “Algoritma Stemming Tekas Bahasa Batak Simalungun Baerbasis Aturan,” Universitas Islam Negeri Sultan Syarif Kasim Riau, 2020.
W. Anisah, “Algoritma Stemming Bahasa Pakpak Dairi Menggunakan Aturan Tata Bahasa,” Universitas Islam Negeri Sultan Syarif Kasim Riau, 2022.
Z. Abidin, A. Wijaya, dan D. Pasha, “Aplikasi Stemming Kata Bahasa Lampung Dialek Api Menggunakan Pendekatan Brute-Force dan Pemograman C#,” J. Media Inform. Budidarma, vol. 5, no. 1, hal. 1, 2021, doi: 10.30865/mib.v5i1.2483.
M. Fauziyah, “Stemming Bahasa Jawa Menggunakan Algoritma Levenshtein dan Analisa Morfologi,” Universitas Islam Negeri Maulana Malik Ibrahim, 2019.
Y. F. Andriani, E. Utami, dan S. Raharjo, “Modifikasi Algoritma Porter Stemmer Untuk Stemming Bahasa Sasak,” J. Inf. J. Penelit. dan Pengabdi. Masy., vol. 5, no. 3, 2019, doi: 10.46808/informa.v5i3.147.
P. G. S. C. Nugraha dan N. W. Wardani, “Stemming Dokumen Teks Bahasa Bali Dengan Metode Rule Base Approach,” J. Tek. Inform. dan Sist. Inf., vol. 7, no. 3, hal. 510–521, 2020.
N. Sari dan K. Ummi, “Perancangan Aplikasi kamus Bahasa Minang Indonesia Dan Indonesia Minang Menggunakan Algoritma Levenshtein,” J. FTIK, vol. 1, no. 1, hal. 1113–1124, 2020.
I. M. A. Agastya, “Pengaruh Stemmer Bahasa Indonesia Terhadap Performa Analisis Sentimen Terjemahan Ulasan Film,” J. TEKNOKOMPAK, vol. 12, no. 1, hal. 18–23, 2018.
N. Himawan, G. W. Wicaksono, dan I. Nuryasin, “Ekstraksi Fi’il dan Isim Pada Kaidah Nahwu Shorof Berbasis Android,” J. Repos., vol. 2, no. 5, hal. 619–626, 2020, doi: 10.22219/repositor.v2i5.110.
A. A. Magriyanti, “Analisis Pengembangan Algoritma Porter Stemming dalam Bahasa Indonesia,” 2018.
Maryanto, A. Hutasuhut, C. Nasution, Sriasrianti, Z. Hidayat, dan H. Al Banna, Kamus Angkola Mandailing-Indonesia. Meda- Sumatera Utara: Balai Bahasa Provinsi Sumatera Utara, Badan Pengembangan dan Pembinaan Bahasa, 2021.
M. P. Lisa Septia Dewi Br. Ginting, S.Pd., Bahasa Bantu Batak Angkola. Medan: Guepedia, 2021.
G. S. Baumi dan P. Ritonga, Pelajaran Adat Tapanuli Selatan Pabagas Boru. Marindal-Medan: Pertama Mitra Sari, 2017.
R. Rosaly dan M. K. Andy Prasetyo,.ST., “Pengertian Flowchart Beserta Fungsi dan Simbol-simbol Flowchart yang Paling Umum Digunakan,” Https://Www.Nesabamedia.Com, vol. 2, hal. 2, 2019, [Daring]. Tersedia pada: https://www.nesabamedia.com/pengertian-flowchart/https://www.nesabamedia.com/pengertian-flowchart/
Bila bermanfaat silahkan share artikel ini
Berikan Komentar Anda terhadap artikel Algoritma Stemming Teks Bahasa Batak Angkola Berbasis Aturan Tata Bahasa
Pages: 642-648
Copyright (c) 2023 Nur Hasanah Hrp, Muhammad Fikry, Yusra Yusra

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).