Natural Language Processing Ekstraksi Akronim Dan Ekspansi Pada Artikel Berbahasa Indonesia Menggunakan Metode Text Mining Dan Term Frequency-Inverse Document Frequency
Abstract
Acronyms are abbreviations of combinations of several letters or syllables written and pronounced as words according to the phonological rules of the affected language. The extension of an acronym is called expansion. Acronym extraction and expansion is one of the text mining tasks in the field of information retrieval used in search engines. Search engines require a database of acronyms and expansion in determining search results for relevant information. The problem is that it often occurs when someone or a researcher makes a scientific work, especially research in Indonesia, which ignores the extraction of acronyms from each word used or is not quite right, so a way is needed to overcome this by creating an application or media to detect the extraction of the acronym using applying the Text Mining Algorithm and Term Frequency-Inverse Document Frequency (TF-IDF). Based on the problems contained in this research, the author is interested in conducting research on a thesis with the title "Natural Language Processing Acronym Extraction and Expansion in Indonesian Articles Using Text Mining Methods and Term Frequency-Inverse Document Frequency (TF-IDF)". Based on the results of calculations with TF-IDF, in acronym extraction and expansion, the weight value obtained is with a weight value of -0.053. Based on this, the extracted sentence is obtained.
References
A. F. Harahap and G. L. Ginting, “Penerapan Algoritma RAITA pada Kamus Akronim Bahasa Indonesia Berbasis Android,” TIN Terap. Inform…, vol. 1, no. 3, 2020, [Online]. Available: http://ejurnal.seminar- id.com/index.php/tin/article/view/426%0Ahttp://ejurnal.seminar- id.com/index.php/tin/article/download/426/276
M. P. Simatupang and D. P. Utomo, “Analisa Testimonial Dengan Menggunakan Algoritma Text Mining Dan Term Frequency- Inverse Document Frequence (Tf-Idf) Pada Toko Allmeeart,” KOMIK (Konferensi Nas. Teknol. Inf. dan Komputer), vol. 3, no. 1, pp. 808–814, 2019, doi: 10.30865/komik.v3i1.1697.
T. P. Lestari, “Analisis Text Mining pada Sosial Media Twitter Menggunakan Metode Support Vector Machine (SVM) dan Social Network Analysis (SNA),” J. Inform. Ekon. Bisnis, vol. 4, no. 3, pp. 65–71, 2022, doi: 10.37034/infeb.v4i3.146.
R. Yusuf, T. A. Saputri, and A. A. Wicaksono, “Penerapan Natural Language Processing Berbasis Virtual Assistant Pada Bagian Administrasi Akademik Stmik Dharma Wacana,” Int. Res. Big-Data Comput. Technol. I- Robot, vol. 5, no. 1, pp. 33–47, 2022, doi: 10.53514/ir.v5i1.228.
Yosi Arisanti Linda, “104 | J u r n a l L I T E R A S I Volume 2 | Nomor 2 | Oktober 2018,” J u r n a l L I T E R A S I, vol. 2, pp. 104–112, 2018.
R. Menaha and V. E. Jayanthi, “A Survey on Acronym – Expansion Mining Approaches from Text and Web,” 1921.
R. A. Sasmita, A. Z. Falani, F. I. Komputer, U. N. Surabaya, and T. Mining, “Pemanfaatan algoritma tf/idf pada sistem informasi ecomplaint handling,” vol. 27, no. 1, pp. 27–33, 2018.
H. Sari, G. L. Ginting, and T. Zebua, “Penerapan Algoritma Text Mining dan TF-IDF Untuk Pengelompokan Topik Skripsi Pada Aplikasi Repository STMIK Budi Darma,” vol. 2, no. 7, pp. 414–432, 2021.
Bila bermanfaat silahkan share artikel ini
Berikan Komentar Anda terhadap artikel Natural Language Processing Ekstraksi Akronim Dan Ekspansi Pada Artikel Berbahasa Indonesia Menggunakan Metode Text Mining Dan Term Frequency-Inverse Document Frequency
Pages: 85 - 94
Copyright (c) 2024 bahrus sobri pulungan

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).


