Natural Language Processing Ekstraksi Akronim Dan Ekspansi Pada Artikel Berbahasa Indonesia Menggunakan Metode Text Mining Dan Term Frequency-Inverse Document Frequency


  • bahrus sobri pulungan * Mail Universitas Budi Darma, Medan, Indonesia
  • (*) Corresponding Author
Keywords: Extraction; Acronyms; Expansion; Articles; Indonesian

Abstract

Acronyms are abbreviations of combinations of several letters or syllables written and pronounced as words according to the phonological rules of the affected language. The extension of an acronym is called expansion. Acronym extraction and expansion is one of the text mining tasks in the field of information retrieval used in search engines. Search engines require a database of acronyms and expansion in determining search results for relevant information. The problem is that it often occurs when someone or a researcher makes a scientific work, especially research in Indonesia, which ignores the extraction of acronyms from each word used or is not quite right, so a way is needed to overcome this by creating an application or media to detect the extraction of the acronym using applying the Text Mining Algorithm and Term Frequency-Inverse Document Frequency (TF-IDF). Based on the problems contained in this research, the author is interested in conducting research on a thesis with the title "Natural Language Processing Acronym Extraction and Expansion in Indonesian Articles Using Text Mining Methods and Term Frequency-Inverse Document Frequency (TF-IDF)". Based on the results of calculations with TF-IDF, in acronym extraction and expansion, the weight value obtained is with a weight value of -0.053. Based on this, the extracted sentence is obtained.

References

A. F. Harahap and G. L. Ginting, “Penerapan Algoritma RAITA pada Kamus Akronim Bahasa Indonesia Berbasis Android,” TIN Terap. Inform…, vol. 1, no. 3, 2020, [Online]. Available: http://ejurnal.seminar- id.com/index.php/tin/article/view/426%0Ahttp://ejurnal.seminar- id.com/index.php/tin/article/download/426/276

M. P. Simatupang and D. P. Utomo, “Analisa Testimonial Dengan Menggunakan Algoritma Text Mining Dan Term Frequency- Inverse Document Frequence (Tf-Idf) Pada Toko Allmeeart,” KOMIK (Konferensi Nas. Teknol. Inf. dan Komputer), vol. 3, no. 1, pp. 808–814, 2019, doi: 10.30865/komik.v3i1.1697.

T. P. Lestari, “Analisis Text Mining pada Sosial Media Twitter Menggunakan Metode Support Vector Machine (SVM) dan Social Network Analysis (SNA),” J. Inform. Ekon. Bisnis, vol. 4, no. 3, pp. 65–71, 2022, doi: 10.37034/infeb.v4i3.146.

R. Yusuf, T. A. Saputri, and A. A. Wicaksono, “Penerapan Natural Language Processing Berbasis Virtual Assistant Pada Bagian Administrasi Akademik Stmik Dharma Wacana,” Int. Res. Big-Data Comput. Technol. I- Robot, vol. 5, no. 1, pp. 33–47, 2022, doi: 10.53514/ir.v5i1.228.

Yosi Arisanti Linda, “104 | J u r n a l L I T E R A S I Volume 2 | Nomor 2 | Oktober 2018,” J u r n a l L I T E R A S I, vol. 2, pp. 104–112, 2018.

R. Menaha and V. E. Jayanthi, “A Survey on Acronym – Expansion Mining Approaches from Text and Web,” 1921.

R. A. Sasmita, A. Z. Falani, F. I. Komputer, U. N. Surabaya, and T. Mining, “Pemanfaatan algoritma tf/idf pada sistem informasi ecomplaint handling,” vol. 27, no. 1, pp. 27–33, 2018.

H. Sari, G. L. Ginting, and T. Zebua, “Penerapan Algoritma Text Mining dan TF-IDF Untuk Pengelompokan Topik Skripsi Pada Aplikasi Repository STMIK Budi Darma,” vol. 2, no. 7, pp. 414–432, 2021.


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Natural Language Processing Ekstraksi Akronim Dan Ekspansi Pada Artikel Berbahasa Indonesia Menggunakan Metode Text Mining Dan Term Frequency-Inverse Document Frequency

Dimensions Badge
Article History
Submitted: 2024-11-13
Published: 2024-08-30
Abstract View: 316 times
PDF Download: 336 times
Section
Articles