Personality Detection on Twitter Social Media Using IndoBERT Method
Abstract
Personality is the fundamental characteristic of human beings that makes humans unique. Because of these differences in human characteristics, personality becomes a benchmark for consideration in various recruitment processes. One way to predict personality is to apply an interview system or fill out questionnaires which often experience problems due to ineffectiveness in terms of time and cost. Results become inaccurate if prospective employees do not know themselves well. The big five personality method, divided into openness, conscientiousness, extraversion, agreeableness, and neuroticism, is widely used to predict personality. This study uses a deep learning method, IndoBERT, to detect personality based on five dimensions according to the big five personalities whose data is taken from Twitter tweets with crawling data. From the results of these studies, it is known that personality research using the IndoBERT method without a stemming process has a higher accuracy rate of 0.46.
Downloads
References
C. George Boeree, Personality Theories: Melacak Kepribadian Anda Bersama Psikolog Dunia. prismasophie, 2017.
V. Delima, “Impact of Personality Traits on Employees’ Job Performance in Batticaloa Teaching Hospital,” Jun. 2020.
B. Raad, The Big Five Personality Factors: The psycholexical approach to personality. 2000.
M. NASYROH and R. Wikansari, “Hubungan antara Kepribadian (Big Five Personality Model) dengan Kinerja Karyawan,” Ecopsy, vol. 4, no. 1, pp. 10–16, 2017.
R. Indira and W. Maharani, “Personality Detection on Social Media Twitter Using Long Short-Term Memory with Word2Vec,” in 2021 IEEE International Conference on Communication, Networks and Satellite (COMNETSAT), 2021, pp. 64–69. doi: 10.1109/COMNETSAT53002.2021.9530820.
S. V. Therik and E. B. Setiawan, “DETEKSI KEPRIBADIAN BIG FIVE PENGGUNA TWITTER DENGAN METODE C4.5.”
“• Most spoken languages in the world | Statista.” https://www.statista.com/statistics/266808/the-most-spoken-languages-worldwide/ (accessed Jun. 17, 2022).
B. Wilie et al., “IndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language Understanding,” in Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, Dec. 2020, pp. 843–857. [Online]. Available: https://aclanthology.org/2020.aacl-main.85
H. K. Putra, “Deteksi Penggunaan Kalimat Abusive Pada Teks Bahasa Indonesia Menggunakan Metode IndoBERT,” e-Proceeding of Engineering, vol. 8, pp. 3028–3038, 2021.
“Twitter: most users by country | Statista.” https://www.statista.com/statistics/242606/number-of-active-twitter-users-in-selected-countries/ (accessed Jul. 16, 2022).
O. P. John and S. Srivastava, “The Big-Five Trait Taxonomy: History, Measurement, and Theoretical Perspectives.”
“Twitter Developer Platform overview | Docs | Twitter Developer Platform.” https://developer.twitter.com/en/docs/platform-overview (accessed Jul. 16, 2022).
J. Devlin, M.-W. Chang, K. Lee, K. T. Google, and A. I. Language, “BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding,” Naacl-Hlt 2019, no. Mlm, 2018, Accessed: Jul. 07, 2022. [Online]. Available: https://github.com/tensorflow/tensor2tensor
F. Koto, A. Rahimi, J. H. Lau, and T. Baldwin, “IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP,” in Proceedings of the 28th International Conference on Computational Linguistics, Dec. 2020, pp. 757–770. doi: 10.18653/v1/2020.coling-main.66.
M. Grandini, E. Bagli, and G. Visani, “Metrics for Multi-Class Classification: an Overview,” Aug. 2020.
Bila bermanfaat silahkan share artikel ini
Berikan Komentar Anda terhadap artikel Personality Detection on Twitter Social Media Using IndoBERT Method
Pages: 448-453
Copyright (c) 2022 Tri Ayu Syifa'ur Rohmah, Warih Maharani

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).





















