Identification of Big Five Personality on Twitter Users using the AdaBoost Method


  • Ajeung Angsaweni * Mail Telkom University, Bandung, Indonesia
  • Warih Maharani Telkom University, Bandung, Indonesia
  • (*) Corresponding Author
Keywords: Twitter; AdaBoost; Big Five

Abstract

Social media is one of the lifestyles in the modern era that uses web-based technology for social interaction. As one of the most popular social media platforms, Twitter allows users to express themselves through tweets that can show their personality. The Big Five theory states that a person’s personality is divided into five dimensions: openness, conscientiousness, extraversion, agreeableness, and neuroticism. Several methods have been used to conduct user personality research based on activity on social media. The AdaBoost method is used in this study to identify the personality of Twitter users using sentiment, emotion, social, PCA, and POS-tag features. There are two test scenarios in this study. The first is testing the AdaBoost model with all features, and the second is testing the AdaBoost model with a combination of three features. The research indicates that the data preprocessing method can affect the model. The results showed that the AdaBoost model with all the features and without the stemming process had the highest accuracy value of 53.57%.

Downloads

Download data is not yet available.

References

S. Kemp, “Digital 2022: Indonesia — DataReportal – Global Digital Insights,” Feb. 15, 2022. https://datareportal.com/reports/digital-2022-indonesia (accessed May 07, 2022).

N. Majumder, S. Poria, A. Gelbukh, and E. Cambria, “Deep Learning-Based Document Modeling for Personality Detection from Text,” IEEE Intelligent Systems, vol. 32, no. 2, pp. 74–79, Mar. 2017, doi: 10.1109/MIS.2017.23.

Y. Mehta, N. Majumder, A. Gelbukh, and E. Cambria, “Recent trends in deep learning based personality detection,” Artificial Intelligence Review, vol. 53, no. 4, pp. 2313–2339, Apr. 2020, doi: 10.1007/s10462-019-09770-z.

W. Maharani and V. Effendy, “Big five personality prediction based in Indonesian tweets using machine learning methods,” International Journal of Electrical and Computer Engineering, vol. 12, no. 2, pp. 1973–1981, Apr. 2022, doi: 10.11591/ijece.v12i2.pp1973-1981.

R. Indira and W. Maharani, “Personality Detection on Social Media Twitter Using Long Short-Term Memory with Word2Vec,” in 2021 IEEE International Conference on Communication, Networks and Satellite (COMNETSAT), 2021, pp. 64–69. doi: 10.1109/COMNETSAT53002.2021.9530820.

R. P. Pratama and W. Maharani, “Predicting Big Five Personality Traits Based on Twitter User Using Random Forest Method,” in 2021 International Conference on Data Science and Its Applications (ICoDSA), 2021, pp. 110–117. doi: 10.1109/ICoDSA53588.2021.9617501.

N. Abood, “Big Five Traits: A Critical Review,” Gadjah Mada International Journal of Business, vol. 21, no. 2, pp. 159–186, 2019, doi: 10.22146/gamaijb.34931.

Z. Mushtaq, S. Ashraf, and N. Sabahat, “Predicting MBTI Personality type with K-means Clustering and Gradient Boosting,” in Proceedings - 2020 23rd IEEE International Multi-Topic Conference, INMIC 2020, Nov. 2020, pp. 1–5. doi: 10.1109/INMIC50486.2020.9318078.

R. Moraes, L. L. Pinto, M. Pilankar, and P. Rane, “Personality Assessment Using Social Media for Hiring Candidates,” in 2020 3rd International Conference on Communication Systems, Computing and IT Applications (CSCITA), 2020, pp. 192–197. doi: 10.1109/CSCITA47329.2020.9137818.

A. Dwi Hartanto, E. Utami, S. Adi, S. Raharjo, M. Yusa, and A. Kamaludin, “Classifying User Personality Based on Media Social Posts Using Support Vector Machine Algorithm Based on DISC Approach,” in 2020 2nd International Conference on Cybernetics and Intelligent System, ICORIS 2020, Oct. 2020, pp. 1–4. doi: 10.1109/ICORIS50180.2020.9320760.

H. Setiawan and A. A. Wafi, “Classification of Personality Type Based on Twitter Data Using Machine Learning Techniques,” in 2020 3rd International Conference on Information and Communications Technology, ICOIACT 2020, Nov. 2020, pp. 94–98. doi: 10.1109/ICOIACT50329.2020.9332152.

P. S. Dandannavar, S. R. Mangalwede, and P. M. Kulkarni, “Social Media Text - A Source for Personality Prediction,” in 2018 International Conference on Computational Techniques, Electronics and Mechanical Systems (CTEMS), 2018, pp. 62–65. doi: 10.1109/CTEMS.2018.8769304.

A. v. Kunte and S. Panicker, “Using textual data for Personality Prediction: A Machine Learning Approach,” in 2019 4th International Conference on Information Systems and Computer Networks (ISCON), 2019, pp. 529–533. doi: 10.1109/ISCON47742.2019.9036220.

V. Kotu and B. Deshpande, “Classification,” Data Science, pp. 65–163, Jan. 2019, doi: 10.1016/B978-0-12-814761-0.00004-6.

I. H. Witten, E. Frank, M. A. Hall, and C. J. Pal, “Ensemble learning,” Data Mining, pp. 479–501, Jan. 2017, doi: 10.1016/B978-0-12-804291-5.00012-X.

Y. Zhang et al., Research and Application of AdaBoost Algorithm Based on SVM. 2019.

V. N. Gudivada, D. L. Rao, and A. R. Gudivada, “Information Retrieval: Concepts, Models, and Systems,” Handbook of Statistics, vol. 38, pp. 331–401, Jan. 2018, doi: 10.1016/BS.HOST.2018.07.009.

“Sentiment and emotion lexicons - National Research Council Canada.” https://nrc.canada.ca/en/research-development/products-services/technical-advisory-services/sentiment-emotion-lexicons (accessed Jun. 23, 2022).

“nlp-id · PyPI.” https://pypi.org/project/nlp-id/ (accessed Jul. 05, 2022).

D. Sarkar and V. Natarajan, “Ensemble Machine Learning Cookbook,” 2019.

“1.11. Ensemble methods — scikit-learn 1.1.1 documentation.” https://scikit-learn.org/stable/modules/ensemble.html#adaboost (accessed Jun. 27, 2022).

J. Moreira, A. C. P. de L. F. Carvalho, and T. Horváth, A general introduction to data analytics. John Wiley & Sons, Inc., 2018.

G. Shobha and S. Rangaswamy, “Machine Learning,” Handbook of Statistics, vol. 38, pp. 197–228, Jan. 2018, doi: 10.1016/BS.HOST.2018.07.004.


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Identification of Big Five Personality on Twitter Users using the AdaBoost Method

Dimensions Badge
Article History
Submitted: 2022-07-12
Published: 2022-09-30
Abstract View: 4 times
PDF Download: 1 times
How to Cite
Angsaweni, A., & Maharani, W. (2022). Identification of Big Five Personality on Twitter Users using the AdaBoost Method. Building of Informatics, Technology and Science (BITS), 4(2), 377-383. https://doi.org/10.47065/bits.v4i2.1853
Section
Articles

Most read articles by the same author(s)

1 2 > >>