Performance Evaluation of Machine Learning Models for HIV/AIDS Classification


  • Gregorius Airlangga * Mail Atma Jaya Catholic University of Indonesia, Jakarta, Indonesia
  • (*) Corresponding Author
Keywords: Machine Learning; HIV/AIDS Classification; Diagnostic Models; Comparative Analysis; Evaluation Metrics

Abstract

Accurate and early diagnosis of HIV/AIDS is critical for effective treatment and reducing disease transmission. This study evaluates the performance of several machine learning models, including Logistic Regression, Decision Tree, Random Forest, Gradient Boosting, Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and Naive Bayes, for classifying HIV/AIDS infection status. A dataset comprising 50,000 samples was used, and models were assessed based on accuracy, precision, recall, and F1 score using stratified ten-fold cross-validation to ensure robust evaluation. The results reveal significant trade-offs between sensitivity and specificity across the models. Gradient Boosting achieved the highest accuracy (70.85%) and precision (57.81%), making it suitable for confirmatory testing where minimizing false positives is critical. Conversely, Naive Bayes demonstrated the highest recall (57.99%) and F1 score (51.04%), emphasizing its effectiveness in early-stage diagnostics where sensitivity is paramount. SVM exhibited the highest precision (59.87%) but struggled with recall (11.28%), reflecting its conservative nature in classifying positive cases. These findings underscore the importance of selecting models tailored to specific diagnostic objectives. While Naive Bayes is ideal for comprehensive screening programs, Gradient Boosting and SVM are better suited for confirmatory testing. This research provides valuable insights into the strengths and limitations of machine learning models for medical diagnostics, paving the way for developing more robust, hybrid approaches to optimize sensitivity and specificity in HIV/AIDS classification.

Downloads

Download data is not yet available.

References

M. Mark, “The international problem of HIV/AIDS in the modern world: a comprehensive review of political, economic, and social impacts,” Res Output J Public Heal. Med, vol. 42, pp. 47–52, 2024.

K. Balogun and P. R. Slev, “Towards achieving the end of the HIV epidemic: advances, challenges and scaling-up strategies,” Clin. Biochem., vol. 117, pp. 53–59, 2023.

B. Parekh, L. Westerman, L. Vojnov, and C. Yang, “Human immunodeficiency virus (HIV) infections,” Lab. point-of-care diagnostic Test. Sex. Transm. Infect. Incl. HIV, p. 247, 2023.

J. Prajapati, A. Kumari, and others, “Navigating the Spectrum: A Comprehensive Review of HIV Detection Methods,” 2024.

K. Lakshmanan and B. M. Liu, “Impact of Point-of-Care Testing on Diagnosis, Treatment, and Surveillance of Vaccine-Preventable Viral Infections,” Diagnostics, vol. 15, no. 2, p. 123, 2025.

E. I. Obeagu and G. U. Obeagu, “Early Infant Diagnosis: A Crucial Step in Halting HIV Transmission,” Elit. J. Heal. Sci., vol. 1, no. 1, pp. 1–11, 2023.

S. Chakraborty, “Democratizing nucleic acid-based molecular diagnostic tests for infectious diseases at resource-limited settings--from point of care to extreme point of care,” Sensors & Diagnostics, vol. 3, no. 4, pp. 536–561, 2024.

K. Patel et al., “Forty years since the epidemic: modern paradigms in HIV diagnosis and treatment,” Cureus, vol. 13, no. 5, 2021.

A. Bacon et al., “Review of HIV self testing technologies and promising approaches for the next generation,” Biosensors, vol. 13, no. 2, p. 298, 2023.

A. Padhi, A. Agarwal, S. K. Saxena, and C. D. S. Katoch, “Transforming clinical virology with AI, machine learning and deep learning: a comprehensive review and outlook,” VirusDisease, vol. 34, no. 3, pp. 345–355, 2023.

O. D. Balogun et al., “Integrating ai into health informatics for enhanced public health in Africa: a comprehensive review,” Int. Med. Sci. Res. J., vol. 3, no. 3, pp. 127–144, 2023.

A. M. Zaidan, “The leading global health challenges in the artificial intelligence era,” Front. Public Heal., vol. 11, p. 1328918, 2023.

A. Sharma, A. Lysenko, S. Jia, K. A. Boroevich, and T. Tsunoda, “Advances in AI and machine learning for predictive medicine,” J. Hum. Genet., pp. 1–11, 2024.

K. P. Reddy, M. Satish, A. Prakash, S. M. Babu, P. P. Kumar, and B. S. Devi, “Machine Learning Revolution in Early Disease Detection for Healthcare: Advancements, Challenges, and Future Prospects,” in 2023 IEEE 5th International Conference on Cybernetics, Cognition and Machine Learning Applications (ICCCMLA), 2023, pp. 638–643.

S. Asif et al., “Advancements and Prospects of Machine Learning in Medical Diagnostics: Unveiling the Future of Diagnostic Precision,” Arch. Comput. Methods Eng., pp. 1–31, 2024.

M. M. Siddique, M. M. Bin Seraj, M. N. Adnan, and S. M. Galib, “Artificial Intelligence for Infectious Disease Detection: Prospects and Challenges,” Surveillance, Prev. Control Infect. Dis. An AI Perspect., pp. 1–22, 2024.

N. Hong et al., “State of the art of machine learning--enabled clinical decision support in intensive care units: literature review,” JMIR Med. informatics, vol. 10, no. 3, p. e28781, 2022.

G. Obaido et al., “An interpretable machine learning approach for hepatitis b diagnosis,” Appl. Sci., vol. 12, no. 21, p. 11127, 2022.

H. H. Rashidi, L. T. Dang, S. Albahra, R. Ravindran, and I. H. Khan, “Automated machine learning for endemic active tuberculosis prediction from multiplex serological data,” Sci. Rep., vol. 11, no. 1, p. 17900, 2021.

K. I. Sahibzada et al., “HIV OctaScanner: A Machine Learning Approach to Unveil Proteolytic Cleavage Dynamics in HIV-1 Protease Substrates,” J. Chem. Inf. Model., 2025.

S. Sorooshian, “The Sustainable Development Goals of the United Nations: A Comparative Midterm Research Review,” J. Clean. Prod., p. 142272, 2024.

B. Y. F. Fong, V. T. S. Law, T. C. H. Leung, M. F. Lo, T. K. C. Ng, and H. H. L. Yee, Sustainable development goal 3: Health and well-being of ageing in Hong Kong. Taylor & Francis, 2022.

G. Venkatesh, “A brief analysis of SDG 3--Good health and well-being--and its synergies and trade-offs with the other Sustainable Development Goals,” Probl. Ekorozwoju, vol. 17, no. 2, 2022.

N. Rane, S. Choudhary, and J. Rane, “Ensemble Deep Learning and Machine Learning: Applications, Opportunities, Challenges, and Future Directions,” Oppor. Challenges, Futur. Dir. (May 31, 2024), 2024.

V. R. Modhugu and S. Ponnusamy, “Comparative Analysis of Machine Learning Algorithms for Liver Disease Prediction: SVM, Logistic Regression, and Decision Tree,” Asian J. Res. Comput. Sci., vol. 17, no. 6, pp. 188–201, 2024.

Z. Rahmatinejad et al., “A comparative study of explainable ensemble learning and logistic regression for predicting in-hospital mortality in the emergency department,” Sci. Rep., vol. 14, no. 1, p. 3406, 2024.

J. Pachouly, S. Ahirrao, K. Kotecha, G. Selvachandran, and A. Abraham, “A systematic literature review on software defect prediction using artificial intelligence: Datasets, Data Validation Methods, Approaches, and Tools,” Eng. Appl. Artif. Intell., vol. 111, p. 104773, 2022.

A. Khraisat and A. Alazab, “A critical review of intrusion detection systems in the internet of things: techniques, deployment strategy, validation strategy, attacks, public datasets and challenges,” Cybersecurity, vol. 4, pp. 1–27, 2021.

K. Li et al., “Colonoscopy polyp detection and classification: Dataset creation and comparative evaluations,” PLoS One, vol. 16, no. 8, p. e0255809, 2021.

T. Kyriazos and M. Poga, “Application of machine learning models in social sciences: managing nonlinear relationships,” Encyclopedia, vol. 4, no. 4, pp. 1790–1805, 2024.

B. Koçak, “Key concepts, common pitfalls, and best practices in artificial intelligence and machine learning: focus on radiomics,” Diagnostic Interv. Radiol., vol. 28, no. 5, p. 450, 2022.

C. Aliferis and G. Simon, “Overfitting, Underfitting and General Model Overconfidence and Under-Performance Pitfalls and Best Practices in Machine Learning and AI,” in Artificial Intelligence and Machine Learning in Health Care and Medical Sciences: Best Practices and Pitfalls, Springer, 2024, pp. 477–524.

A. Velu, “AIDS Virus Infection Prediction.” Kaggle, 2023. {https://www.kaggle.com/datasets/aadarshvelu/aids-virus-infection-prediction


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Performance Evaluation of Machine Learning Models for HIV/AIDS Classification

Dimensions Badge
Article History
Submitted: 2025-01-19
Published: 2025-01-31
Abstract View: 71 times
PDF Download: 44 times
How to Cite
Airlangga, G. (2025). Performance Evaluation of Machine Learning Models for HIV/AIDS Classification. Journal of Information System Research (JOSH), 6(2), 1364-1371. https://doi.org/10.47065/josh.v6i2.6790
Issue
Section
Articles

Most read articles by the same author(s)

1 2 > >>