Machine Learning to Identify Persons at High-Risk of Human Immunodeficiency Virus Acquisition in Rural Kenya and Uganda
Overview
Authors
Affiliations
Background: In generalized epidemic settings, strategies are needed to prioritize individuals at higher risk of human immunodeficiency virus (HIV) acquisition for prevention services. We used population-level HIV testing data from rural Kenya and Uganda to construct HIV risk scores and assessed their ability to identify seroconversions.
Methods: During 2013-2017, >75% of residents in 16 communities in the SEARCH study were tested annually for HIV. In this population, we evaluated 3 strategies for using demographic factors to predict the 1-year risk of HIV seroconversion: membership in ≥1 known "risk group" (eg, having a spouse living with HIV), a "model-based" risk score constructed with logistic regression, and a "machine learning" risk score constructed with the Super Learner algorithm. We hypothesized machine learning would identify high-risk individuals more efficiently (fewer persons targeted for a fixed sensitivity) and with higher sensitivity (for a fixed number targeted) than either other approach.
Results: A total of 75 558 persons contributed 166 723 person-years of follow-up; 519 seroconverted. Machine learning improved efficiency. To achieve a fixed sensitivity of 50%, the risk-group strategy targeted 42% of the population, the model-based strategy targeted 27%, and machine learning targeted 18%. Machine learning also improved sensitivity. With an upper limit of 45% targeted, the risk-group strategy correctly classified 58% of seroconversions, the model-based strategy 68%, and machine learning 78%.
Conclusions: Machine learning improved classification of individuals at risk of HIV acquisition compared with a model-based approach or reliance on known risk groups and could inform targeting of prevention strategies in generalized epidemic settings.
Clinical Trials Registration: NCT01864603.
AI applications in HIV research: advances and future directions.
Jin R, Zhang L Front Microbiol. 2025; 16:1541942.
PMID: 40051479 PMC: 11882587. DOI: 10.3389/fmicb.2025.1541942.
Ji X, Tang Z, Osborne S, Van Nguyen T, Mullens A, Dean J Front Public Health. 2025; 12:1511689.
PMID: 39830177 PMC: 11739126. DOI: 10.3389/fpubh.2024.1511689.
Predictors of HIV seroconversion in Botswana.
Cui Y, Moyo S, Pretorius Holme M, Hurwitz K, Choga W, Bennett K AIDS. 2024; 39(3):290-297.
PMID: 39497537 PMC: 11779586. DOI: 10.1097/QAD.0000000000004055.
Rosenberg N, Shook-Sa B, Young A, Zou Y, Stranix-Chibanda L, Yotebieng M Clin Infect Dis. 2024; 79(5):1223-1232.
PMID: 38657086 PMC: 11581698. DOI: 10.1093/cid/ciae211.
Ni Y, Lu Y, Jing F, Wang Q, Xie Y, He X JMIR Public Health Surveill. 2024; 10:e50656.
PMID: 38656769 PMC: 11079758. DOI: 10.2196/50656.