» Articles » PMID: 31697383

Machine Learning to Identify Persons at High-Risk of Human Immunodeficiency Virus Acquisition in Rural Kenya and Uganda

Abstract

Background: In generalized epidemic settings, strategies are needed to prioritize individuals at higher risk of human immunodeficiency virus (HIV) acquisition for prevention services. We used population-level HIV testing data from rural Kenya and Uganda to construct HIV risk scores and assessed their ability to identify seroconversions.

Methods: During 2013-2017, >75% of residents in 16 communities in the SEARCH study were tested annually for HIV. In this population, we evaluated 3 strategies for using demographic factors to predict the 1-year risk of HIV seroconversion: membership in ≥1 known "risk group" (eg, having a spouse living with HIV), a "model-based" risk score constructed with logistic regression, and a "machine learning" risk score constructed with the Super Learner algorithm. We hypothesized machine learning would identify high-risk individuals more efficiently (fewer persons targeted for a fixed sensitivity) and with higher sensitivity (for a fixed number targeted) than either other approach.

Results: A total of 75 558 persons contributed 166 723 person-years of follow-up; 519 seroconverted. Machine learning improved efficiency. To achieve a fixed sensitivity of 50%, the risk-group strategy targeted 42% of the population, the model-based strategy targeted 27%, and machine learning targeted 18%. Machine learning also improved sensitivity. With an upper limit of 45% targeted, the risk-group strategy correctly classified 58% of seroconversions, the model-based strategy 68%, and machine learning 78%.

Conclusions: Machine learning improved classification of individuals at risk of HIV acquisition compared with a model-based approach or reliance on known risk groups and could inform targeting of prevention strategies in generalized epidemic settings.

Clinical Trials Registration: NCT01864603.

Citing Articles

AI applications in HIV research: advances and future directions.

Jin R, Zhang L Front Microbiol. 2025; 16:1541942.

PMID: 40051479 PMC: 11882587. DOI: 10.3389/fmicb.2025.1541942.


STI/HIV risk prediction model development-A novel use of public data to forecast STIs/HIV risk for men who have sex with men.

Ji X, Tang Z, Osborne S, Van Nguyen T, Mullens A, Dean J Front Public Health. 2025; 12:1511689.

PMID: 39830177 PMC: 11739126. DOI: 10.3389/fpubh.2024.1511689.


Predictors of HIV seroconversion in Botswana.

Cui Y, Moyo S, Pretorius Holme M, Hurwitz K, Choga W, Bennett K AIDS. 2024; 39(3):290-297.

PMID: 39497537 PMC: 11779586. DOI: 10.1097/QAD.0000000000004055.


A Human Immunodeficiency Virus Type 1 Risk Assessment Tool for Women Aged 15-49 Years in African Countries: A Pooled Analysis Across 15 Nationally Representative Surveys.

Rosenberg N, Shook-Sa B, Young A, Zou Y, Stranix-Chibanda L, Yotebieng M Clin Infect Dis. 2024; 79(5):1223-1232.

PMID: 38657086 PMC: 11581698. DOI: 10.1093/cid/ciae211.


A Machine Learning Model for Identifying Sexual Health Influencers to Promote the Secondary Distribution of HIV Self-Testing Among Gay, Bisexual, and Other Men Who Have Sex With Men in China: Quasi-Experimental Study.

Ni Y, Lu Y, Jing F, Wang Q, Xie Y, He X JMIR Public Health Surveill. 2024; 10:e50656.

PMID: 38656769 PMC: 11079758. DOI: 10.2196/50656.


References
1.
Laupacis A, Sekar N, Stiell I . Clinical prediction rules. A review and suggested modifications of methodological standards. JAMA. 1997; 277(6):488-94. View

2.
Krakower D, Gruber S, Hsu K, Menchaca J, Maro J, Kruskal B . Development and validation of an automated HIV prediction algorithm to identify candidates for pre-exposure prophylaxis: a modelling study. Lancet HIV. 2019; 6(10):e696-e704. PMC: 7522919. DOI: 10.1016/S2352-3018(19)30139-0. View

3.
Zheng W, Balzer L, van der Laan M, Petersen M . Constrained binary classification using ensemble learning: an application to cost-efficient targeted PrEP strategies. Stat Med. 2017; 37(2):261-279. PMC: 5701877. DOI: 10.1002/sim.7296. View

4.
Chamie G, Clark T, Kabami J, Kadede K, Ssemmondo E, Steinfeld R . A hybrid mobile approach for population-wide HIV testing in rural east Africa: an observational study. Lancet HIV. 2016; 3(3):e111-9. PMC: 4780220. DOI: 10.1016/S2352-3018(15)00251-9. View

5.
Marcus J, Hurley L, Krakower D, Alexeeff S, Silverberg M, Volk J . Use of electronic health record data and machine learning to identify candidates for HIV pre-exposure prophylaxis: a modelling study. Lancet HIV. 2019; 6(10):e688-e695. PMC: 7152802. DOI: 10.1016/S2352-3018(19)30137-7. View