» Articles » PMID: 35178042

Updated HIV-1 Consensus Sequences Change but Stay Within Similar Distance From Worldwide Samples

Overview
Journal Front Microbiol
Specialty Microbiology
Date 2022 Feb 18
PMID 35178042
Authors
Affiliations
Soon will be listed here.
Abstract

HIV consensus sequences are used in various bioinformatic, evolutionary, and vaccine related research. Since the previous HIV-1 subtype and CRF consensus sequences were constructed in 2002, the number of publicly available HIV-1 sequences have grown exponentially, especially from non-EU and US countries. Here, we reconstruct 90 new HIV-1 subtype and CRF consensus sequences from 3,470 high-quality, representative, full genome sequences in the LANL HIV database. While subtypes and CRFs are unevenly spread across the world, in total 89 countries were represented. For consensus sequences that were based on at least 20 genomes, we found that on average 2.3% (range 0.8-10%) of the consensus genome site states changed from 2002 to 2021, of which about half were nucleotide state differences and the rest insertions and deletions. Interestingly, the 2021 consensus sequences were shorter than in 2002, and compared to 4,674 HIV-1 worldwide genome sequences, the 2021 consensuses were somewhat closer to the worldwide genome sequences, i.e., showing on average fewer nucleotide state differences. Some subtypes/CRFs have had limited geographical spread, and thus sampling of subtypes/CRFs is uneven, at least in part, due to the epidemiological dynamics. Thus, taken as a whole, the 2021 consensus sequences likely are good representations of the typical subtype/CRF genome nucleotide states. The new consensus sequences are available at the LANL HIV database.

Citing Articles

HIV-1 Vif global diversity and possible APOBEC-mediated response since 1980.

Lewitus E, Li Y, Rolland M Virus Evol. 2025; 11(1):veae108.

PMID: 39886100 PMC: 11781276. DOI: 10.1093/ve/veae108.


Comparative Evaluation of Open-Source Bioinformatics Pipelines for Full-Length Viral Genome Assembly.

Zsichla L, Zeeb M, Fazekas D, Ay E, Muller D, Metzner K Viruses. 2025; 16(12.

PMID: 39772134 PMC: 11680378. DOI: 10.3390/v16121824.


Generation of Optimized Consensus Sequences for Hepatitis C virus (HCV) Envelope 2 Glycoprotein (E2) by a Modified Algorithm: Implication for a Pan-genomic HCV Vaccine.

Mohabati R, Rezaei R, Mohajel N, Ranjbar M, Samimi-Rad K, Azadmanesh K Avicenna J Med Biotechnol. 2024; 16(4):268-278.

PMID: 39606685 PMC: 11589427. DOI: 10.18502/ajmb.v16i4.16743.


Contemporary HIV-1 consensus Env with AI-assisted redesigned hypervariable loops promote antibody binding.

Bai H, Lewitus E, Li Y, Thomas P, Zemil M, Merbah M Nat Commun. 2024; 15(1):3924.

PMID: 38724518 PMC: 11082178. DOI: 10.1038/s41467-024-48139-x.


A unified classification system for HIV-1 5' long terminal repeats.

Guo X, Yu D, Liu M, Li H, Chen M, Wang X PLoS One. 2024; 19(5):e0301809.

PMID: 38696412 PMC: 11065288. DOI: 10.1371/journal.pone.0301809.


References
1.
Hemelaar J, Elangovan R, Yun J, Dickson-Tetteh L, Fleminger I, Kirtley S . Global and regional molecular epidemiology of HIV-1, 1990-2015: a systematic review, global survey, and trend analysis. Lancet Infect Dis. 2018; 19(2):143-155. DOI: 10.1016/S1473-3099(18)30647-9. View

2.
Sternke M, Tripp K, Barrick D . Consensus sequence design as a general strategy to create hyperstable, biologically active proteins. Proc Natl Acad Sci U S A. 2019; 116(23):11275-11284. PMC: 6561275. DOI: 10.1073/pnas.1816707116. View

3.
Frith M, Mitsuhashi S, Katoh K . lamassemble: Multiple Alignment and Consensus Sequence of Long Reads. Methods Mol Biol. 2020; 2231:135-145. DOI: 10.1007/978-1-0716-1036-7_9. View

4.
Gao F, Weaver E, Lu Z, Li Y, Liao H, Ma B . Antigenicity and immunogenicity of a synthetic human immunodeficiency virus type 1 group m consensus envelope glycoprotein. J Virol. 2004; 79(2):1154-63. PMC: 538535. DOI: 10.1128/JVI.79.2.1154-1163.2005. View

5.
Seah A, Lim M, McAloose D, Prost S, Seimon T . MinION-Based DNA Barcoding of Preserved and Non-Invasively Collected Wildlife Samples. Genes (Basel). 2020; 11(4). PMC: 7230362. DOI: 10.3390/genes11040445. View