QColors: an Algorithm for Conservative Viral Quasispecies Reconstruction from Short and Non-contiguous Next Generation Sequencing Reads
Overview
Next-generation sequencing technologies have recently been applied to characterize the mutational spectra of the heterogeneous population of viral genotypes (known as a quasispecies) within HIV-infected patients. Such information is clinically relevant because minority genetic subpopulations of HIV within patients enable viral escape from selection pressures such as the immune response and antiretroviral therapy. However, methods for reconstructing quasispecies sequences from next-generation sequencing reads are not yet widely used, and their development remains an emerging area of research. Furthermore, most methodological research in HIV has focused on 454 sequencing, while many next-generation sequencing platforms used in practice produce shorter reads than 454 sequencing does. Little work has been done to determine how best to address the read length limitations of these other platforms. The approach described here uses graph representations of both read differences and read overlap to conservatively identify the regions of the sequence with sufficient variability to separate quasispecies sequences. Within these tractable regions of quasispecies inference, we use constraint programming to solve for an optimal assignment of reads to quasispecies subsequences via vertex coloring of the conflict graph, a representation that also lends itself to data with non-contiguous reads, such as those from paired-end sequencing. We demonstrate the utility of the method by applying it to simulations based on actual intra-patient clonal HIV-1 sequencing data.
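The conflict-graph idea can be illustrated with a small Python sketch. This is not the QColors implementation: it swaps the constraint programming solver for a plain backtracking search, and the read representation (a dictionary from reference position to base), the function names, and the toy reads are all assumptions made for illustration. Two reads are connected when they disagree at a position both cover, so reads that receive the same color are mutually compatible and could belong to the same quasispecies sequence.

```python
from itertools import combinations

def conflicts(a, b):
    """Two reads conflict if they disagree at any position both cover."""
    return any(a[p] != b[p] for p in a.keys() & b.keys())

def build_conflict_graph(reads):
    """Adjacency sets: an edge joins every pair of conflicting reads."""
    graph = {i: set() for i in range(len(reads))}
    for i, j in combinations(range(len(reads)), 2):
        if conflicts(reads[i], reads[j]):
            graph[i].add(j)
            graph[j].add(i)
    return graph

def color_graph(graph, k):
    """Return a proper coloring with at most k colors, or None if none exists."""
    order = sorted(graph, key=lambda v: -len(graph[v]))  # high degree first
    colors = {}

    def assign(idx):
        if idx == len(order):
            return True
        v = order[idx]
        used = {colors[u] for u in graph[v] if u in colors}
        # Symmetry breaking: open at most one previously unused color.
        limit = min(k, max(colors.values(), default=-1) + 2)
        for c in range(limit):
            if c not in used:
                colors[v] = c
                if assign(idx + 1):
                    return True
                del colors[v]
        return False

    return dict(colors) if assign(0) else None

def min_coloring(graph):
    """Try k = 1, 2, ... until a proper coloring exists (exponential worst case)."""
    for k in range(1, len(graph) + 1):
        coloring = color_graph(graph, k)
        if coloring is not None:
            return k, coloring
    return 0, {}

# Hypothetical toy reads; position gaps mimic non-contiguous (paired-end) reads.
reads = [
    {0: "A", 1: "C", 2: "G"},
    {1: "C", 2: "G", 3: "T"},
    {0: "A", 1: "T", 2: "G"},
    {2: "G", 3: "T", 10: "A"},
    {1: "T", 10: "C"},
]
graph = build_conflict_graph(reads)
k, coloring = min_coloring(graph)
```

Because each read is keyed by position rather than stored as a contiguous string, a read pair with an unsequenced gap needs no special handling in this sketch: only positions covered by both reads are compared. The number of colors found is a lower bound on the number of distinct sequences needed to explain the reads in the region, under the simplifying assumption that the reads are error-free.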