» Articles » PMID: 27295635

Decoding Genetic Variations: Communications-Inspired Haplotype Assembly

Overview
Specialty Biology
Date 2016 Jun 14
PMID 27295635
Citations 5
Authors
Affiliations
Soon will be listed here.
Abstract

High-throughput DNA sequencing technologies allow fast and affordable sequencing of individual genomes and thus enable unprecedented studies of genetic variations. Information about variations in the genome of an individual is provided by haplotypes, ordered collections of single nucleotide polymorphisms. Knowledge of haplotypes is instrumental in finding genes associated with diseases, drug development, and evolutionary studies. Haplotype assembly from high-throughput sequencing data is challenging due to errors and limited lengths of sequencing reads. The key observation made in this paper is that the minimum error-correction formulation of the haplotype assembly problem is identical to the task of deciphering a coded message received over a noisy channel-a classical problem in the mature field of communication theory. Exploiting this connection, we develop novel haplotype assembly schemes that rely on the bit-flipping and belief propagation algorithms often used in communication systems. The latter algorithm is then adapted to the haplotype assembly of polyploids. We demonstrate on both simulated and experimental data that the proposed algorithms compare favorably with state-of-the-art haplotype assembly methods in terms of accuracy, while being scalable and computationally efficient.

Citing Articles

A chaotic viewpoint-based approach to solve haplotype assembly using hypergraph model.

Olyaee M, Khanteymoori A, Khalifeh K PLoS One. 2020; 15(10):e0241291.

PMID: 33120403 PMC: 7595403. DOI: 10.1371/journal.pone.0241291.


ComHapDet: a spatial community detection algorithm for haplotype assembly.

Sankararaman A, Vikalo H, Baccelli F BMC Genomics. 2020; 21(Suppl 9):586.

PMID: 32900369 PMC: 7488034. DOI: 10.1186/s12864-020-06935-x.


Long-read sequence and assembly of segmental duplications.

Vollger M, Dishuck P, Sorensen M, Welch A, Dang V, Dougherty M Nat Methods. 2018; 16(1):88-94.

PMID: 30559433 PMC: 6382464. DOI: 10.1038/s41592-018-0236-3.


Sparse Tensor Decomposition for Haplotype Assembly of Diploids and Polyploids.

Hashemi A, Zhu B, Vikalo H BMC Genomics. 2018; 19(Suppl 4):191.

PMID: 29589554 PMC: 5872563. DOI: 10.1186/s12864-018-4551-y.


Resolving multicopy duplications using polyploid phasing.

Chaisson M, Mukherjee S, Kannan S, Eichler E Res Comput Mol Biol. 2017; 10229:117-133.

PMID: 28808695 PMC: 5553120. DOI: 10.1007/978-3-319-56970-3_8.