Parking Strategies for Genome Sequencing
Overview
Authors
Affiliations
The parking strategy is an iterative approach to DNA sequencing. Each iteration consists of sequencing a novel portion of target DNA that does not overlap any previously sequenced region. Subject to the constraint of no overlap, each new region is chosen randomly. A parking strategy is often ideal in the early stages of a project for rapidly generating unique data. As a project progresses, parking becomes progressively more expensive and eventually prohibitive. We present a mathematical model with a generalization to allow for overlaps. This model predicts multiple parameters, including progress, costs, and the distribution of gap sizes left by a parking strategy. The highly fragmented nature of the gaps left after an initial parking strategy may make it difficult to finish a project efficiently. Therefore, in addition to our parking model, we model gap closing by walking. Our gap-closing model is generalizable to many other strategies. Our discussion includes modified parking strategies and hybrids with other strategies. A hybrid parking strategy has been employed for portions of the Human Genome Project.
Whole-genome haplotyping approaches and genomic medicine.
Glusman G, Cox H, Roach J Genome Med. 2014; 6(9):73.
PMID: 25473435 PMC: 4254418. DOI: 10.1186/s13073-014-0073-7.
Whole-genome sequencing and assembly with high-throughput, short-read technologies.
Sundquist A, Ronaghi M, Tang H, Pevzner P, Batzoglou S PLoS One. 2007; 2(5):e484.
PMID: 17534434 PMC: 1871613. DOI: 10.1371/journal.pone.0000484.
Yang T, Yu Y, Nah G, Atkins M, Lee S, Frisch D Theor Appl Genet. 2003; 107(4):652-60.
PMID: 12783166 DOI: 10.1007/s00122-003-1302-4.