» Articles » PMID: 10613838

Sequencing a Genome by Walking with Clone-end Sequences: a Mathematical Analysis

Overview
Journal Genome Res
Specialty Genetics
Date 1999 Dec 30
PMID 10613838
Citations 13
Authors
Affiliations
Soon will be listed here.
Abstract

One approach to sequencing a large genome is (1) to sequence a collection of nonoverlapping "seeds" chosen from a genomic library of large-insert clones [such as bacterial artificial chromosomes (BACs)] and then (2) to take successive "walking" steps by selecting and sequencing minimally overlapping clones, using information such as clone-end sequences to identify the overlaps. In this paper we analyze the strategic issues involved in using this approach. We derive formulas showing how two key factors, the initial density of seed clones and the depth of the genomic library used for walking, affect the cost and time of a sequencing project-that is, the amount of redundant sequencing and the number of steps to cover the vast majority of the genome. We also discuss a variant strategy in which a second genomic library with clones having a somewhat smaller insert size is used to close gaps. This approach can dramatically decrease the amount of redundant sequencing, without affecting the rate at which the genome is covered.

Citing Articles

New insights on the phylogeny, evolutionary history, and ecological adaptation mechanism in cycle-cup oaks based on chloroplast genomes.

Li Y, Zheng S, Wang T, Liu M, Kozlowski G, Yi L Ecol Evol. 2024; 14(9):e70318.

PMID: 39290669 PMC: 11407850. DOI: 10.1002/ece3.70318.


Complete Chloroplast Genomes of Four Oaks from the Section Improve the Phylogenetic Analysis and Understanding of Evolutionary Processes in the Genus .

Wang L, Li Y, Zheng S, Kozlowski G, Xu J, Song Y Genes (Basel). 2024; 15(2).

PMID: 38397219 PMC: 10888318. DOI: 10.3390/genes15020230.


Complete Chloroplast Genome of an Endangered Species , and Its Comparative, Evolutionary, and Phylogenetic Study with Other Section Species.

Li Y, Wang T, Kozlowski G, Liu M, Yi L, Song Y Genes (Basel). 2022; 13(7).

PMID: 35885967 PMC: 9316884. DOI: 10.3390/genes13071184.


Levenshtein Distance, Sequence Comparison and Biological Database Search.

Berger B, Waterman M, Yu Y IEEE Trans Inf Theory. 2021; 67(6):3287-3294.

PMID: 34257466 PMC: 8274556. DOI: 10.1109/tit.2020.2996543.


The complete mitochondrial genome of and its phylogenetic implication.

Yuan Y, He Y, Liu S, Ji X, Qin Y, Wang X Mitochondrial DNA B Resour. 2021; 3(2):1183-1184.

PMID: 33490570 PMC: 7800991. DOI: 10.1080/23802359.2018.1524721.