» Articles » PMID: 15496912

Shotgun Sequence Assembly and Recent Segmental Duplications Within the Human Genome

Overview
Journal Nature
Specialty Science
Date 2004 Oct 22
PMID 15496912
Citations 119
Authors
Affiliations
Soon will be listed here.
Abstract

Complex eukaryotic genomes are now being sequenced at an accelerated pace primarily using whole-genome shotgun (WGS) sequence assembly approaches. WGS assembly was initially criticized because of its perceived inability to resolve repeat structures within genomes. Here, we quantify the effect of WGS sequence assembly on large, highly similar repeats by comparison of the segmental duplication content of two different human genome assemblies. Our analysis shows that large (> 15 kilobases) and highly identical (> 97%) duplications are not adequately resolved by WGS assembly. This leads to significant reduction in genome length and the loss of genes embedded within duplications. Comparable analyses of mouse genome assemblies confirm that strict WGS sequence assembly will oversimplify our understanding of mammalian genome structure and evolution; a hybrid strategy using a targeted clone-by-clone approach to resolve duplications is proposed.

Citing Articles

The genomic landscape of 2,023 colorectal cancers.

Cornish A, Gruber A, Kinnersley B, Chubb D, Frangou A, Caravagna G Nature. 2024; 633(8028):127-136.

PMID: 39112709 PMC: 11374690. DOI: 10.1038/s41586-024-07747-9.


Noncoding RNAs in skeletal development and disorders.

Yao Q, He T, Liao J, Liao R, Wu X, Lin L Biol Res. 2024; 57(1):16.

PMID: 38644509 PMC: 11034114. DOI: 10.1186/s40659-024-00497-y.


Inversion polymorphism in a complete human genome assembly.

Porubsky D, Harvey W, Rozanski A, Ebler J, Hops W, Ashraf H Genome Biol. 2023; 24(1):100.

PMID: 37122002 PMC: 10150506. DOI: 10.1186/s13059-023-02919-8.


An efficient CRISPR-Cas9 enrichment sequencing strategy for characterizing complex and highly duplicated genomic regions. A case study in the Prunus salicina LG3-MYB10 genes cluster.

Fiol A, Jurado-Ruiz F, Lopez-Girona E, Aranzana M Plant Methods. 2022; 18(1):105.

PMID: 36030243 PMC: 9419362. DOI: 10.1186/s13007-022-00937-4.


SHIMS 3.0: Highly efficient single-haplotype iterative mapping and sequencing using ultra-long nanopore reads.

Bellott D, Cho T, Jackson E, Skaletsky H, Hughes J, Page D PLoS One. 2022; 17(6):e0269692.

PMID: 35700171 PMC: 9197060. DOI: 10.1371/journal.pone.0269692.