Comparing Vertebrate Whole-genome Shotgun Reads to the Human Genome

Overview
Journal Genome Res
Specialty Genetics
Date 2001 Nov 3
PMID 11691844
Citations 5
Abstract

Multi-species sequence comparisons are a very efficient way to reveal conserved genes. Because sequence finishing is expensive and time consuming, many genome sequences are likely to stay incomplete. A challenge is to use these fragmented data for understanding the human genome. Methods for using cross-species whole-genome shotgun sequence (WGS) for genome annotation are described in this paper. About one-half million high-quality rat WGS reads (covering 7.5% of the rat genome) generated at the Baylor College of Medicine Human Genome Sequencing Center were compared with the human genome. Using computer-generated random reads as a negative control, a set of parameters was determined for reliable interpretation of BLAST search results. About 10% of the rat reads contain regions that are conserved in the human genomic sequence and about one-third of these include known gene-coding regions. Mapping the conserved regions to human chromosomes showed a 23-fold enrichment for coding regions compared with noncoding regions. This approach can also be applied to other mammalian genomes for gene finding. These data predicted approximately 42,500 genes in the human, slightly more than reported previously.
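The negative-control idea in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure: the abstract does not state which BLAST parameters were tuned or what error rate was tolerated. The function names `random_read` and `pick_threshold`, the uniform base composition, and the `max_false_positive_rate` parameter are all assumptions made for illustration, and the scores stand in for whatever alignment score a real search would report.

```python
import random


def random_read(length, rng, alphabet="ACGT"):
    """Return one computer-generated read (assumption: uniform base composition)."""
    return "".join(rng.choice(alphabet) for _ in range(length))


def pick_threshold(control_scores, max_false_positive_rate):
    """Return the smallest score cutoff at which the fraction of control
    (random-read) scores at or above the cutoff is within the allowed rate.

    Hypothetical helper: the scores are assumed to come from searching the
    random reads against the same database as the real reads.
    """
    if not control_scores:
        raise ValueError("control_scores must not be empty")
    total = len(control_scores)
    candidates = sorted(set(control_scores))
    for cutoff in candidates:
        passing = sum(1 for score in control_scores if score >= cutoff)
        if passing / total <= max_false_positive_rate:
            return cutoff
    # No observed score is strict enough, so require more than the maximum.
    return candidates[-1] + 1


def fraction_passing(scores, cutoff):
    """Fraction of scores that meet or exceed the cutoff."""
    return sum(1 for score in scores if score >= cutoff) / len(scores)


if __name__ == "__main__":
    rng = random.Random(0)
    print(random_read(20, rng))  # one simulated control read
    # Made-up control scores, for illustration only.
    control = [10, 12, 15, 20, 30, 31, 33, 35, 40, 90]
    print(pick_threshold(control, 0.2))
```

Applying the chosen cutoff to the real reads with `fraction_passing` then gives the share of reads treated as conserved, under the assumption that random reads approximate the background of spurious matches.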

Citing Articles

Long Noncoding RNA LIFR-AS1: A New Player in Human Cancers.

Bai Z, Wang X, Zhang Z Biomed Res Int. 2022; 2022:1590815.

PMID: 35071590 PMC: 8776453. DOI: 10.1155/2022/1590815.


Characteristics of the tomato nuclear genome as determined by sequencing undermethylated EcoRI digested fragments.

Wang Y, van der Hoeven R, Nielsen R, Mueller L, Tanksley S Theor Appl Genet. 2005; 112(1):72-84.

PMID: 16208505 DOI: 10.1007/s00122-005-0107-z.


An initial strategy for the systematic identification of functional elements in the human genome by low-redundancy comparative sequencing.

Margulies E, Vinson J, Miller W, Jaffe D, Lindblad-Toh K, Chang J Proc Natl Acad Sci U S A. 2005; 102(13):4795-800.

PMID: 15778292 PMC: 555705. DOI: 10.1073/pnas.0409882102.


Strategies and tools for whole-genome alignments.

Couronne O, Poliakov A, Bray N, Ishkhanov T, Ryaboy D, Rubin E Genome Res. 2003; 13(1):73-80.

PMID: 12529308 PMC: 430965. DOI: 10.1101/gr.762503.


Parallel construction of orthologous sequence-ready clone contig maps in multiple species.

Thomas J, Prasad A, Summers T, Lee-Lin S, Maduro V, Idol J Genome Res. 2002; 12(8):1277-85.

PMID: 12176935 PMC: 186643. DOI: 10.1101/gr.283202.
