What Fraction of the Human Genome is Functional?

Overview
Journal Genome Res
Specialty Genetics
Date 2011 Aug 31
PMID 21875934
Citations 87
Abstract

Many evolutionary studies over the past decade have estimated α(sel), the proportion of all nucleotides in the human genome that are subject to purifying selection because of their biological function. Most of these studies have estimated nucleotide substitution rates from genome sequence alignments across many diverse mammals. Some α(sel) estimates will be affected by the heterogeneity of substitution rates in neutral sequence across the genome. Most will also be inaccurate if change in the functional sequence repertoire occurs rapidly relative to the separation of the lineages being compared. Evidence gathered from both evolutionary and experimental analyses now indicates that rates of "turnover" of functional, predominantly noncoding, sequence are indeed high. They are sufficiently high that an estimated 50% of mouse constrained noncoding sequence is predicted not to be shared with rat, a closely related rodent. The rapidity of turnover results in at least a twofold underestimate of α(sel) by analyses that measure constraint across the eutherian phylogeny. Approaches that take account of turnover estimate that the steady-state value of α(sel) lies between 10% and 15%. Experimental studies corroborate the predicted rates of loss and gain of noncoding functional sites. These studies show the limitations inherent in the use of deep sequence conservation for identifying functional sequence. Experimental investigations focusing on lineage-specific, noncoding, functional sequence are now essential if we are to appreciate the complete functional repertoire of the human genome.
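
The twofold figure in the abstract can be read as simple arithmetic. The sketch below is a toy illustration, not the authors' method. It assumes that a conservation-based analysis detects only the functional sequence still shared across the compared lineages, so the apparent value is the steady-state α(sel) multiplied by that shared fraction. The function names are hypothetical, and the shared fraction of 0.5 for a eutherian-wide comparison is an assumption chosen to match the "at least twofold" statement (the abstract's 50% figure itself refers to mouse versus rat).

```python
def apparent_alpha_sel(steady_state_alpha: float, shared_fraction: float) -> float:
    """Constrained fraction a deep-conservation analysis would report (toy model).

    steady_state_alpha: fraction of the genome under purifying selection
        at any one time (the abstract gives 0.10 to 0.15).
    shared_fraction: assumed fraction of functional sequence still shared
        across the compared lineages (hypothetical parameter).
    """
    if not 0.0 < shared_fraction <= 1.0:
        raise ValueError("shared_fraction must be in (0, 1]")
    return steady_state_alpha * shared_fraction


def underestimate_factor(shared_fraction: float) -> float:
    """How many times smaller the apparent value is than the steady-state value."""
    if not 0.0 < shared_fraction <= 1.0:
        raise ValueError("shared_fraction must be in (0, 1]")
    return 1.0 / shared_fraction


if __name__ == "__main__":
    shared = 0.5  # assumption: half of functional sequence is shared
    for alpha in (0.10, 0.15):
        apparent = apparent_alpha_sel(alpha, shared)
        print(f"steady-state {alpha:.2%} -> apparent {apparent:.2%} "
              f"(underestimated {underestimate_factor(shared):.1f}x)")
```

Under these assumptions, a smaller shared fraction gives a larger underestimate, which is why the abstract describes twofold as a lower bound.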

Citing Articles

Whole-genome sequencing analysis identifies rare, large-effect noncoding variants and regulatory regions associated with circulating protein levels.

Hawkes G, Chundru K, Jackson L, Patel K, Murray A, Wood A Nat Genet. 2025; 57(3):626-634.

PMID: 39994471 PMC: 11906349. DOI: 10.1038/s41588-025-02095-4.


Whole-genome sequencing in 333,100 individuals reveals rare non-coding single variant and aggregate associations with height.

Hawkes G, Beaumont R, Li Z, Mandla R, Li X, Albert C Nat Commun. 2024; 15(1):8549.

PMID: 39362880 PMC: 11450065. DOI: 10.1038/s41467-024-52579-w.


Predicting the Effect of miRNA on Gene Regulation to Foster Translational Multi-Omics Research-A Review on the Role of Super-Enhancers.

Das S, Rai S Noncoding RNA. 2024; 10(4).

PMID: 39195574 PMC: 11357235. DOI: 10.3390/ncrna10040045.


A Unifying Hypothesis for the Genome Dynamics Proposed to Underlie Neuropsychiatric Phenotypes.

Gericke G Genes (Basel). 2024; 15(4).

PMID: 38674405 PMC: 11049865. DOI: 10.3390/genes15040471.


Regulatory activity is the default DNA state in eukaryotes.

Luthra I, Jensen C, Chen X, Salaudeen A, Rafi A, de Boer C Nat Struct Mol Biol. 2024; 31(3):559-567.

PMID: 38448573 DOI: 10.1038/s41594-024-01235-4.

