» Articles » PMID: 22182830

The Relationship Between Proteome Size, Structural Disorder and Organism Complexity

Overview
Journal Genome Biol
Specialties Biology
Genetics
Date 2011 Dec 21
PMID 22182830
Citations 94
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Sequencing the genomes of the first few eukaryotes created the impression that gene number shows no correlation with organism complexity, often referred to as the G-value paradox. Several attempts have previously been made to resolve this paradox, citing multifunctionality of proteins, alternative splicing, microRNAs or non-coding DNA. As intrinsic protein disorder has been linked with complex responses to environmental stimuli and communication between cells, an additional possibility is that structural disorder may effectively increase the complexity of species.

Results: We revisited the G-value paradox by analyzing many new proteomes whose complexity measured with their number of distinct cell types is known. We found that complexity and proteome size measured by the total number of amino acids correlate significantly and have a power function relationship. We systematically analyzed numerous other features in relation to complexity in several organisms and tissues and found: the fraction of protein structural disorder increases significantly between prokaryotes and eukaryotes but does not further increase over the course of evolution; the number of predicted binding sites in disordered regions in a proteome increases with complexity; the fraction of protein disorder, predicted binding sites, alternative splicing and protein-protein interactions all increase with the complexity of human tissues.

Conclusions: We conclude that complexity is a multi-parametric trait, determined by interaction potential, alternative splicing capacity, tissue-specific protein disorder and, above all, proteome size. The G-value paradox is only apparent when plants are grouped with metazoans, as they have a different relationship between complexity and proteome size.

Citing Articles

Molecular and Functional Convergences Associated with Complex Multicellularity in Eukarya.

Pereira Lobo F, da Costa D, Benjamim D, da Silva T, de Oliveira M Mol Biol Evol. 2025; 42(2).

PMID: 39877976 PMC: 11827588. DOI: 10.1093/molbev/msaf013.


Organismal complexity strongly correlates with the number of protein families and domains.

Alvarez-Ponce D, Krishnamurthy S Proc Natl Acad Sci U S A. 2025; 122(5):e2404332122.

PMID: 39874285 PMC: 11804679. DOI: 10.1073/pnas.2404332122.


Evolution of intrinsic disorder in the structural domains of viral and cellular proteomes.

Mughal F, Caetano-Anolles G Sci Rep. 2025; 15(1):2878.

PMID: 39843714 PMC: 11754631. DOI: 10.1038/s41598-025-86045-4.


PICNIC accurately predicts condensate-forming proteins regardless of their structural disorder across organisms.

Hadarovich A, Singh H, Ghosh S, Scheremetjew M, Rostam N, Hyman A Nat Commun. 2024; 15(1):10668.

PMID: 39663388 PMC: 11634905. DOI: 10.1038/s41467-024-55089-x.


The protein domains of vertebrate species in which selection is more effective have greater intrinsic structural disorder.

Weibel C, Wheeler A, James J, Willis S, McShea H, Masel J Elife. 2024; 12.

PMID: 39239703 PMC: 11379457. DOI: 10.7554/eLife.87335.


References
1.
Murzin A, Brenner S, Hubbard T, Chothia C . SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol. 1995; 247(4):536-40. DOI: 10.1006/jmbi.1995.0159. View

2.
Haynes C, Oldfield C, Ji F, Klitgord N, Cusick M, Radivojac P . Intrinsic disorder is a common feature of hub proteins from four eukaryotic interactomes. PLoS Comput Biol. 2006; 2(8):e100. PMC: 1526461. DOI: 10.1371/journal.pcbi.0020100. View

3.
Hegyi H, Schad E, Tompa P . Structural disorder promotes assembly of protein complexes. BMC Struct Biol. 2007; 7:65. PMC: 2194777. DOI: 10.1186/1472-6807-7-65. View

4.
Smith B, Ashburner M, Rosse C, Bard J, Bug W, Ceusters W . The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration. Nat Biotechnol. 2007; 25(11):1251-5. PMC: 2814061. DOI: 10.1038/nbt1346. View

5.
Dyson H, Wright P . Intrinsically unstructured proteins and their functions. Nat Rev Mol Cell Biol. 2005; 6(3):197-208. DOI: 10.1038/nrm1589. View