
Functional Tuning of Intrinsically Disordered Regions in Human Proteins by Composition Bias

Overview
Journal Biomolecules
Publisher MDPI
Date 2022 Oct 27
PMID 36291695
Abstract

Intrinsically disordered regions (IDRs) in protein sequences are flexible, have low structural constraints and, as a result, evolve faster. This lack of evolutionary conservation greatly limits the use of sequence homology for the classification and functional assessment of IDRs, in contrast to globular domains. The study of IDRs therefore requires other properties for their classification and functional prediction. While composition bias is not a necessary property of IDRs, compositionally biased regions (CBRs) have been noted as a frequent part of IDRs. We hypothesized that to characterize IDRs, it could be helpful to study their overlap with particular types of CBRs. Here, we evaluate this overlap in the human proteome. Two-thirds of the residues in IDRs overlap CBRs. Considering CBRs enriched in one type of amino acid, we can distinguish CBRs that tend to be fully included within long IDRs (R, H, N, D, P, G) from those that partially overlap shorter IDRs (S, E, K, T) and from others that tend to overlap IDR termini (Q, A). CBRs overlap IDRs more often in nuclear proteins and in proteins involved in liquid-liquid phase separation (LLPS). Study of protein interaction networks reveals the enrichment of CBRs in IDRs by tandem repetition of short linear motifs (rich in S or P), and the existence of E-rich polar regions that could support specific protein interactions with non-specific interactions. Our results open ways to pin down the function of IDRs from their partial compositional biases.
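The overlap measure described in the abstract can be illustrated with a small sketch. The abstract does not say how the regions were predicted or how overlap was scored, so everything below is an assumption made for illustration: regions are given as 1-based inclusive (start, end) residue intervals, and the function names and category labels ("included", "partial", "none") are hypothetical, not the authors' pipeline.

```python
from typing import List, Set, Tuple

# Assumed representation: 1-based, inclusive (start, end) residue positions.
Interval = Tuple[int, int]


def residues(intervals: List[Interval]) -> Set[int]:
    """Expand inclusive intervals into the set of residue positions they cover."""
    covered: Set[int] = set()
    for start, end in intervals:
        covered.update(range(start, end + 1))
    return covered


def idr_fraction_in_cbr(idrs: List[Interval], cbrs: List[Interval]) -> float:
    """Fraction of IDR residues that also fall inside at least one CBR."""
    idr_res = residues(idrs)
    if not idr_res:
        return 0.0
    return len(idr_res & residues(cbrs)) / len(idr_res)


def classify_overlap(cbr: Interval, idr: Interval) -> str:
    """Hypothetical label for how a single CBR sits relative to a single IDR."""
    c_start, c_end = cbr
    i_start, i_end = idr
    if c_end < i_start or c_start > i_end:
        return "none"      # the two regions share no residue
    if i_start <= c_start and c_end <= i_end:
        return "included"  # CBR lies entirely within the IDR
    return "partial"       # CBR extends past at least one IDR boundary


# Toy protein with two IDRs and two CBRs (made-up coordinates).
idrs = [(1, 30), (50, 60)]
cbrs = [(11, 30), (55, 70)]
fraction = idr_fraction_in_cbr(idrs, cbrs)
labels = [classify_overlap(c, i) for c, i in zip(cbrs, idrs)]
```

In this toy case 26 of the 41 IDR residues lie in a CBR, the first CBR is labeled "included" and the second "partial". Expanding intervals into sets keeps the sketch short; for proteome-scale data an interval-merging approach would use less memory.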

Citing Articles

SERBP1 interacts with PARP1 and is present in PARylation-dependent protein complexes regulating splicing, cell division, and ribosome biogenesis.

Breunig K, Lei X, Montalbano M, Guardia G, Ostadrahimi S, Alers V Elife. 2025; 13.

PMID: 39937575 PMC: 11820137. DOI: 10.7554/eLife.98152.


Intrinsically Disordered Compositional Bias in Proteins: Sequence Traits, Region Clustering, and Generation of Hypothetical Functional Associations.

Harrison P Bioinform Biol Insights. 2024; 18:11779322241287485.

PMID: 39417089 PMC: 11481073. DOI: 10.1177/11779322241287485.


Identification of Low-Complexity Domains by Compositional Signatures Reveals Class-Specific Frequencies and Functions Across the Domains of Life.

Cascarina S, Ross E PLoS Comput Biol. 2024; 20(5):e1011372.

PMID: 38748749 PMC: 11132505. DOI: 10.1371/journal.pcbi.1011372.


Evolutionary Study of Protein Short Tandem Repeats in Protein Families.

Mier P, Andrade-Navarro M Biomolecules. 2023; 13(7).

PMID: 37509152 PMC: 10377733. DOI: 10.3390/biom13071116.


Phase separating Rho: a widespread regulatory function of disordered regions in proteins revealed in bacteria.

Schumbera E, Mier P, Andrade-Navarro M Signal Transduct Target Ther. 2023; 8(1):253.

PMID: 37344523 PMC: 10284900. DOI: 10.1038/s41392-023-01505-5.

