» Articles » PMID: 38659920

Biophysical Characterization of High-confidence, Small Human Proteins

Overview
Journal bioRxiv
Date 2024 Apr 25
PMID 38659920
Authors
Affiliations
Soon will be listed here.
Abstract

Significant efforts have been made to characterize the biophysical properties of proteins. Small proteins have received less attention because their annotation has historically been less reliable. However, recent improvements in sequencing, proteomics, and bioinformatics techniques have led to the high-confidence annotation of small open reading frames (smORFs) that encode for functional proteins, producing smORF-encoded proteins (SEPs). SEPs have been found to perform critical functions in several species, including humans. While significant efforts have been made to annotate SEPs, less attention has been given to the biophysical properties of these proteins. We characterized the distributions of predicted and curated biophysical properties, including sequence composition, structure, localization, function, and disease association of a conservative list of previously identified human SEPs. We found significant differences between SEPs and both larger proteins and control sets. Additionally, we provide an example of how our characterization of biophysical properties can contribute to distinguishing protein-coding smORFs from non-coding ones in otherwise ambiguous cases.

References
1.
Franzmann T, Alberti S . Prion-like low-complexity sequences: Key regulators of protein solubility and phase behavior. J Biol Chem. 2018; 294(18):7128-7136. PMC: 6509491. DOI: 10.1074/jbc.TM118.001190. View

2.
Amberger J, Bocchini C, Schiettecatte F, Scott A, Hamosh A . OMIM.org: Online Mendelian Inheritance in Man (OMIM®), an online catalog of human genes and genetic disorders. Nucleic Acids Res. 2014; 43(Database issue):D789-98. PMC: 4383985. DOI: 10.1093/nar/gku1205. View

3.
Andrews S, Rothnagel J . Emerging evidence for functional peptides encoded by short open reading frames. Nat Rev Genet. 2014; 15(3):193-204. DOI: 10.1038/nrg3520. View

4.
. UniProt: the universal protein knowledgebase in 2021. Nucleic Acids Res. 2020; 49(D1):D480-D489. PMC: 7778908. DOI: 10.1093/nar/gkaa1100. View

5.
Ibrahim A, Khaodeuanepheng N, Amarasekara D, Correia J, Lewis K, Fitzkee N . Intrinsically disordered regions that drive phase separation form a robustly distinct protein class. J Biol Chem. 2022; 299(1):102801. PMC: 9860499. DOI: 10.1016/j.jbc.2022.102801. View