Persistently Conserved Positions in Structurally Similar, Sequence Dissimilar Proteins: Roles in Preserving Protein Fold and Function
Overview
Authors
Affiliations
Many protein pairs that share the same fold do not have any detectable sequence similarity, providing a valuable source of information for studying sequence-structure relationship. In this study, we use a stringent data set of structurally similar, sequence-dissimilar protein pairs to characterize residues that may play a role in the determination of protein structure and/or function. For each protein in the database, we identify amino-acid positions that show residue conservation within both close and distant family members. These positions are termed "persistently conserved". We then proceed to determine the "mutually" persistently conserved (MPC) positions: those structurally aligned positions in a protein pair that are persistently conserved in both pair mates. Because of their intra- and interfamily conservation, these positions are good candidates for determining protein fold and function. We find that 45% of the persistently conserved positions are mutually conserved. A significant fraction of them are located in critical positions for secondary structure determination, they are mostly buried, and many of them form spatial clusters within their protein structures. A substitution matrix based on the subset of MPC positions shows two distinct characteristics: (i) it is different from other available matrices, even those that are derived from structural alignments; (ii) its relative entropy is high, emphasizing the special residue restrictions imposed on these positions. Such a substitution matrix should be valuable for protein design experiments.
Blake K, Kumar H, Loganathan A, Williford E, Diorio-Toth L, Xue Y Commun Biol. 2024; 7(1):336.
PMID: 38493211 PMC: 10944477. DOI: 10.1038/s42003-024-06023-w.
Precision enzyme discovery through targeted mining of metagenomic data.
Ariaeenejad S, Gharechahi J, Foroozandeh Shahraki M, Atanaki F, Han J, Ding X Nat Prod Bioprospect. 2024; 14(1):7.
PMID: 38200389 PMC: 10781932. DOI: 10.1007/s13659-023-00426-8.
Leipart V, Ludvigsen J, Kent M, Sandve S, To T, Arnyasi M Protein Sci. 2022; 31(7):e4369.
PMID: 35762708 PMC: 9207902. DOI: 10.1002/pro.4369.
Kalakoti Y, Yadav S, Sundar D ACS Omega. 2022; 7(14):12138-12146.
PMID: 35449922 PMC: 9016825. DOI: 10.1021/acsomega.2c00424.
Leipart V, Montserrat-Canals M, Cunha E, Luecke H, Herrero-Galan E, Halskau O FEBS Open Bio. 2021; 12(1):51-70.
PMID: 34665931 PMC: 8727950. DOI: 10.1002/2211-5463.13316.