» Articles » PMID: 11790845

Persistently Conserved Positions in Structurally Similar, Sequence Dissimilar Proteins: Roles in Preserving Protein Fold and Function

Overview
Journal Protein Sci
Specialty Biochemistry
Date 2002 Jan 16
PMID 11790845
Citations 26
Authors
Affiliations
Soon will be listed here.
Abstract

Many protein pairs that share the same fold do not have any detectable sequence similarity, providing a valuable source of information for studying sequence-structure relationship. In this study, we use a stringent data set of structurally similar, sequence-dissimilar protein pairs to characterize residues that may play a role in the determination of protein structure and/or function. For each protein in the database, we identify amino-acid positions that show residue conservation within both close and distant family members. These positions are termed "persistently conserved". We then proceed to determine the "mutually" persistently conserved (MPC) positions: those structurally aligned positions in a protein pair that are persistently conserved in both pair mates. Because of their intra- and interfamily conservation, these positions are good candidates for determining protein fold and function. We find that 45% of the persistently conserved positions are mutually conserved. A significant fraction of them are located in critical positions for secondary structure determination, they are mostly buried, and many of them form spatial clusters within their protein structures. A substitution matrix based on the subset of MPC positions shows two distinct characteristics: (i) it is different from other available matrices, even those that are derived from structural alignments; (ii) its relative entropy is high, emphasizing the special residue restrictions imposed on these positions. Such a substitution matrix should be valuable for protein design experiments.

Citing Articles

Sequence-structure-function characterization of the emerging tetracycline destructase family of antibiotic resistance enzymes.

Blake K, Kumar H, Loganathan A, Williford E, Diorio-Toth L, Xue Y Commun Biol. 2024; 7(1):336.

PMID: 38493211 PMC: 10944477. DOI: 10.1038/s42003-024-06023-w.


Precision enzyme discovery through targeted mining of metagenomic data.

Ariaeenejad S, Gharechahi J, Foroozandeh Shahraki M, Atanaki F, Han J, Ding X Nat Prod Bioprospect. 2024; 14(1):7.

PMID: 38200389 PMC: 10781932. DOI: 10.1007/s13659-023-00426-8.


Identification of 121 variants of honey bee Vitellogenin protein sequences with structural differences at functional sites.

Leipart V, Ludvigsen J, Kent M, Sandve S, To T, Arnyasi M Protein Sci. 2022; 31(7):e4369.

PMID: 35762708 PMC: 9207902. DOI: 10.1002/pro.4369.


Deep Neural Network-Assisted Drug Recommendation Systems for Identifying Potential Drug-Target Interactions.

Kalakoti Y, Yadav S, Sundar D ACS Omega. 2022; 7(14):12138-12146.

PMID: 35449922 PMC: 9016825. DOI: 10.1021/acsomega.2c00424.


Structure prediction of honey bee vitellogenin: a multi-domain protein important for insect immunity.

Leipart V, Montserrat-Canals M, Cunha E, Luecke H, Herrero-Galan E, Halskau O FEBS Open Bio. 2021; 12(1):51-70.

PMID: 34665931 PMC: 8727950. DOI: 10.1002/2211-5463.13316.


References
1.
Markiewicz P, Kleina L, Cruz C, Ehret S, Miller J . Genetic studies of the lac repressor. XIV. Analysis of 4000 altered Escherichia coli lac repressors reveals essential and non-essential residues, as well as "spacers" which do not require a specific sequence. J Mol Biol. 1994; 240(5):421-33. DOI: 10.1006/jmbi.1994.1458. View

2.
Brenner S, Levitt M . Expectations from structural genomics. Protein Sci. 2000; 9(1):197-200. PMC: 2144435. DOI: 10.1110/ps.9.1.197. View

3.
Orengo C, Jones D, Thornton J . Protein superfamilies and domain superfolds. Nature. 1994; 372(6507):631-4. DOI: 10.1038/372631a0. View

4.
Kennes C, Pries F, Krooshof G, Bokma E, Kingma J, Janssen D . Replacement of tryptophan residues in haloalkane dehalogenase reduces halide binding and catalytic activity. Eur J Biochem. 1995; 228(2):403-7. View

5.
Murzin A, Brenner S, Hubbard T, Chothia C . SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol. 1995; 247(4):536-40. DOI: 10.1006/jmbi.1995.0159. View