A Structural-alphabet-based Strategy for Finding Structural Motifs Across Protein Families
Proteins with insignificant sequence and overall structure similarity may still share locally conserved contiguous structural segments, i.e., structural (3D) motifs. Most methods for finding 3D motifs require either a known motif with which to search for similar structures, or prior knowledge of functionally or structurally crucial residues. Here, a fully automated method is presented that requires neither a query motif nor essential residues: based on a 16-letter structural alphabet, it discovers 3D motifs of various sizes across protein families with different folds. The method was applied to structurally non-redundant proteins bound to DNA, to RNA, or to obligate/non-obligate protein partners, as well as to free DNA-binding proteins (DBPs) and to proteins with known structures but unknown function. Its usefulness was illustrated by analyzing the 3D motifs found in DBPs. A non-specific motif was found with a 'corner' architecture that confers a stable scaffold and enables diverse interactions, making it suitable for binding not only DNA but also RNA and proteins. In addition, DNA-specific motifs present only in DBPs were discovered. The motifs found can provide useful guidelines for detecting binding sites and for computational protein redesign.
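To make the idea of a structural-alphabet search concrete, the following is a minimal sketch and not the authors' algorithm. It assumes each protein backbone has already been encoded as a string over 16 letters (taken here, as an assumption, to be `a` through `p`), and it reports contiguous segments of a fixed length that occur in at least a given number of families. The function name `shared_segments`, the family labels, and the encoded strings are all hypothetical, and exact string matching is a simplification of whatever similarity criterion the actual method uses.

```python
from collections import defaultdict

# Assumption: the 16 structural-alphabet letters are 'a'..'p'.
ALPHABET = set("abcdefghijklmnop")


def shared_segments(families, k, min_families=2):
    """Return {segment: set of families} for every length-k segment
    found in at least `min_families` distinct families.

    `families` maps a family label to a list of structural-alphabet
    strings, one string per protein chain.
    """
    seen = defaultdict(set)
    for family, strings in families.items():
        for s in strings:
            if not set(s) <= ALPHABET:
                raise ValueError(f"unexpected letter in {s!r}")
            # Slide a window of length k along the encoded backbone.
            for i in range(len(s) - k + 1):
                seen[s[i:i + k]].add(family)
    return {seg: fams for seg, fams in seen.items()
            if len(fams) >= min_families}


# Hypothetical encoded structures, for illustration only.
families = {
    "famA": ["mmmmnopacd", "ddfklmmmm"],
    "famB": ["bccdfklpac"],
    "famC": ["aafklmm"],
}
result = shared_segments(families, k=4)
for segment in sorted(result):
    print(segment, sorted(result[segment]))
```

Varying `k` gives candidate motifs of different sizes, and restricting `families` to one functional class (for example DBPs only) versus all classes is one simple way to separate class-specific candidates from non-specific ones.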
Entropy Analysis of Protein Sequences Reveals a Hierarchical Organization.
Anashkina A, Petrushanko I, Ziganshin R, Orlov Y, Nekrasov A Entropy (Basel). 2021; 23(12).
PMID: 34945953 PMC: 8700119. DOI: 10.3390/e23121647.
Knowledge-based prediction of protein backbone conformation using a structural alphabet.
Vetrivel I, Mahajan S, Tyagi M, Hoffmann L, Sanejouand Y, Srinivasan N PLoS One. 2017; 12(11):e0186215.
PMID: 29161266 PMC: 5697859. DOI: 10.1371/journal.pone.0186215.
Protein flexibility in the light of structural alphabets.
Craveur P, Joseph A, Esque J, Narwani T, Noel F, Shinada N Front Mol Biosci. 2015; 2:20.
PMID: 26075209 PMC: 4445325. DOI: 10.3389/fmolb.2015.00020.
Homopharma: a new concept for exploring the molecular binding mechanisms and drug repurposing.
Chiu Y, Tseng J, Liu K, Lin C, Hsu K, Yang J BMC Genomics. 2014; 15 Suppl 9:S8.
PMID: 25521038 PMC: 4290623. DOI: 10.1186/1471-2164-15-S9-S8.
Use of a structural alphabet to find compatible folds for amino acid sequences.
Mahajan S, de Brevern A, Sanejouand Y, Srinivasan N, Offmann B Protein Sci. 2014; 24(1):145-53.
PMID: 25297700 PMC: 4282420. DOI: 10.1002/pro.2581.