» Articles » PMID: 27334472

Revealing Aperiodic Aspects of Solenoid Proteins from Sequence Information

Overview
Journal Bioinformatics
Specialty Biology
Date 2016 Jun 24
PMID 27334472
Citations 1
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: Repeat proteins, which contain multiple repeats of short sequence motifs, form a large but seldom-studied group of proteins. Methods focusing on the analysis of 3D structures of such proteins identified many subtle effects in length distribution of individual motifs that are important for their functions. However, similar analysis was yet not applied to the vast majority of repeat proteins with unknown 3D structures, mostly because of the extreme diversity of the underlying motifs and the resulting difficulty to detect those.

Results: We developed FAIT, a sequence-based algorithm for the precise assignment of individual repeats in repeat proteins and introduced a framework to classify and compare aperiodicity patterns for large protein families. FAIT extracts repeat positions by post-processing FFAS alignment matrices with image processing methods. On examples of proteins with Leucine Rich Repeat (LRR) domains and other solenoids like proteins, we show that the automated analysis with FAIT correctly identifies exact lengths of individual repeats based entirely on sequence information.

Availability And Implementation: https://github.com/GodzikLab/FAIT CONTACT: adam@godziklab.org

Supplementary Information: Supplementary data are available at Bioinformatics online.

Citing Articles

Propagation of Fibrillar Structural Forms in Proteins Stopped by Naturally Occurring Short Polypeptide Chain Fragments.

Roterman I, Banach M, Konieczny L Pharmaceuticals (Basel). 2017; 10(4).

PMID: 29144442 PMC: 5748646. DOI: 10.3390/ph10040089.

References
1.
Vingron M, Argos P . Motif recognition and alignment for many sequences by comparison of dot-matrices. J Mol Biol. 1991; 218(1):33-43. DOI: 10.1016/0022-2836(91)90871-3. View

2.
Kajava A . Structural diversity of leucine-rich repeat proteins. J Mol Biol. 1998; 277(3):519-27. DOI: 10.1006/jmbi.1998.1643. View

3.
Park K, Shen B, Parmeggiani F, Huang P, Stoddard B, Baker D . Control of repeat-protein curvature by computational protein design. Nat Struct Mol Biol. 2015; 22(2):167-74. PMC: 4318719. DOI: 10.1038/nsmb.2938. View

4.
Andrade M, Perez-Iratxeta C, Ponting C . Protein repeats: structures, functions, and evolution. J Struct Biol. 2001; 134(2-3):117-31. DOI: 10.1006/jsbi.2001.4392. View

5.
Xu D, Jaroszewski L, Li Z, Godzik A . FFAS-3D: improving fold recognition by including optimized structural features and template re-ranking. Bioinformatics. 2013; 30(5):660-7. PMC: 3933871. DOI: 10.1093/bioinformatics/btt578. View