» Articles » PMID: 18366648

A Multi-template Combination Algorithm for Protein Comparative Modeling

Overview
Journal BMC Struct Biol
Publisher Biomed Central
Date 2008 Mar 28
PMID 18366648
Citations 50
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Multiple protein templates are commonly used in manual protein structure prediction. However, few automated algorithms of selecting and combining multiple templates are available.

Results: Here we develop an effective multi-template combination algorithm for protein comparative modeling. The algorithm selects templates according to the similarity significance of the alignments between template and target proteins. It combines the whole template-target alignments whose similarity significance score is close to that of the top template-target alignment within a threshold, whereas it only takes alignment fragments from a less similar template-target alignment that align with a sizable uncovered region of the target. We compare the algorithm with the traditional method of using a single top template on the 45 comparative modeling targets (i.e. easy template-based modeling targets) used in the seventh edition of Critical Assessment of Techniques for Protein Structure Prediction (CASP7). The multi-template combination algorithm improves the GDT-TS scores of predicted models by 6.8% on average. The statistical analysis shows that the improvement is significant (p-value < 10-4). Compared with the ideal approach that always uses the best template, the multi-template approach yields only slightly better performance. During the CASP7 experiment, the preliminary implementation of the multi-template combination algorithm (FOLDpro) was ranked second among 67 servers in the category of high-accuracy structure prediction in terms of GDT-TS measure.

Conclusion: We have developed a novel multi-template algorithm to improve protein comparative modeling.

Citing Articles

Recent Progress of Protein Tertiary Structure Prediction.

Wuyun Q, Chen Y, Shen Y, Cao Y, Hu G, Cui W Molecules. 2024; 29(4).

PMID: 38398585 PMC: 10893003. DOI: 10.3390/molecules29040832.


Contact-Assisted Threading in Low-Homology Protein Modeling.

Bhattacharya S, Roche R, Shuvo M, Moussad B, Bhattacharya D Methods Mol Biol. 2023; 2627:41-59.

PMID: 36959441 PMC: 10340115. DOI: 10.1007/978-1-0716-2974-1_3.


CRFalign: A Sequence-Structure Alignment of Proteins Based on a Combination of HMM-HMM Comparison and Conditional Random Fields.

Lee S, Joo K, Sim S, Lee J, Lee I, Lee J Molecules. 2022; 27(12).

PMID: 35744836 PMC: 9231382. DOI: 10.3390/molecules27123711.


MULTICOM2 open-source protein structure prediction system powered by deep learning and distance prediction.

Wu T, Liu J, Guo Z, Hou J, Cheng J Sci Rep. 2021; 11(1):13155.

PMID: 34162922 PMC: 8222248. DOI: 10.1038/s41598-021-92395-6.


Chemical system biology approach to identify multi-targeting FDA inhibitors for treating COVID-19 and associated health complications.

Naik B, Mattaparthi V, Gupta N, Ojha R, Das P, Singh S J Biomol Struct Dyn. 2021; 40(19):9543-9567.

PMID: 34062110 PMC: 8171008. DOI: 10.1080/07391102.2021.1931451.


References
1.
Tress M, Ezkurdia I, Grana O, Lopez G, Valencia A . Assessment of predictions submitted for the CASP6 comparative modeling category. Proteins. 2005; 61 Suppl 7:27-45. DOI: 10.1002/prot.20720. View

2.
Zhou H, Zhou Y . Quantifying the effect of burial of amino acid residues on protein stability. Proteins. 2003; 54(2):315-22. DOI: 10.1002/prot.10584. View

3.
Westbrook J, Feng Z, Chen L, Yang H, Berman H . The Protein Data Bank and structural genomics. Nucleic Acids Res. 2003; 31(1):489-91. PMC: 165515. DOI: 10.1093/nar/gkg068. View

4.
Sadreyev R, Grishin N . COMPASS: a tool for comparison of multiple protein alignments with assessment of statistical significance. J Mol Biol. 2003; 326(1):317-36. DOI: 10.1016/s0022-2836(02)01371-2. View

5.
Zhang Y, Skolnick J . Automated structure prediction of weakly homologous proteins on a genomic scale. Proc Natl Acad Sci U S A. 2004; 101(20):7594-9. PMC: 419651. DOI: 10.1073/pnas.0305695101. View