A Multi-template Combination Algorithm for Protein Comparative Modeling
Overview
Molecular Biology
Authors
Affiliations
Background: Multiple protein templates are commonly used in manual protein structure prediction. However, few automated algorithms of selecting and combining multiple templates are available.
Results: Here we develop an effective multi-template combination algorithm for protein comparative modeling. The algorithm selects templates according to the similarity significance of the alignments between template and target proteins. It combines the whole template-target alignments whose similarity significance score is close to that of the top template-target alignment within a threshold, whereas it only takes alignment fragments from a less similar template-target alignment that align with a sizable uncovered region of the target. We compare the algorithm with the traditional method of using a single top template on the 45 comparative modeling targets (i.e. easy template-based modeling targets) used in the seventh edition of Critical Assessment of Techniques for Protein Structure Prediction (CASP7). The multi-template combination algorithm improves the GDT-TS scores of predicted models by 6.8% on average. The statistical analysis shows that the improvement is significant (p-value < 10-4). Compared with the ideal approach that always uses the best template, the multi-template approach yields only slightly better performance. During the CASP7 experiment, the preliminary implementation of the multi-template combination algorithm (FOLDpro) was ranked second among 67 servers in the category of high-accuracy structure prediction in terms of GDT-TS measure.
Conclusion: We have developed a novel multi-template algorithm to improve protein comparative modeling.
Recent Progress of Protein Tertiary Structure Prediction.
Wuyun Q, Chen Y, Shen Y, Cao Y, Hu G, Cui W Molecules. 2024; 29(4).
PMID: 38398585 PMC: 10893003. DOI: 10.3390/molecules29040832.
Contact-Assisted Threading in Low-Homology Protein Modeling.
Bhattacharya S, Roche R, Shuvo M, Moussad B, Bhattacharya D Methods Mol Biol. 2023; 2627:41-59.
PMID: 36959441 PMC: 10340115. DOI: 10.1007/978-1-0716-2974-1_3.
Lee S, Joo K, Sim S, Lee J, Lee I, Lee J Molecules. 2022; 27(12).
PMID: 35744836 PMC: 9231382. DOI: 10.3390/molecules27123711.
Wu T, Liu J, Guo Z, Hou J, Cheng J Sci Rep. 2021; 11(1):13155.
PMID: 34162922 PMC: 8222248. DOI: 10.1038/s41598-021-92395-6.
Naik B, Mattaparthi V, Gupta N, Ojha R, Das P, Singh S J Biomol Struct Dyn. 2021; 40(19):9543-9567.
PMID: 34062110 PMC: 8171008. DOI: 10.1080/07391102.2021.1931451.