» Articles » PMID: 38339009

Making Use of Averaging Methods in MODELLER for Protein Structure Prediction

Overview
Journal Int J Mol Sci
Publisher MDPI
Date 2024 Feb 10
PMID 38339009
Authors
Affiliations
Soon will be listed here.
Abstract

Recent advances in protein structure prediction, driven by AlphaFold 2 and machine learning, demonstrate proficiency in static structures but encounter challenges in capturing essential dynamic features crucial for understanding biological function. In this context, homology-based modeling emerges as a cost-effective and computationally efficient alternative. The MODELLER (version 10.5, accessed on 30 November 2023) algorithm can be harnessed for this purpose since it computes intermediate models during simulated annealing, enabling the exploration of attainable configurational states and energies while minimizing its objective function. There have been a few attempts to date to improve the models generated by its algorithm, and in particular, there is no literature regarding the implementation of an averaging procedure involving the intermediate models in the MODELLER algorithm. In this study, we examined MODELLER's output using 225 target-template pairs, extracting the best representatives of intermediate models. Applying an averaging procedure to the selected intermediate structures based on statistical potentials, we aimed to determine: (1) whether averaging improves the quality of structural models during the building phase; (2) if ranking by statistical potentials reliably selects the best models, leading to improved final model quality; (3) whether using a single template versus multiple templates affects the averaging approach; (4) whether the "ensemble" nature of the MODELLER building phase can be harnessed to capture low-energy conformations in holo structures modeling. Our findings indicate that while improvements typically fall short of a few decimal points in the model evaluation metric, a notable fraction of configurations exhibit slightly higher similarity to the native structure than MODELLER's proposed final model. The averaging-building procedure proves particularly beneficial in (1) regions of low sequence identity between the target and template(s), the most challenging aspect of homology modeling; (2) holo protein conformations generation, an area in which MODELLER and related tools usually fall short of the expected performance.

References
1.
Kryshtafovych A, Monastyrskyy B, Fidelis K, Moult J, Schwede T, Tramontano A . Evaluation of the template-based modeling in CASP12. Proteins. 2017; 86 Suppl 1:321-334. PMC: 5877821. DOI: 10.1002/prot.25425. View

2.
Lee C, Su B, Tseng Y . Comparative studies of AlphaFold, RoseTTAFold and Modeller: a case study involving the use of G-protein-coupled receptors. Brief Bioinform. 2022; 23(5). PMC: 9487610. DOI: 10.1093/bib/bbac308. View

3.
Shen M, Sali A . Statistical potential for assessment and prediction of protein structures. Protein Sci. 2006; 15(11):2507-24. PMC: 2242414. DOI: 10.1110/ps.062416606. View

4.
Sala D, Engelberger F, Mchaourab H, Meiler J . Modeling conformational states of proteins with AlphaFold. Curr Opin Struct Biol. 2023; 81:102645. DOI: 10.1016/j.sbi.2023.102645. View

5.
Chae M, Krull F, Knapp E . Optimized distance-dependent atom-pair-based potential DOOP for protein structure prediction. Proteins. 2015; 83(5):881-90. DOI: 10.1002/prot.24782. View