MAMMOTH (matching Molecular Models Obtained from Theory): an Automated Method for Model Comparison
Overview
Affiliations
Advances in structural genomics and protein structure prediction require the design of automatic, fast, objective, and well benchmarked methods capable of comparing and assessing the similarity of low-resolution three-dimensional structures, via experimental or theoretical approaches. Here, a new method for sequence-independent structural alignment is presented that allows comparison of an experimental protein structure with an arbitrary low-resolution protein tertiary model. The heuristic algorithm is given and then used to show that it can describe random structural alignments of proteins with different folds with good accuracy by an extreme value distribution. From this observation, a structural similarity score between two proteins or two different conformations of the same protein is derived from the likelihood of obtaining a given structural alignment by chance. The performance of the derived score is then compared with well established, consensus manual-based scores and data sets. We found that the new approach correlates better than other tools with the gold standard provided by a human evaluator. Timings indicate that the algorithm is fast enough for routine use with large databases of protein models. Overall, our results indicate that the new program (MAMMOTH) will be a good tool for protein structure comparisons in structural genomics applications. MAMMOTH is available from our web site at http://physbio.mssm.edu/~ortizg/.
Impact of Alignments on the Accuracy of Protein Subcellular Localization Predictions.
Gillani M, Pollastri G Proteins. 2024; 93(3):745-759.
PMID: 39575640 PMC: 11809130. DOI: 10.1002/prot.26767.
GTalign: spatial index-driven protein structure alignment, superposition, and search.
Margelevicius M Nat Commun. 2024; 15(1):7305.
PMID: 39181863 PMC: 11344802. DOI: 10.1038/s41467-024-51669-z.
LoCoHD: a metric for comparing local environments of proteins.
Fazekas Z, Menyhard D, Perczel A Nat Commun. 2024; 15(1):4029.
PMID: 38740745 PMC: 11091161. DOI: 10.1038/s41467-024-48225-0.
Structome: a tool for the rapid assembly of datasets for structural phylogenetics.
Malik A, Langer D, Verma C, Poole A, Allison J Bioinform Adv. 2023; 3(1):vbad134.
PMID: 38046099 PMC: 10692761. DOI: 10.1093/bioadv/vbad134.
Cinaroglu S, Biggin P J Chem Inf Model. 2023; 63(19):6095-6108.
PMID: 37759363 PMC: 10565830. DOI: 10.1021/acs.jcim.3c01041.