Protein Structure Determination by Exhaustive Search of Protein Data Bank Derived Databases

Overview
Specialty Science
Date 2010 Nov 25
PMID 21098306
Citations 34
Abstract

Parallel sequence and structure alignment tools have become ubiquitous and invaluable at all levels in the study of biological systems. We demonstrate the application and utility of this same parallel search paradigm to the process of protein structure determination, benefitting from the large and growing corpus of known structures. Such searches were previously computationally intractable. Through the method of Wide Search Molecular Replacement, developed here, they can be completed in a few hours with the aid of national-scale federated cyberinfrastructure. By dramatically expanding the range of models considered for structure determination, we show that small (less than 12% structural coverage) and low sequence identity (less than 20% identity) template structures can be identified through multidimensional template scoring metrics and used for structure determination. Many new macromolecular complexes can benefit significantly from such a technique because they lack known homologous protein folds or sequences. We demonstrate the effectiveness of the method by determining the structure of a full-length p97 homologue from Trichoplusia ni. Example cases with the MHC/T-cell receptor complex and the EmoB protein provide systematic estimates of the minimum sequence identity, structure coverage, and structural similarity required for this method to succeed. We describe how this structure-search approach and other novel computationally intensive workflows are made tractable through integration with the US national computational cyberinfrastructure, allowing, for example, rapid processing of the entire Structural Classification of Proteins protein fragment database.
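
The search pattern the abstract describes (many independent trials run in parallel, then a comparison of each template across more than one metric) can be pictured with the minimal sketch below. It is not the authors' implementation. The function names `trial_replacement` and `wide_search`, the two metrics `metric_a` and `metric_b`, the z-score cutoff, and the toy records are all assumptions made for illustration; a real run would invoke a molecular replacement program for each template and would distribute jobs across grid resources rather than local threads.

```python
from concurrent.futures import ThreadPoolExecutor
from statistics import mean, pstdev


def trial_replacement(template):
    """Hypothetical stand-in for one molecular replacement trial.

    A real trial would run an MR program against the diffraction data.
    Here the two metrics are read straight from the toy record so that
    the sketch stays runnable.
    """
    return {"id": template["id"],
            "metric_a": template["a"],
            "metric_b": template["b"]}


def z_scores(values):
    """Standardize values against the population of all trials."""
    mu, sigma = mean(values), pstdev(values)
    return [0.0 if sigma == 0 else (v - mu) / sigma for v in values]


def wide_search(templates, workers=4, z_cutoff=2.0):
    """Score every template, then keep those that stand out in BOTH metrics."""
    # Trials are independent of one another, so they can be fanned out freely.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = list(pool.map(trial_replacement, templates))

    za = z_scores([r["metric_a"] for r in results])
    zb = z_scores([r["metric_b"] for r in results])

    # A template is a candidate only if it is an outlier in both dimensions.
    hits = [(r["id"], a, b)
            for r, a, b in zip(results, za, zb)
            if a >= z_cutoff and b >= z_cutoff]
    return sorted(hits, key=lambda h: h[1] + h[2], reverse=True)


if __name__ == "__main__":
    # Toy data: twenty unremarkable templates and one that stands out.
    toy = [{"id": f"bg{i}", "a": 1.0 + (i % 3) * 0.1, "b": 2.0 + (i % 2) * 0.1}
           for i in range(20)]
    toy.append({"id": "candidate", "a": 10.0, "b": 12.0})
    for template_id, z_a, z_b in wide_search(toy):
        print(template_id, round(z_a, 2), round(z_b, 2))
```

Requiring a template to be an outlier in both metrics is only one simple way to combine scores; it is used here because a weak template may look unremarkable on any single metric yet separate from the background when several are considered together.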

Citing Articles

Introduction of the Capsules environment to support further growth of the SBGrid structural biology software collection.

Herre C, Ho A, Eisenbraun B, Vincent J, Nicholson T, Boutsioukis G Acta Crystallogr D Struct Biol. 2024; 80(Pt 6):439-450.

PMID: 38832828 PMC: 11154594. DOI: 10.1107/S2059798324004881.


Siderophore-mediated iron acquisition by .

Kumar A, Chakravorty S, Yang T, Russo T, Newton S, Klebba P J Bacteriol. 2024; 206(5):e0002424.

PMID: 38591913 PMC: 11112993. DOI: 10.1128/jb.00024-24.


Fluorescent Binding Protein Sensors for Detection and Quantification of Biochemicals, Metabolites, and Natural Products.

Newton S, Klebba P Bio Protoc. 2022; 12(22).

PMID: 36532683 PMC: 9724013. DOI: 10.21769/BioProtoc.4543.


CCP4 Cloud for structure determination and project management in macromolecular crystallography.

Krissinel E, Lebedev A, Uski V, Ballard C, Keegan R, Kovalevskiy O Acta Crystallogr D Struct Biol. 2022; 78(Pt 9):1079-1089.

PMID: 36048148 PMC: 9435598. DOI: 10.1107/S2059798322007987.


: a neural-network-based approach for identification of unknown proteins in X-ray crystallography and cryo-EM.

Chojnowski G, Simpkin A, Leonardo D, Seifert-Davila W, Vivas-Ruiz D, Keegan R IUCrJ. 2022; 9(Pt 1):86-97.

PMID: 35059213 PMC: 8733886. DOI: 10.1107/S2052252521011088.

