
Structural Genomics and the Protein Data Bank

Overview
Journal: J Biol Chem
Specialty: Biochemistry
Date: 2021 May 6
PMID: 33957120
Citations: 5
Abstract

The field of Structural Genomics arose over the last 3 decades to address a large and rapidly growing divergence between microbial genomic, functional, and structural data. Several international programs took advantage of the vast genomic sequence information and evaluated the feasibility of structure determination for expanded and newly discovered protein families. As a consequence, structural genomics has developed structure-determination pipelines and applied them to a wide range of novel, uncharacterized proteins, often from "microbial dark matter," and later to proteins from human pathogens. Advances were especially needed in protein production and rapid de novo structure solution. The experimental three-dimensional models were promptly made public, facilitating structure determination of other members of the family and helping to understand their molecular and biochemical functions. Improvements in experimental methods and databases resulted in fast progress in molecular and structural biology. The Protein Data Bank structure repository played a central role in the coordination of structural genomics efforts and the structural biology community as a whole. It facilitated development of standards and validation tools essential for maintaining high quality of deposited structural data.

Citing Articles

20 years of crystal hits: progress and promise in ultrahigh-throughput crystallization screening.

Lynch M, Snell M, Potter S, Snell E, Bowman S. Acta Crystallogr D Struct Biol. 2023; 79(Pt 3):198-205.

PMID: 36876429 PMC: 9986797. DOI: 10.1107/S2059798323001274.


Unsupervised Machine Learning Organization of the Functional Dark Proteome of Gram-Negative "Superbugs": Six Protein Clusters Amenable for Distinct Scientific Applications.

Sicilia C, Corral-Lugo A, Smialowski P, McConnell M, Martin-Galiano A. ACS Omega. 2022; 7(50):46131-46145.

PMID: 36570227 PMC: 9774411. DOI: 10.1021/acsomega.2c04076.


RCSB Protein Data Bank (RCSB.org): delivery of experimentally-determined PDB structures alongside one million computed structure models of proteins from artificial intelligence/machine learning.

Burley S, Bhikadiya C, Bi C, Bittrich S, Chao H, Chen L. Nucleic Acids Res. 2022; 51(D1):D488-D508.

PMID: 36420884 PMC: 9825554. DOI: 10.1093/nar/gkac1077.


Protein Data Bank: A Comprehensive Review of 3D Structure Holdings and Worldwide Utilization by Researchers, Educators, and Students.

Burley S, Berman H, Duarte J, Feng Z, Flatt J, Hudson B. Biomolecules. 2022; 12(10).

PMID: 36291635 PMC: 9599165. DOI: 10.3390/biom12101425.


Capturing the geometry, function, and evolution of enzymes with 3D templates.

Riziotis I, Thornton J. Protein Sci. 2022; 31(7):e4363.

PMID: 35762726 PMC: 9207746. DOI: 10.1002/pro.4363.
