PMID: 34265844

Highly Accurate Protein Structure Prediction with AlphaFold

Abstract

Proteins are essential to life, and understanding their structure can facilitate a mechanistic understanding of their function. Through an enormous experimental effort, the structures of around 100,000 unique proteins have been determined, but this represents a small fraction of the billions of known protein sequences. Structural coverage is bottlenecked by the months to years of painstaking effort required to determine a single protein structure. Accurate computational approaches are needed to address this gap and to enable large-scale structural bioinformatics. Predicting the three-dimensional structure that a protein will adopt based solely on its amino acid sequence (the structure prediction component of the 'protein folding problem') has been an important open research problem for more than 50 years. Despite recent progress, existing methods fall far short of atomic accuracy, especially when no homologous structure is available. Here we provide the first computational method that can regularly predict protein structures with atomic accuracy even in cases in which no similar structure is known. We validated an entirely redesigned version of our neural network-based model, AlphaFold, in the challenging 14th Critical Assessment of protein Structure Prediction (CASP14), demonstrating accuracy competitive with experimental structures in a majority of cases and greatly outperforming other methods. Underpinning the latest version of AlphaFold is a novel machine learning approach that incorporates physical and biological knowledge about protein structure, leveraging multi-sequence alignments, into the design of the deep learning algorithm.

Citing Articles

Immunological assessment of NSFu1: A novel fusion molecule constructed from structural proteins of SARS-CoV-2 for improving COVID-19 antibody detection.

Arif S, Akhter M, Anwar A, Javaid S, Ashi Z, Shad M. Arch Microbiol. 2025; 207(4):88.

PMID: 40088274 DOI: 10.1007/s00203-025-04286-3.


Integrating artificial intelligence in drug discovery and early drug development: a transformative approach.

Ocana A, Pandiella A, Privat C, Bravo I, Luengo-Oroz M, Amir E. Biomark Res. 2025; 13(1):45.

PMID: 40087789 DOI: 10.1186/s40364-025-00758-2.


Exploring FAM13A-N-Myc interactions to uncover potential targets in MYCN-amplified neuroblastoma: a study of protein interactions and molecular dynamics simulations.

Yin H, Liu T, Wu D, Li X, Li G, Song W. BMC Cancer. 2025; 25(1):470.

PMID: 40087586 DOI: 10.1186/s12885-025-13903-9.


Septoria tritici blotch resistance gene Stb15 encodes a lectin receptor-like kinase.

Hafeez A, Chartrain L, Feng C, Cambon F, Clarke M, Griffiths S. Nat Plants. 2025.

PMID: 40087541 DOI: 10.1038/s41477-025-01920-2.


De novo design of self-assembling peptides with antimicrobial activity guided by deep learning.

Liu H, Song Z, Zhang Y, Wu B, Chen D, Zhou Z. Nat Mater. 2025.

PMID: 40087536 DOI: 10.1038/s41563-025-02164-3.

